mirror of
https://github.com/vale981/ray
synced 2025-03-10 05:16:49 -04:00

* Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder * Deleted unneccesary vtrace.py file * Update pong-impala.yaml * Cleaned PPO Code * Update pong-impala.yaml * Update pong-impala.yaml * wip * new ifle * refactor * add vtrace off option * revert * support any space * docs * fix comment * remove kl * Update cartpole-appo-vtrace.yaml
13 lines
294 B
YAML
13 lines
294 B
YAML
cartpole-appo-vt:
|
|
env: CartPole-v0
|
|
run: APPO
|
|
stop:
|
|
episode_reward_mean: 100
|
|
timesteps_total: 100000
|
|
config:
|
|
sample_batch_size: 10
|
|
train_batch_size: 10
|
|
num_envs_per_worker: 5
|
|
num_workers: 1
|
|
num_gpus: 0
|
|
vtrace: true
|