ray/python/ray/rllib/tuned_examples/regression_tests/cartpole-appo-vtrace.yaml
Michael Luo 16f7ca45e4 Appo (#3779)
* Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder

* Deleted unnecessary vtrace.py file

* Update pong-impala.yaml

* Cleaned PPO Code

* Update pong-impala.yaml

* Update pong-impala.yaml

* wip

* new file

* refactor

* add vtrace off option

* revert

* support any space

* docs

* fix comment

* remove kl

* Update cartpole-appo-vtrace.yaml
2019-01-18 13:40:26 -08:00

13 lines
294 B
YAML

cartpole-appo-vt:
    env: CartPole-v0
    run: APPO
    stop:
        episode_reward_mean: 100
        timesteps_total: 100000
    config:
        sample_batch_size: 10
        train_batch_size: 10
        num_envs_per_worker: 5
        num_workers: 1
        num_gpus: 0
        vtrace: true
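
The experiment spec above can be mirrored as a plain Python dict to make its nesting explicit. This is a minimal sketch, not RLlib's own loader: the names `EXPERIMENT` and `should_stop` are hypothetical, and the "stop as soon as any criterion is reached" rule with `>=` comparison is an assumption about how the `stop` block is interpreted.

```python
# Hypothetical Python mirror of the YAML spec; only the keys and values
# come from the file, the variable and function names are illustrative.
EXPERIMENT = {
    "cartpole-appo-vt": {
        "env": "CartPole-v0",
        "run": "APPO",
        "stop": {
            "episode_reward_mean": 100,
            "timesteps_total": 100000,
        },
        "config": {
            "sample_batch_size": 10,
            "train_batch_size": 10,
            "num_envs_per_worker": 5,
            "num_workers": 1,
            "num_gpus": 0,
            "vtrace": True,
        },
    }
}


def should_stop(result, stop):
    """Return True once any stop criterion is met.

    Assumption: a criterion counts as met when the reported metric is
    greater than or equal to its threshold; metrics missing from the
    result are treated as not yet reached.
    """
    return any(
        result.get(key, float("-inf")) >= limit
        for key, limit in stop.items()
    )


if __name__ == "__main__":
    stop = EXPERIMENT["cartpole-appo-vt"]["stop"]
    # Reward target reached well before the timestep budget runs out.
    print(should_stop({"episode_reward_mean": 120, "timesteps_total": 5000}, stop))
    # Neither criterion reached yet.
    print(should_stop({"episode_reward_mean": 50, "timesteps_total": 5000}, stop))
```

Under this reading, the regression test passes if the mean episode reward reaches 100 within the 100000-timestep budget; if the timestep limit triggers first, the run ends without having hit the reward target. With Ray installed, a dict of this shape is, as far as I understand, what `ray.tune.run_experiments` accepted at the time of this commit, but that call is left out here so the sketch runs without Ray.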