ray/python/ray/rllib/tuned_examples/regression_tests/cartpole-appo-vtrace.yaml at 16f7ca45e40364e76e5887a751baba38b64c9c3c - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 05:16:49 -04:00

Michael Luo 16f7ca45e4 Appo (#3779 )

* Deleted old fork, updated new ray and moved PPO-impala to APPO in ppo folder

* Deleted unneccesary vtrace.py file

* Update pong-impala.yaml

* Cleaned PPO Code

* Update pong-impala.yaml

* Update pong-impala.yaml

* wip

* new ifle

* refactor

* add vtrace off option

* revert

* support any space

* docs

* fix comment

* remove kl

* Update cartpole-appo-vtrace.yaml

2019-01-18 13:40:26 -08:00

13 lines

294 B

YAML

Raw Blame History

 cartpole-appo-vt:
     env: CartPole-v0
     run: APPO
     stop:
         episode_reward_mean: 100
         timesteps_total: 100000
     config:
         sample_batch_size: 10
         train_batch_size: 10
         num_envs_per_worker: 5
         num_workers: 1
         num_gpus: 0
         vtrace: true