ray/rllib/tuned_examples/regression_tests/cartpole-ddppo.yaml at 31b40b00f67163e9723648fac8a4de5cb99063dc - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Eric Liang 026f6884b5

[rllib] Add Decentralized DDPPO trainer and documentation (#7088 )

2020-02-10 15:28:27 -08:00

8 lines

170 B

YAML

Raw Blame History

 cartpole-ddppo:
     env: CartPole-v0
     run: DDPPO
     stop:
         episode_reward_mean: 100
         timesteps_total: 100000
     config:
         num_gpus_per_worker: 0