ray/rllib/tuned_examples/regression_tests/pendulum-td3.yaml at 57544b1ff9f97d4da9f64d25c8ea5a3d8d247ffc - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Eric Liang 9d012626e5

[rllib] Distributed exec workflow for impala (#8321 )

2020-05-11 20:24:43 -07:00

8 lines

166 B

YAML

Raw Blame History

 pendulum-td3-tf:
     env: Pendulum-v0
     run: TD3
     config:
         use_pytorch: false
     stop:
         episode_reward_mean: -900
         timesteps_total: 100000