ray/rllib/tuned_examples/regression_tests/pendulum-ddpg.yaml at 10d49a3f6fb5f113933d8cf128de28dac9509160 - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 21:06:39 -04:00

Eric Liang 5d7afe8092

[rllib] Try moving RLlib to top level dir (#5324 )

2019-08-05 23:25:49 -07:00

10 lines

224 B

YAML

Raw Blame History

 pendulum-ddpg:
     env: Pendulum-v0
     run: DDPG
     stop:
         episode_reward_mean: -900
         timesteps_total: 100000
     config:
         use_huber: True
         clip_rewards: False
         exploration_fraction: 0.1