ray/rllib/tuned_examples/regression_tests/pendulum-ddpg.yaml

11 lines
224 B
YAML
Raw Normal View History

2018-04-11 15:08:39 -07:00
pendulum-ddpg:
env: Pendulum-v0
run: DDPG
stop:
episode_reward_mean: -900
timesteps_total: 100000
2018-04-11 15:08:39 -07:00
config:
use_huber: True
clip_rewards: False
exploration_fraction: 0.1