ray/rllib/tuned_examples/regression_tests/pendulum-ddpg-tf.yaml

# Timestamp shown in the file history view: 2018-04-11 15:08:39 -07:00
pendulum-ddpg-tf:
    env: Pendulum-v0
    run: DDPG
    stop:
        episode_reward_mean: -900
        timesteps_total: 100000
    config:
        use_pytorch: false
        use_huber: true
        clip_rewards: false
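
The `stop` mapping above can be read as "end the trial as soon as either metric reaches its threshold". The following is a minimal sketch of that reading, assuming Ray Tune's usual behaviour of comparing each reported metric against its threshold with `>=`. The names `STOP` and `should_stop` are hypothetical and exist only for illustration; they are not part of RLlib or Tune.

```python
# Hypothetical helper, not part of RLlib or Ray Tune. It mirrors the assumed
# semantics of the `stop` mapping: a trial ends once ANY listed metric
# reaches or exceeds its threshold.
STOP = {"episode_reward_mean": -900, "timesteps_total": 100000}


def should_stop(result, stop=STOP):
    """Return True once any reported metric meets or exceeds its threshold."""
    return any(
        name in result and result[name] >= threshold
        for name, threshold in stop.items()
    )


if __name__ == "__main__":
    # Early in training: reward still poor, timestep budget not exhausted.
    print(should_stop({"episode_reward_mean": -1400.0, "timesteps_total": 20000}))
    # Reward target reached before the timestep budget runs out.
    print(should_stop({"episode_reward_mean": -850.0, "timesteps_total": 60000}))
    # Budget exhausted without reaching the reward target.
    print(should_stop({"episode_reward_mean": -1200.0, "timesteps_total": 100000}))
```

Under this reading, the regression test passes when the mean episode reward climbs to -900 or better, and the 100000-timestep limit acts as a budget that ends the run if the target is never reached.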