mirror of
https://github.com/vale981/ray
synced 2025-03-12 22:26:39 -04:00

* WIP. * WIP. * WIP. * WIP. * WIP. * Fix * WIP. * Add TD3 quick Pendulum regresison. * Cleanup. * Fix. * LINT. * Fix. * Sort quick_learning test cases, add TD3. * Sort quick_learning test cases, add TD3. * Revert test_checkpoint_restore.py (debugging) changes. * Fix old soft_q settings in documentation and test configs. * More doc fixes. * Fix test case. * Fix test case. * Lower test load. * WIP.
9 lines
190 B
YAML
9 lines
190 B
YAML
pendulum-ddpg:
|
|
env: Pendulum-v0
|
|
run: DDPG
|
|
stop:
|
|
episode_reward_mean: -900
|
|
timesteps_total: 100000
|
|
config:
|
|
use_huber: True
|
|
clip_rewards: False
|