ray/rllib/tuned_examples at 22ccc43670dac93eb7fe81520a84cf3979d05693 - hiro/ray

Mirror of https://github.com/vale981/ray, synced 2025-03-05 18:11:42 -05:00

Sven Mika 22ccc43670 [RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
* Test case fixes and LINT.
* Add regression test for DQN w/ param noise.
* Regression test case.
* Fix (SAC does currently not support eager).
* Update rllib/evaluation/sampler.py
* Update rllib/utils/exploration/exploration.py
* Update rllib/policy/dynamic_tf_policy.py
* Move action_dist back into torch extra_action_out_fn and LINT.
* Working SimpleQ learning cartpole on both torch AND tf.
* Working Rainbow learning cartpole on tf.
* Update docs and add torch to APEX test.
* Fix and docstrings.
* Fix broken RLlib tests in master.
* Split BAZEL learning tests into cartpole and pendulum (reached the 60min barrier).
* Fix error_outputs option in BAZEL for RLlib regression tests.
* Tune param-noise tests.
Co-authored-by: Eric Liang <ekhliang@gmail.com>
regression_tests	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
atari-a2c.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-apex.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-ddppo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-dist-dqn.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-dqn.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-duel-ddqn.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-impala-large.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-impala.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
atari-ppo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
cartpole-grid-search-example.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
cartpole-marwil.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
compact-regression-test.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
halfcheetah-appo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
halfcheetah-ddpg.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
halfcheetah-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
halfcheetah-sac.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
hopper-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
humanoid-es.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
humanoid-ppo-gae.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
humanoid-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
hyperband-cartpole.yaml	Update hyperband-cartpole.yaml (#6121)	2019-11-09 19:39:03 -08:00
invertedpendulum-td3.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
mountaincarcontinuous-apex-ddpg.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
mountaincarcontinuous-ddpg.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
mujoco-td3.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
pendulum-apex-ddpg.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
pendulum-ddpg.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pendulum-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
pendulum-sac.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pendulum-td3.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
pong-a3c-pytorch.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-a3c.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-apex.yaml	Update pong-apex tuned example (#6462)	2019-12-12 10:57:55 -08:00
pong-appo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-dqn.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-impala-fast.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-impala-vectorized.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-impala.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-ppo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
pong-rainbow.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
swimmer-ars.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
walker2d-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00