ray/rllib/tuned_examples/regression_tests at 31b40b00f67163e9723648fac8a4de5cb99063dc - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Sven Mika	22ccc43670	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
* Fix.
* Rollback.
* WIP.
* Test case fixes.
* Test case fixes and LINT.
* Add regression test for DQN w/ param noise.
* Fixes and LINT.
* Comment
* Regression test case.
* LINT.
* Fix (SAC does currently not support eager).
* Update rllib/evaluation/sampler.py
* Update rllib/utils/exploration/exploration.py
* Fix and LINT.
* Update rllib/policy/dynamic_tf_policy.py
* Fixes.
* LINT and fixes.
* Move action_dist back into torch extra_action_out_fn and LINT.
* Working SimpleQ learning cartpole on both torch AND tf.
* Working Rainbow learning cartpole on tf.
* Update docs and add torch to APEX test.
* Fix and docstrings.
* Fix broken RLlib tests in master.
* Split BAZEL learning tests into cartpole and pendulum (reached the 60min barrier).
* Fix error_outputs option in BAZEL for RLlib regression tests.
* Tune param-noise tests.
* test
Co-authored-by: Eric Liang <ekhliang@gmail.com>
File	Last commit	Date
cartpole-a2c-microbatch.yaml	[rllib] Add microbatch optimizer with A2C example (#6161)	2019-11-14 12:14:00 -08:00
cartpole-a2c-torch.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
cartpole-a3c.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
cartpole-appo-vtrace.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
cartpole-appo.yaml	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
cartpole-ars.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
cartpole-ddppo.yaml	[rllib] Add Decentralized DDPPO trainer and documentation (#7088)	2020-02-10 15:28:27 -08:00
cartpole-dqn-tf-param-noise.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
cartpole-dqn-tf.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
cartpole-dqn-torch-param-noise.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
cartpole-dqn-torch.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
cartpole-es.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
cartpole-pg-tf.yaml	[RLlib] Add PG torch regression test (#6828)	2020-01-18 15:57:12 -08:00
cartpole-pg-torch.yaml	[RLlib] Add PG torch regression test (#6828)	2020-01-18 15:57:12 -08:00
cartpole-ppo-tf.yaml	[RLlib] PPO(torch) on CartPole not tuned well enough for consistent learning (#7556)	2020-03-11 20:31:27 -07:00
cartpole-ppo-torch.yaml	[RLlib] PPO(torch) on CartPole not tuned well enough for consistent learning (#7556)	2020-03-11 20:31:27 -07:00
cartpole-sac.yaml	[RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814)	2020-04-03 10:44:25 -07:00
cartpole-simpleq-tf.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
cartpole-simpleq-torch.yaml	[RLlib] DQN torch version. (#7597)	2020-04-06 11:56:16 -07:00
pendulum-appo-vtrace.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
pendulum-ddpg.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
pendulum-ppo.yaml	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
pendulum-sac.yaml	[rllib] SAC no_done_at_end should default to False (#7594)	2020-03-14 11:16:54 -07:00
pendulum-td3.yaml	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00