ray/rllib/tuned_examples
Sven Mika 22ccc43670
[RLlib] DQN torch version. (#7597)
* Fix.

* Rollback.

* WIP.

* WIP.

* WIP.

* WIP.

* WIP.

* WIP.

* WIP.

* WIP.

* Fix.

* Fix.

* Fix.

* Fix.

* Fix.

* WIP.

* WIP.

* Fix.

* Test case fixes.

* Test case fixes and LINT.

* Test case fixes and LINT.

* Rollback.

* WIP.

* WIP.

* Test case fixes.

* Fix.

* Fix.

* Fix.

* Add regression test for DQN w/ param noise.

* Fixes and LINT.

* Fixes and LINT.

* Fixes and LINT.

* Fixes and LINT.

* Fixes and LINT.

* Comment

* Regression test case.

* WIP.

* WIP.

* LINT.

* LINT.

* WIP.

* Fix.

* Fix.

* Fix.

* LINT.

* Fix (SAC does currently not support eager).

* Fix.

* WIP.

* LINT.

* Update rllib/evaluation/sampler.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/evaluation/sampler.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/utils/exploration/exploration.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/utils/exploration/exploration.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* WIP.

* WIP.

* Fix.

* LINT.

* LINT.

* Fix and LINT.

* WIP.

* WIP.

* WIP.

* WIP.

* Fix.

* LINT.

* Fix.

* Fix and LINT.

* Update rllib/utils/exploration/exploration.py

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Fixes.

* WIP.

* LINT.

* Fixes and LINT.

* LINT and fixes.

* LINT.

* Move action_dist back into torch extra_action_out_fn and LINT.

* Working SimpleQ learning cartpole on both torch AND tf.

* Working Rainbow learning cartpole on tf.

* Working Rainbow learning cartpole on tf.

* WIP.

* LINT.

* LINT.

* Update docs and add torch to APEX test.

* LINT.

* Fix.

* LINT.

* Fix.

* Fix.

* Fix and docstrings.

* Fix broken RLlib tests in master.

* Split BAZEL learning tests into cartpole and pendulum (reached the 60min barrier).

* Fix error_outputs option in BAZEL for RLlib regression tests.

* Fix.

* Tune param-noise tests.

* LINT.

* Fix.

* Fix.

* test

* test

* test

* Fix.

* Fix.

* WIP.

* WIP.

* WIP.

* WIP.

* LINT.

* WIP.

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-04-06 11:56:16 -07:00
..
regression_tests [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
atari-a2c.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-apex.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-ddppo.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-dist-dqn.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-dqn.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-duel-ddqn.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-impala-large.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-impala.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
atari-ppo.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
cartpole-grid-search-example.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
cartpole-marwil.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
compact-regression-test.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
halfcheetah-appo.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
halfcheetah-ddpg.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
halfcheetah-ppo.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
halfcheetah-sac.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
hopper-ppo.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
humanoid-es.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
humanoid-ppo-gae.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
humanoid-ppo.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
hyperband-cartpole.yaml Update hyperband-cartpole.yaml (#6121) 2019-11-09 19:39:03 -08:00
invertedpendulum-td3.yaml [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 2020-03-01 11:53:35 -08:00
mountaincarcontinuous-apex-ddpg.yaml [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 2020-03-01 11:53:35 -08:00
mountaincarcontinuous-ddpg.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
mujoco-td3.yaml [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 2020-03-01 11:53:35 -08:00
pendulum-apex-ddpg.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
pendulum-ddpg.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pendulum-ppo.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
pendulum-sac.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pendulum-td3.yaml [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 2020-03-01 11:53:35 -08:00
pong-a3c-pytorch.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-a3c.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-apex.yaml Update pong-apex tuned example (#6462) 2019-12-12 10:57:55 -08:00
pong-appo.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-dqn.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-impala-fast.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-impala-vectorized.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-impala.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-ppo.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
pong-rainbow.yaml [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
swimmer-ars.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
walker2d-ppo.yaml [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00