..
data | [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) | 2021-06-30 12:32:11 +02:00
git_bisect | [ci] Clean up ci/ directory (refactor ci/travis) (#23866) | 2022-04-13 18:11:30 +01:00
__init__.py | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00
conftest.py | [CI] Create zip of ray session_latest/logs dir on test failure and upload to buildkite via /artifact-mount (#23783) | 2022-04-22 09:48:53 +01:00
mock_worker.py | [RLlib] Filter.clear_buffer() deprecated (use Filter.reset_buffer() instead). (#22246) | 2022-02-10 02:58:43 +01:00
run_regression_tests.py | [RLlib] Removed deprecated code with error=True (#23916) | 2022-04-15 13:51:12 +02:00
test_algorithm_imports.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_catalog.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
test_checkpoint_restore.py | [RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. (#24372) | 2022-05-02 12:51:14 +02:00
test_dependency_tf.py | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
test_dependency_torch.py | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
test_dnc.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
test_eager_support.py | [RLlib] Replay Buffer API and Ape-X. (#24506) | 2022-05-17 13:43:49 +02:00
test_execution.py | [RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. (#25346) | 2022-06-02 16:47:05 +02:00
test_export.py | [RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. (#24372) | 2022-05-02 12:51:14 +02:00
test_filters.py | [RLlib] Filter.clear_buffer() deprecated (use Filter.reset_buffer() instead). (#22246) | 2022-02-10 02:58:43 +01:00
test_gpus.py | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
test_io.py | [RLlib]: Rename input_evaluation to off_policy_estimation_methods. (#25107) | 2022-05-27 13:14:54 +02:00
test_local.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_lstm.py | [RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. (#25346) | 2022-06-02 16:47:05 +02:00
test_model_imports.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
test_multi_agent_env.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_multi_agent_pendulum.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
test_nested_action_spaces.py | [CI] Check test files for if __name__... snippet (#25322) | 2022-06-02 10:30:00 +01:00
test_nested_observation_spaces.py | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
test_nn_framework_import_errors.py | [RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. (#25346) | 2022-06-02 16:47:05 +02:00
test_perf.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
test_pettingzoo_env.py | [RLlib] Update pettingzoo==1.15.0 supersuit==3.3.3 (#22519) | 2022-03-01 11:23:27 +01:00
test_placement_groups.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_ray_client.py | [RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. (#25346) | 2022-06-02 16:47:05 +02:00
test_reproducibility.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_rllib_train_and_evaluate.py | [RLlib] Vectorized envs: Gracefully handle sub-environments failing by restarting them (if configured so). (#24967) | 2022-05-28 10:50:03 +02:00
test_supported_multi_agent.py | [RLlib] Replay Buffer API and Ape-X. (#24506) | 2022-05-17 13:43:49 +02:00
test_supported_spaces.py | [RLlib] Replay Buffer API and Ape-X. (#24506) | 2022-05-17 13:43:49 +02:00
test_timesteps.py | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
test_vector_env.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00