data
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes ( #16531 )
2021-06-30 12:32:11 +02:00
git_bisect
[ci] Clean up ci/ directory (refactor ci/travis) ( #23866 )
2022-04-13 18:11:30 +01:00
__init__.py
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
conftest.py
[CI] Create zip of ray session_latest/logs dir on test failure and upload to buildkite via /artifact-mount ( #23783 )
2022-04-22 09:48:53 +01:00
mock_worker.py
[RLlib] Filter.clear_buffer() deprecated (use Filter.reset_buffer() instead). ( #22246 )
2022-02-10 02:58:43 +01:00
run_regression_tests.py
[RLlib] Removed deprecated code with error=True ( #23916 )
2022-04-15 13:51:12 +02:00
test_algorithm_imports.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_catalog.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_checkpoint_restore.py
[RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. ( #24372 )
2022-05-02 12:51:14 +02:00
test_dependency_tf.py
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). ( #25314 )
2022-06-01 09:29:16 +02:00
test_dependency_torch.py
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). ( #25314 )
2022-06-01 09:29:16 +02:00
test_dnc.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_eager_support.py
[RLlib] Replay Buffer API and Ape-X. ( #24506 )
2022-05-17 13:43:49 +02:00
test_execution.py
[RLlib] Migrate PPO Impala and APPO policies to use sub-classing implementation. ( #25117 )
2022-05-25 14:38:03 +02:00
test_export.py
[RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. ( #24372 )
2022-05-02 12:51:14 +02:00
test_filters.py
[RLlib] Filter.clear_buffer() deprecated (use Filter.reset_buffer() instead). ( #22246 )
2022-02-10 02:58:43 +01:00
test_gpus.py
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). ( #25314 )
2022-06-01 09:29:16 +02:00
test_io.py
[RLlib]: Rename input_evaluation to off_policy_estimation_methods. ( #25107 )
2022-05-27 13:14:54 +02:00
test_local.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_lstm.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_model_imports.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_multi_agent_env.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_multi_agent_pendulum.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_nested_action_spaces.py
[CI] Check test files for if __name__... snippet ( #25322 )
2022-06-02 10:30:00 +01:00
test_nested_observation_spaces.py
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). ( #25314 )
2022-06-01 09:29:16 +02:00
test_nn_framework_import_errors.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_perf.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_pettingzoo_env.py
[RLlib] Update pettingzoo==1.15.0 supersuit==3.3.3 ( #22519 )
2022-03-01 11:23:27 +01:00
test_placement_groups.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_ray_client.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00
test_reproducibility.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_rllib_train_and_evaluate.py
[RLlib] Vectorized envs: Gracefully handle sub-environments failing by restarting them (if configured so). ( #24967 )
2022-05-28 10:50:03 +02:00
test_supported_multi_agent.py
[RLlib] Replay Buffer API and Ape-X. ( #24506 )
2022-05-17 13:43:49 +02:00
test_supported_spaces.py
[RLlib] Replay Buffer API and Ape-X. ( #24506 )
2022-05-17 13:43:49 +02:00
test_timesteps.py
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
test_vector_env.py
[CI] Format Python code with Black ( #21975 )
2022-01-29 18:41:57 -08:00