.. |
debug
|
[RLlib] restart_failed_sub_environments now works for MA cases and crashes during reset() ; +more tests and logging; add eval worker sub-env fault tolerance test. (#26276)
|
2022-07-15 08:55:14 +02:00 |
exploration
|
[air] update rllib example to use Tuner API. (#26987)
|
2022-07-27 12:12:59 +01:00 |
metrics
|
[RLlib] Tolerate nan metrics in LearnerInfoBuilder. (#27981)
|
2022-08-23 10:07:32 -07:00 |
pre_checks
|
Bump gym dep to 0.24 (#26190)
|
2022-07-22 12:37:16 -07:00 |
replay_buffers
|
[RLlib] Move learning_starts logic from buffers into training_step() . (#26032)
|
2022-08-11 13:07:30 +02:00 |
schedules
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
spaces
|
[docs] Fix the remaining style violations in docstrings and add lint rule (#27033)
|
2022-07-27 22:24:20 -07:00 |
tests
|
[RLLib] Fix MultiDiscrete not being one-hotted correctly (#26558)
|
2022-07-20 15:25:53 -07:00 |
__init__.py
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |
actors.py
|
[CI] Format Python code with Black (#21975)
|
2022-01-29 18:41:57 -08:00 |
annotations.py
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
compression.py
|
[CI] Format Python code with Black (#21975)
|
2022-01-29 18:41:57 -08:00 |
deprecation.py
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
error.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
filter.py
|
[RLlib] EnvRunnerV2 and EpisodeV2 that support Connectors. (#25922)
|
2022-06-30 08:44:10 +02:00 |
filter_manager.py
|
[RLlib] Add timeout to filter synchronization. (#25959)
|
2022-06-24 14:37:43 +02:00 |
framework.py
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |
from_config.py
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |
images.py
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
memory.py
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |
numpy.py
|
[RLlib] Simplify agent collector (#26803)
|
2022-07-25 13:17:17 -07:00 |
policy.py
|
[RLlib] Fix connector examples (#27583)
|
2022-08-07 17:48:09 -07:00 |
serialization.py
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
sgd.py
|
[RLlib] Get rid of all these deprecation warnings. (#27085)
|
2022-07-27 10:48:54 -07:00 |
test_utils.py
|
[RLlib] Fix dqn reproducibility (#27459)
|
2022-08-09 15:56:44 -07:00 |
tf_run_builder.py
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
tf_utils.py
|
[RLlib] Warn on PPO infinite KL loss term. (#26629)
|
2022-08-01 12:55:26 +02:00 |
threading.py
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
timer.py
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
torch_utils.py
|
[RLlib] Fix PPOTorchPolicy producing float metrics when not using critic. (#27980)
|
2022-08-22 09:41:36 -07:00 |
typing.py
|
[docs][rllib] Documentation for connectors. (#27528)
|
2022-08-19 14:35:07 -07:00 |
window_stat.py
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |