ray/rllib/utils
2022-08-23 10:07:32 -07:00
..
debug [RLlib] restart_failed_sub_environments now works for MA cases and crashes during reset(); +more tests and logging; add eval worker sub-env fault tolerance test. (#26276) 2022-07-15 08:55:14 +02:00
exploration [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
metrics [RLlib] Tolerate nan metrics in LearnerInfoBuilder. (#27981) 2022-08-23 10:07:32 -07:00
pre_checks Bump gym dep to 0.24 (#26190) 2022-07-22 12:37:16 -07:00
replay_buffers [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
schedules [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2022-05-24 22:14:25 -07:00
spaces [docs] Fix the remaining style violations in docstrings and add lint rule (#27033) 2022-07-27 22:24:20 -07:00
tests [RLLib] Fix MultiDiscrete not being one-hotted correctly (#26558) 2022-07-20 15:25:53 -07:00
__init__.py Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
actors.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
annotations.py [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
compression.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
deprecation.py [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2022-05-24 22:14:25 -07:00
error.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
filter.py [RLlib] EnvRunnerV2 and EpisodeV2 that support Connectors. (#25922) 2022-06-30 08:44:10 +02:00
filter_manager.py [RLlib] Add timeout to filter synchronization. (#25959) 2022-06-24 14:37:43 +02:00
framework.py Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
from_config.py Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
images.py [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2022-05-24 22:14:25 -07:00
memory.py [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 2021-11-01 21:46:02 +01:00
numpy.py [RLlib] Simplify agent collector (#26803) 2022-07-25 13:17:17 -07:00
policy.py [RLlib] Fix connector examples (#27583) 2022-08-07 17:48:09 -07:00
serialization.py [RLlib] Fix a bunch of issues related to connectors. (#26510) 2022-07-13 18:55:20 +02:00
sgd.py [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
test_utils.py [RLlib] Fix dqn reproducibility (#27459) 2022-08-09 15:56:44 -07:00
tf_run_builder.py [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2022-05-24 22:14:25 -07:00
tf_utils.py [RLlib] Warn on PPO infinite KL loss term. (#26629) 2022-08-01 12:55:26 +02:00
threading.py [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2022-05-24 22:14:25 -07:00
timer.py [RLlib] Cleanup some deprecated metric keys and classes. (#26036) 2022-06-23 21:30:01 +02:00
torch_utils.py [RLlib] Fix PPOTorchPolicy producing float metrics when not using critic. (#27980) 2022-08-22 09:41:36 -07:00
typing.py [docs][rllib] Documentation for connectors. (#27528) 2022-08-19 14:35:07 -07:00
window_stat.py [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 2021-11-01 21:46:02 +01:00