ray/rllib/utils
2021-03-25 12:33:23 +01:00
..
exploration [RLlib] Fix param noise test case on CI. (#14926) 2021-03-25 12:33:23 +01:00
schedules [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 2020-10-12 13:49:11 -07:00
spaces [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 2020-10-01 16:57:10 +02:00
tests Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
actors.py Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
annotations.py Fix overriden typo (#11227) 2020-10-07 19:11:07 -07:00
compression.py Stop vendoring pyarrow (#7233) 2020-02-19 19:01:26 -08:00
debug.py [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) 2020-06-12 20:17:27 -07:00
deprecation.py [RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363) 2021-01-14 14:44:33 +01:00
error.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
filter.py [rllib] Rrk/12079 custom filters (#12095) 2020-11-19 13:20:20 -08:00
filter_manager.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
framework.py [RLlib] Preparatory PR for: Documentation on Model Building. (#13260) 2021-01-08 10:56:09 +01:00
from_config.py [RLlib] Make envs specifiable in configs by their class path. (#8750) 2020-06-03 08:14:29 +02:00
memory.py [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 2020-07-29 21:15:09 +02:00
numpy.py [RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397) 2020-11-25 11:27:46 -08:00
sgd.py [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 2021-03-08 15:41:27 +01:00
test_utils.py [RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522) 2021-02-02 13:05:58 +01:00
tf_ops.py [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 2021-01-19 14:22:36 +01:00
tf_run_builder.py [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136) 2020-06-30 10:13:20 +02:00
threading.py [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
timer.py [rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299) 2020-02-28 16:44:17 -08:00
torch_ops.py [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 2021-01-19 14:22:36 +01:00
tracking_dict.py [RLlib] Obsolete usage tracking dict via sample batch. (#13065) 2021-03-17 08:18:15 +01:00
typing.py [RLlib] Attention Nets: tf (#12753) 2020-12-20 20:22:32 -05:00
window_stat.py [RLLib] WindowStat bug fix (#9213) 2020-07-12 23:01:32 +02:00