ray/rllib/utils
2020-07-17 12:14:34 +02:00
..
exploration [RLlib] Issue 8384: QMIX doesn't learn anything. (#9527) 2020-07-17 12:14:34 +02:00
schedules [RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). (#9304) 2020-07-09 10:44:10 +02:00
spaces [RLlib] Retire try_import_tree (should be installed along with other requirements). (#9211) 2020-07-02 13:06:34 +02:00
tests Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
__init__.py [RLlib] Retire try_import_tree (should be installed along with other requirements). (#9211) 2020-07-02 13:06:34 +02:00
actors.py Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
annotations.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
compression.py Stop vendoring pyarrow (#7233) 2020-02-19 19:01:26 -08:00
debug.py [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) 2020-06-12 20:17:27 -07:00
deprecation.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
error.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
filter.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
filter_manager.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
framework.py [RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). (#9304) 2020-07-09 10:44:10 +02:00
from_config.py [RLlib] Make envs specifiable in configs by their class path. (#8750) 2020-06-03 08:14:29 +02:00
memory.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
numpy.py [RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269) 2020-07-14 04:27:49 +02:00
policy_client.py [RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590) 2020-05-30 22:48:34 +02:00
policy_server.py [rllib] Add high-performance external application connector (#7641) 2020-03-20 12:43:57 -07:00
sgd.py WIP. (#8456) 2020-05-15 21:43:27 +02:00
test_utils.py [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
tf_ops.py [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
tf_run_builder.py [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136) 2020-06-30 10:13:20 +02:00
timer.py [rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299) 2020-02-28 16:44:17 -08:00
torch_ops.py [RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497) 2020-07-16 14:55:50 +02:00
tracking_dict.py This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970) 2020-06-27 20:50:01 +02:00
tuple_actions.py [RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) 2020-04-28 14:59:16 +02:00
types.py [RLlib] Type annotations for policy. (#9248) 2020-07-05 13:09:51 +02:00
window_stat.py [RLLib] WindowStat bug fix (#9213) 2020-07-12 23:01:32 +02:00