ray/rllib/utils
2021-09-02 09:28:16 +02:00
exploration [RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" (#17587) 2021-08-05 11:39:15 -04:00
schedules [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 2020-10-12 13:49:11 -07:00
spaces [RLlib] Add multi-GPU learning tests to nightly. (#17778) 2021-08-18 17:21:01 +02:00
tests Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
__init__.py [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) 2021-08-03 18:30:02 -04:00
actors.py Change Python's ObjectID to ObjectRef (#9353) 2020-07-10 17:49:04 +08:00
annotations.py [RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999) 2021-08-31 12:21:49 +02:00
compression.py Stop vendoring pyarrow (#7233) 2020-02-19 19:01:26 -08:00
debug.py [RLlib] Minor renaming and cleanups related to last rollout worker seed fix. (#18155) 2021-09-02 06:57:46 +02:00
deprecation.py [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) 2021-08-03 18:30:02 -04:00
error.py [RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249) 2021-09-02 09:28:16 +02:00
filter.py [rllib] Rrk/12079 custom filters (#12095) 2020-11-19 13:20:20 -08:00
filter_manager.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
framework.py [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) 2021-08-03 18:30:02 -04:00
from_config.py [RLlib] Make envs specifiable in configs by their class path. (#8750) 2020-06-03 08:14:29 +02:00
images.py [RLlib] CV2 to Skimage dependency change (#16841) 2021-07-21 22:24:18 -04:00
memory.py [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 2020-07-29 21:15:09 +02:00
multi_agent.py [RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254) 2021-08-31 22:03:23 +02:00
numpy.py [RLlib] Policies get/set_state fixes and enhancements. (#16354) 2021-06-15 13:08:43 +02:00
sgd.py [RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046) 2021-07-15 05:51:24 -04:00
test_utils.py [RLlib; Testing] Fix smoke-test settings for nightly learning_tests and stress_test; Add pybullet_envs to app-config. (#18274) 2021-09-01 21:46:06 +02:00
tf_ops.py [RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use ray.get_gpu_ids() (b/c no GPUs assigned by ray). (#17444) 2021-08-02 17:29:59 -04:00
tf_run_builder.py [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136) 2020-06-30 10:13:20 +02:00
threading.py [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
timer.py [rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299) 2020-02-28 16:44:17 -08:00
torch_ops.py [RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371) 2021-08-03 11:35:49 -04:00
typing.py [RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046) 2021-07-15 05:51:24 -04:00
window_stat.py [RLLib] WindowStat bug fix (#9213) 2020-07-12 23:01:32 +02:00