ray/rllib/utils at 73f5c4039b2bc70b8590f6408ff6523b8d652459 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 73f5c4039b [RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035 )		2021-10-04 13:23:51 +02:00
..
exploration	[RLlib] Fix crash when using StochasticSampling exploration (most PG-style algos) w/ tf and numpy > 1.19.5 (#18366 )	2021-09-06 12:14:00 +02:00
metrics	[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )	2021-09-30 16:39:05 +02:00
schedules	[RLlib] Allow for more than 2^31 policy timesteps. (#11301 )	2020-10-12 13:49:11 -07:00
spaces	[RLlib] No Preprocessors; preparatory PR #1 (#18367 )	2021-09-09 08:10:42 +02:00
tests	[RLlib] Better error messages and hints; + failure-mode tests; (#18466 )	2021-09-10 16:52:47 +02:00
__init__.py	[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )	2021-09-30 16:39:05 +02:00
actors.py	Change Python's `ObjectID` to `ObjectRef` (#9353 )	2020-07-10 17:49:04 +08:00
annotations.py	[RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999 )	2021-08-31 12:21:49 +02:00
compression.py	Stop vendoring pyarrow (#7233 )	2020-02-19 19:01:26 -08:00
debug.py	[RLlib] Set random seed (if provided) to Trainer process as well. (#18307 )	2021-09-04 11:02:30 +02:00
deprecation.py	[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530 )	2021-08-03 18:30:02 -04:00
error.py	[RLlib] Better error messages and hints; + failure-mode tests; (#18466 )	2021-09-10 16:52:47 +02:00
filter.py	[RLlib] No Preprocessors (part 2). (#18468 )	2021-09-23 12:56:45 +02:00
filter_manager.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
framework.py	[RLlib] No Preprocessors (part 2). (#18468 )	2021-09-23 12:56:45 +02:00
from_config.py	[RLlib] Better error messages and hints; + failure-mode tests; (#18466 )	2021-09-10 16:52:47 +02:00
images.py	[RLlib] CV2 to Skimage dependency change (#16841 )	2021-07-21 22:24:18 -04:00
install_atari_roms.sh	[rllib/docker] silent unzip of atari roms (#18340 )	2021-09-03 17:55:03 +01:00
memory.py	[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678 )	2020-07-29 21:15:09 +02:00
multi_agent.py	[RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035 )	2021-10-04 13:23:51 +02:00
numpy.py	[RLlib] Policies get/set_state fixes and enhancements. (#16354 )	2021-06-15 13:08:43 +02:00
sgd.py	[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )	2021-09-30 16:39:05 +02:00
test_utils.py	[RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035 )	2021-10-04 13:23:51 +02:00
tf_ops.py	[RLlib] Discussion 3644: Fix bug for complex obs spaces containing `Box([2D shape])` and discrete component. (#18917 )	2021-09-30 16:39:38 +02:00
tf_run_builder.py	[Lint] Add flake8-bugbear (#19053 )	2021-10-03 23:24:11 -07:00
threading.py	[RLlib] R2D2 Implementation. (#13933 )	2021-02-25 12:18:11 +01:00
timer.py	[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299 )	2020-02-28 16:44:17 -08:00
torch_ops.py	[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )	2021-09-30 16:39:05 +02:00
typing.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
window_stat.py	[RLLib] WindowStat bug fix (#9213 )	2020-07-12 23:01:32 +02:00