ray/rllib/execution at 1dbe7fc26a06c614154e9946d5bbd05d374eea4a - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Sven Mika 924f11cd45 [RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371)	2021-08-03 11:35:49 -04:00
tests	[RLlib] Remove old SegmentTree from tests dir and unflake respective segment tree test. (#14450)	2021-03-03 14:31:30 +01:00
__init__.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00
common.py	[RLlib] Issue #13802: Enhance metrics for `multiagent->count_steps_by=agent_steps` setting. (#14033)	2021-03-18 20:27:41 +01:00
concurrency_ops.py	[RLlib] Execution Annotation (#13036)	2020-12-24 09:30:33 -05:00
learner_thread.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00
metric_ops.py	[RLlib] CQL iteration count fixes: Remove dummy buffer and unnecessary store op from exec_plan. (#16332)	2021-06-10 07:49:17 +02:00
minibatch_buffer.py	[RLlib] Execution Annotation (#13036)	2020-12-24 09:30:33 -05:00
multi_gpu_impl.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00
multi_gpu_learner.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00
multi_gpu_learner_thread.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00
replay_buffer.py	[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218)	2021-06-03 22:29:00 +02:00
replay_ops.py	[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218)	2021-06-03 22:29:00 +02:00
rollout_ops.py	[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)	2021-06-30 12:32:11 +02:00
segment_tree.py	[RLlib] APEX-DQN: Bug fix for torch and add learning test. (#15762)	2021-05-20 09:27:03 +02:00
train_ops.py	[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371)	2021-08-03 11:35:49 -04:00
tree_agg.py	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)	2021-07-20 14:58:13 -04:00