ray/rllib/policy at master - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-04 17:41:43 -05:00

History

Artur Niederfahrenhorst 2ce80d8163 [RLlib] Rename connector's from/to config methods to better reflect that they include state. (#27806 )		2022-08-29 14:37:21 +02:00
..
tests	[RLlib] Add Segmentation Buffer for DT (#27829 )	2022-08-16 15:20:41 -07:00
__init__.py	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py	[RLlib] Include SampleBatch.T column in all collected batches. (#25926 )	2022-06-21 13:20:22 +02:00
dynamic_tf_policy_v2.py	[RLlib] Migrating DDPG to PolicyV2. (#26054 )	2022-06-28 15:52:56 +02:00
eager_tf_policy.py	[RLlib] more connector polishes and fixes. (#26645 )	2022-07-19 08:50:28 -07:00
eager_tf_policy_v2.py	[RLlib] more connector polishes and fixes. (#26645 )	2022-07-19 08:50:28 -07:00
policy.py	[RLlib] Rename connector's from/to config methods to better reflect that they include state. (#27806 )	2022-08-29 14:37:21 +02:00
policy_map.py	[RLlib] Checkpoint and restore connectors. (#26253 )	2022-07-09 01:06:24 -07:00
policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539 )	2022-06-11 15:10:39 +02:00
rnn_sequencing.py	[RLlib] User-friendly RNN sequencing. (#27087 )	2022-08-01 15:32:22 +02:00
sample_batch.py	[RLlib] Add Segmentation Buffer for DT (#27829 )	2022-08-16 15:20:41 -07:00
tf_mixins.py	[RLlib] Unify gnorm mixin for tf and torch policies. (#26102 )	2022-07-24 15:31:09 +02:00
tf_policy.py	[RLlib] more connector polishes and fixes. (#26645 )	2022-07-19 08:50:28 -07:00
tf_policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539 )	2022-06-11 15:10:39 +02:00
torch_mixins.py	[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869 )	2022-06-20 15:54:00 +02:00
torch_policy.py	[RLlib] Warn on PPO infinite KL loss term. (#26629 )	2022-08-01 12:55:26 +02:00
torch_policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539 )	2022-06-11 15:10:39 +02:00
torch_policy_v2.py	[RLlib] Warn on PPO infinite KL loss term. (#26629 )	2022-08-01 12:55:26 +02:00
view_requirement.py	[RLlib] Simplify agent collector (#26803 )	2022-07-25 13:17:17 -07:00