ray/rllib/policy at d90c6cfbd643668ee210eb8d40a370447825dab9 - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

Sven Mika d90c6cfbd6 [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)		2022-06-17 20:12:16 +02:00
tests	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
__init__.py	[RLlib] JAXPolicy prep. PR #1. (#13077)	2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
dynamic_tf_policy_v2.py	[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)	2022-06-17 20:12:16 +02:00
eager_tf_policy.py	[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)	2022-05-24 22:14:25 -07:00
eager_tf_policy_v2.py	[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)	2022-06-17 20:12:16 +02:00
policy.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
policy_map.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
rnn_sequencing.py	[RLlib] Replay Buffer API documentation. (#24683)	2022-06-10 16:47:51 +02:00
sample_batch.py	[RLlib] Fix sample batch concat samples. (#25572)	2022-06-14 12:47:29 +02:00
tf_mixins.py	[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)	2022-06-17 20:12:16 +02:00
tf_policy.py	[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)	2022-06-17 20:12:16 +02:00
tf_policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
torch_mixins.py	Clean up docstyle in python modules and add LINT rule (#25272)	2022-06-01 11:27:54 -07:00
torch_policy.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
torch_policy_template.py	[RLlib] `Trainer` to `Algorithm` renaming. (#25539)	2022-06-11 15:10:39 +02:00
torch_policy_v2.py	[RLlib] Fix `action_sampler_fn` call in `TorchPolicyV2` (`obs_batch` instead of `input_dict` arg). (#25877)	2022-06-17 08:39:39 +02:00
view_requirement.py	Clean up docstyle in python modules and add LINT rule (#25272)	2022-06-01 11:27:54 -07:00