ray/rllib/policy
2022-05-17 17:16:08 +02:00
..
tests [RLlib] Fix time dimension shaping for PyTorch RNN models. (#21735) 2022-04-29 10:39:03 +02:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py [RLlib] Add additional return values to action_sampler_fn. (#22721) 2022-04-29 10:34:48 +02:00
dynamic_tf_policy_v2.py [RLlib] Introduce new policy base classes. (#24742) 2022-05-13 21:48:30 +02:00
eager_tf_policy.py [RLlib] Add additional return values to action_sampler_fn. (#22721) 2022-04-29 10:34:48 +02:00
eager_tf_policy_v2.py [RLlib] Introduce new policy base classes. (#24742) 2022-05-13 21:48:30 +02:00
policy.py [RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271) 2022-04-28 13:43:21 +02:00
policy_map.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
rnn_sequencing.py [RLlib] Automate sequences in timeslice_along_seq_lens_with_overlap(). (#24561) 2022-05-09 11:55:06 +02:00
sample_batch.py Issue 24143: Fix a few f-strings missing the f. (#24232) 2022-05-02 16:11:33 +02:00
tf_mixins.py [RLlib] Clean up Policy mixins. (#24746) 2022-05-17 17:16:08 +02:00
tf_policy.py [RLlib] Clean up Policy mixins. (#24746) 2022-05-17 17:16:08 +02:00
tf_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
torch_mixins.py [RLlib] Clean up Policy mixins. (#24746) 2022-05-17 17:16:08 +02:00
torch_policy.py [RLlib] Clean up Policy mixins. (#24746) 2022-05-17 17:16:08 +02:00
torch_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
torch_policy_v2.py [RLlib] Introduce new policy base classes. (#24742) 2022-05-13 21:48:30 +02:00
view_requirement.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00