ray/rllib/policy
2022-04-28 13:43:21 +02:00
tests [RLlib] Issue 23907: SampleBatch.shuffle does not flush intercepted_values dict (which it should). (#24005) 2022-04-19 17:55:59 +02:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py [RLlib] SlateQ (tf GPU + multi-GPU) + Bandit fixes (#23276) 2022-03-18 13:45:16 +01:00
eager_tf_policy.py [RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271) 2022-04-28 13:43:21 +02:00
policy.py [RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271) 2022-04-28 13:43:21 +02:00
policy_map.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
rnn_sequencing.py [RLlib] Issue 21991: Fix SampleBatch slicing for SampleBatch.INFOS in RNN cases (#22050) 2022-04-25 11:40:24 +02:00
sample_batch.py [RLlib] Issue 21991: Fix SampleBatch slicing for SampleBatch.INFOS in RNN cases (#22050) 2022-04-25 11:40:24 +02:00
tf_policy.py [RLlib] Add support for writing env 'info' dicts to output datasets for TFPolicies (for TorchPolicies, these are part of the view-requirements by default and thus written either way). (#24041) 2022-04-25 11:17:50 +02:00
tf_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
torch_policy.py [rllib] Fix incorrect sequence length for rnn (#23830) 2022-04-12 21:07:18 +01:00
torch_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
view_requirement.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00