tests | [RLlib] Fix time dimension shaping for PyTorch RNN models. (#21735) | 2022-04-29 10:39:03 +02:00
__init__.py | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py | [RLlib] Add additional return values to action_sampler_fn. (#22721) | 2022-04-29 10:34:48 +02:00
dynamic_tf_policy_v2.py | [RLlib] Introduce new policy base classes. (#24742) | 2022-05-13 21:48:30 +02:00
eager_tf_policy.py | [RLlib] Add additional return values to action_sampler_fn. (#22721) | 2022-04-29 10:34:48 +02:00
eager_tf_policy_v2.py | [RLlib] Introduce new policy base classes. (#24742) | 2022-05-13 21:48:30 +02:00
policy.py | [RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271) | 2022-04-28 13:43:21 +02:00
policy_map.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
policy_template.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
rnn_sequencing.py | [RLlib] Automate sequences in timeslice_along_seq_lens_with_overlap(). (#24561) | 2022-05-09 11:55:06 +02:00
sample_batch.py | Issue 24143: Fix a few f-strings missing the f. (#24232) | 2022-05-02 16:11:33 +02:00
tf_mixins.py | [RLlib] Clean up Policy mixins. (#24746) | 2022-05-17 17:16:08 +02:00
tf_policy.py | [RLlib] Clean up Policy mixins. (#24746) | 2022-05-17 17:16:08 +02:00
tf_policy_template.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
torch_mixins.py | [RLlib] Clean up Policy mixins. (#24746) | 2022-05-17 17:16:08 +02:00
torch_policy.py | [RLlib] Clean up Policy mixins. (#24746) | 2022-05-17 17:16:08 +02:00
torch_policy_template.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00
torch_policy_v2.py | [RLlib] Introduce new policy base classes. (#24742) | 2022-05-13 21:48:30 +02:00
view_requirement.py | [docs] fix doctests and activate CI (#23418) | 2022-03-24 17:04:02 -07:00