ray/rllib/policy
2022-04-07 10:16:22 +02:00
..
tests [RLlib] Issue 22625: MultiAgentBatch.timeslices() does not behave as expected. (#22657) 2022-03-08 14:25:48 +01:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py [RLlib] SlateQ (tf GPU + multi-GPU) + Bandit fixes (#23276) 2022-03-18 13:45:16 +01:00
eager_tf_policy.py Revert "Revert "[RLlib] Speedup A3C up to 3x (new training_iteration function instead of execution_plan) and re-instate Pong learning test."" (#18708) 2022-02-10 13:44:22 +01:00
policy.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00
policy_map.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
rnn_sequencing.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00
sample_batch.py [RLlib] Issue 23639: Error in client/server setup when using LSTMs (#23740) 2022-04-07 10:16:22 +02:00
tf_policy.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00
tf_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
torch_policy.py [RLlib] Issue #21671: Handle callbacks and model metrics for TorchPolicy while using multi-GPU optimizers (#21697) 2022-02-23 08:30:38 +01:00
torch_policy_template.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
view_requirement.py [docs] fix doctests and activate CI (#23418) 2022-03-24 17:04:02 -07:00