ray/rllib/policy
2021-05-18 11:05:42 +02:00
..
tests [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 2021-04-30 19:26:30 +02:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 2021-04-30 19:26:30 +02:00
eager_tf_policy.py [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 2021-04-30 19:26:30 +02:00
policy.py [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
policy_template.py [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 2021-05-04 19:06:19 +02:00
rnn_sequencing.py [RLlib] Issue 15556: Fix R2D2 using chunks from previous episodes in the "burn-in" window. (#15737) 2021-05-18 11:05:42 +02:00
sample_batch.py [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 2021-04-30 19:26:30 +02:00
tf_policy.py [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
tf_policy_template.py [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
torch_policy.py [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
torch_policy_template.py [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 2021-05-04 19:06:19 +02:00
view_requirement.py [RLlib] Remove all non-trajectory view API code. (#14860) 2021-03-23 09:50:18 -07:00