ray/rllib/policy
2021-02-22 17:09:40 +01:00
..
tests [RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522) 2021-02-02 13:05:58 +01:00
__init__.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py [RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037) 2021-02-11 18:58:46 +01:00
eager_tf_policy.py [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) 2021-02-08 15:02:19 +01:00
policy.py [RLlib] Implement TorchPolicy.export_model. (#13989) 2021-02-22 17:09:40 +01:00
policy_template.py [RLlib] Trajectory view API docs. (#12718) 2020-12-30 17:32:21 -08:00
rnn_sequencing.py [RLlib] Attention Nets: tf (#12753) 2020-12-20 20:22:32 -05:00
sample_batch.py [RLlib] Add __len__() method to SampleBatch (#13371) 2021-01-12 20:15:23 +01:00
tf_policy.py [RLlib] TFPolicy.export_model: Add timestep placeholder to model's signature, if needed. (#13988) 2021-02-10 15:21:46 +01:00
tf_policy_template.py [RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029) 2020-12-21 18:38:34 -08:00
torch_policy.py [RLlib] Implement TorchPolicy.export_model. (#13989) 2021-02-22 17:09:40 +01:00
torch_policy_template.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
view_requirement.py [RLlib] Trajectory View API: Atari framestacking. (#13315) 2021-01-13 08:53:34 +01:00