ray/rllib/policy at c1a1be1da64e453e022e8501b905a1f67c5383bb - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

History

Sven Mika 95ef04b71a [RLlib] Implement `TorchPolicy.export_model`. (#13989 )		2021-02-22 17:09:40 +01:00
..
tests	[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522 )	2021-02-02 13:05:58 +01:00
__init__.py	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
dynamic_tf_policy.py	[RLlib] Issue #13507 : Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037 )	2021-02-11 18:58:46 +01:00
eager_tf_policy.py	[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584 )	2021-02-08 15:02:19 +01:00
policy.py	[RLlib] Implement `TorchPolicy.export_model`. (#13989 )	2021-02-22 17:09:40 +01:00
policy_template.py	[RLlib] Trajectory view API docs. (#12718 )	2020-12-30 17:32:21 -08:00
rnn_sequencing.py	[RLlib] Attention Nets: tf (#12753 )	2020-12-20 20:22:32 -05:00
sample_batch.py	[RLlib] Add `__len__()` method to SampleBatch (#13371 )	2021-01-12 20:15:23 +01:00
tf_policy.py	[RLlib] TFPolicy.export_model: Add timestep placeholder to model's signature, if needed. (#13988 )	2021-02-10 15:21:46 +01:00
tf_policy_template.py	[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029 )	2020-12-21 18:38:34 -08:00
torch_policy.py	[RLlib] Implement `TorchPolicy.export_model`. (#13989 )	2021-02-22 17:09:40 +01:00
torch_policy_template.py	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
view_requirement.py	[RLlib] Trajectory View API: Atari framestacking. (#13315 )	2021-01-13 08:53:34 +01:00