.. |
tests
|
[RLlib] Add compute log likelihoods test for CRR. (#25905)
|
2022-06-21 16:06:10 +02:00 |
__init__.py
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
dynamic_tf_policy.py
|
[RLlib] Include SampleBatch.T column in all collected batches. (#25926)
|
2022-06-21 13:20:22 +02:00 |
dynamic_tf_policy_v2.py
|
[RLlib] Migrating DDPG to PolicyV2. (#26054)
|
2022-06-28 15:52:56 +02:00 |
eager_tf_policy.py
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
eager_tf_policy_v2.py
|
[RLlib] Migrating DDPG to PolicyV2. (#26054)
|
2022-06-28 15:52:56 +02:00 |
policy.py
|
[RLlib] Include SampleBatch.T column in all collected batches. (#25926)
|
2022-06-21 13:20:22 +02:00 |
policy_map.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
rnn_sequencing.py
|
[RLlib] Replay Buffer API documentation. (#24683)
|
2022-06-10 16:47:51 +02:00 |
sample_batch.py
|
[RLlib] Fix sample batch concat samples. (#25572)
|
2022-06-14 12:47:29 +02:00 |
tf_mixins.py
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
tf_policy.py
|
[api] Annotate as public / move ray-core APIs to _private and add enforcement rule (#25695)
|
2022-06-21 15:13:29 -07:00 |
tf_policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
torch_mixins.py
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
torch_policy.py
|
[api] Annotate as public / move ray-core APIs to _private and add enforcement rule (#25695)
|
2022-06-21 15:13:29 -07:00 |
torch_policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
torch_policy_v2.py
|
[RLlib] Fix get_num_samples_loaded_into_buffer in TorchPolicyV2. (#25956)
|
2022-06-22 13:11:41 +02:00 |
view_requirement.py
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |