.. |
tests
|
[RLlib]: Fix SampleBatch.split_by_episode to use dones if episode id is not available (#26492)
|
2022-07-22 16:46:05 -07:00 |
__init__.py
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
dynamic_tf_policy.py
|
[RLlib] Include SampleBatch.T column in all collected batches. (#25926)
|
2022-06-21 13:20:22 +02:00 |
dynamic_tf_policy_v2.py
|
[RLlib] Migrating DDPG to PolicyV2. (#26054)
|
2022-06-28 15:52:56 +02:00 |
eager_tf_policy.py
|
[RLlib] more connector polishes and fixes. (#26645)
|
2022-07-19 08:50:28 -07:00 |
eager_tf_policy_v2.py
|
[RLlib] more connector polishes and fixes. (#26645)
|
2022-07-19 08:50:28 -07:00 |
policy.py
|
[RLlib] Try to checkpoint a durable policy name (#27016)
|
2022-07-27 00:01:14 -07:00 |
policy_map.py
|
[RLlib] Checkpoint and restore connectors. (#26253)
|
2022-07-09 01:06:24 -07:00 |
policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
rnn_sequencing.py
|
[RLlib] Fix docstring and add unit tests for rnn sequencing. (#26197)
|
2022-07-06 14:32:57 +02:00 |
sample_batch.py
|
[RLlib] Get rid of all these deprecation warnings. (#27085)
|
2022-07-27 10:48:54 -07:00 |
tf_mixins.py
|
[RLlib] Unify gnorm mixin for tf and torch policies. (#26102)
|
2022-07-24 15:31:09 +02:00 |
tf_policy.py
|
[RLlib] more connector polishes and fixes. (#26645)
|
2022-07-19 08:50:28 -07:00 |
tf_policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
torch_mixins.py
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
torch_policy.py
|
[RLlib] more connector polishes and fixes. (#26645)
|
2022-07-19 08:50:28 -07:00 |
torch_policy_template.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
torch_policy_v2.py
|
[RLlib] more connector polishes and fixes. (#26645)
|
2022-07-19 08:50:28 -07:00 |
view_requirement.py
|
[RLlib] Simplify agent collector (#26803)
|
2022-07-25 13:17:17 -07:00 |