.. |
a3c | [RLlib] Mask out padded values for A3C loss with recurrent policy (#15525) | 2021-04-27 08:36:04 +02:00 |
ars | [RLlib] Multi-GPU support for Torch algorithms. (#14709) | 2021-04-16 09:16:24 +02:00 |
cql | [RLlib] Remove all (already soft-deprecated) SampleBatch.data from code. (#15335) | 2021-04-15 19:19:51 +02:00 |
ddpg | [Rllib] Offline Learning Bug, different shapes (#15132) | 2021-04-27 17:18:17 +02:00 |
dqn | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00 |
dreamer | [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) | 2021-03-08 15:41:27 +01:00 |
es | [RLlib] Multi-GPU support for Torch algorithms. (#14709) | 2021-04-16 09:16:24 +02:00 |
impala | [RLlib] Discussion 1928: Initial lr wrong if schedule used that includes ts=0 (both tf and torch). (#15538) | 2021-04-27 17:19:52 +02:00 |
maml | [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) | 2021-03-08 15:41:27 +01:00 |
marwil | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00 |
mbmpo | [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) | 2021-03-08 15:41:27 +01:00 |
pg | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00 |
ppo | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00 |
qmix | [RLlib] Multi-GPU support for Torch algorithms. (#14709) | 2021-04-16 09:16:24 +02:00 |
sac | [Rllib] Offline Learning Bug, different shapes (#15132) | 2021-04-27 17:18:17 +02:00 |
slateq | [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) | 2021-03-08 15:41:27 +01:00 |
__init__.py | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00 |
callbacks.py | [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) | 2021-02-08 15:02:19 +01:00 |
mock.py | Auto report object store memory usage; remove some deprecated code (#14260) | 2021-03-01 13:19:44 -08:00 |
registry.py | [RLlib] R2D2 Implementation. (#13933) | 2021-02-25 12:18:11 +01:00 |
trainer.py | [RLlib] Evaluation parallel to training check, key-error hotfix (#15345) | 2021-04-27 08:38:10 +02:00 |
trainer_template.py | [RLlib] Support parallelizing evaluation and training (optional). (#15040) | 2021-04-13 09:53:35 +02:00 |