.. |
a3c
|
[RLlib] Trainer sub-class A2C/A3C (instead of build_trainer ). (#20635)
|
2021-11-24 22:07:13 +01:00 |
ars
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
cql
|
[RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541)
|
2021-11-22 21:46:45 +01:00 |
ddpg
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
dqn
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
dreamer
|
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
2021-11-03 10:00:46 +01:00 |
es
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
impala
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
maml
|
Revert "Revert [RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
|
2021-11-16 14:49:41 +01:00 |
marwil
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
mbmpo
|
Revert "Revert [RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
|
2021-11-16 14:49:41 +01:00 |
pg
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
ppo
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
qmix
|
[RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408)
|
2021-11-05 14:39:28 +01:00 |
sac
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
slateq
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
tests
|
[RLlib] Trainer sub-class for APPO (instead of using build_trainer() ). (#20424)
|
2021-11-22 22:14:21 +01:00 |
__init__.py
|
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815)
|
2021-05-18 13:23:00 +02:00 |
callbacks.py
|
[RLlib] Add a comment in the doc string of on_learn_on_batch callback function. (#20456)
|
2021-11-19 10:49:07 +01:00 |
mock.py
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
registry.py
|
[RLlib] Trainer sub-class PPO/DDPPO (instead of build_trainer() ). (#20571)
|
2021-11-23 23:01:05 +01:00 |
trainer.py
|
[RLlib] Trainer sub-class A2C/A3C (instead of build_trainer ). (#20635)
|
2021-11-24 22:07:13 +01:00 |
trainer_template.py
|
[RLlib] Trainer sub-class for APPO (instead of using build_trainer() ). (#20424)
|
2021-11-22 22:14:21 +01:00 |