.. |
a3c
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
ars
|
[RLlib] Custom view requirements (e.g. for prev-n-obs) work with compute_single_action and compute_actions_from_input_dict . (#18921)
|
2021-09-30 15:03:37 +02:00 |
cql
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
ddpg
|
[RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. (#18937)
|
2021-10-04 13:29:00 +02:00 |
dqn
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
dreamer
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
es
|
[RLlib] DDPPO fixes and benchmarks. (#18390)
|
2021-09-08 19:39:01 +02:00 |
impala
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
maml
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
marwil
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
mbmpo
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
pg
|
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693)
|
2021-10-25 15:00:00 +02:00 |
ppo
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
qmix
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
sac
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
slateq
|
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
2021-10-26 20:56:02 +02:00 |
tests
|
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
2021-09-30 16:39:05 +02:00 |
__init__.py
|
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815)
|
2021-05-18 13:23:00 +02:00 |
callbacks.py
|
[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. (#19758)
|
2021-10-27 19:15:35 +02:00 |
mock.py
|
[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). (#17331)
|
2021-07-26 10:52:55 -04:00 |
registry.py
|
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530)
|
2021-08-03 18:30:02 -04:00 |
trainer.py
|
[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. (#19758)
|
2021-10-27 19:15:35 +02:00 |
trainer_template.py
|
[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. (#19758)
|
2021-10-27 19:15:35 +02:00 |