..
a3c
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
ars
[RLlib] Custom view requirements (e.g. for prev-n-obs) work with compute_single_action
and compute_actions_from_input_dict
. ( #18921 )
2021-09-30 15:03:37 +02:00
cql
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
ddpg
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests ( #19809 )
2021-10-28 18:06:47 +02:00
dqn
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests ( #19809 )
2021-10-28 18:06:47 +02:00
dreamer
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
es
[RLlib] DDPPO fixes and benchmarks. ( #18390 )
2021-09-08 19:39:01 +02:00
impala
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
maml
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
marwil
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
mbmpo
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
pg
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). ( #19693 )
2021-10-25 15:00:00 +02:00
ppo
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
qmix
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests ( #19809 )
2021-10-28 18:06:47 +02:00
sac
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests ( #19809 )
2021-10-28 18:06:47 +02:00
slateq
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
tests
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. ( #18879 )
2021-09-30 16:39:05 +02:00
__init__.py
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. ( #15815 )
2021-05-18 13:23:00 +02:00
callbacks.py
[RLlib; Docs overhaul] Docstring cleanup: Environments. ( #19784 )
2021-10-29 10:46:52 +02:00
mock.py
[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). ( #17331 )
2021-07-26 10:52:55 -04:00
registry.py
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. ( #17530 )
2021-08-03 18:30:02 -04:00
trainer.py
Revert "[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. ( #19758 )" ( #19806 )
2021-10-27 23:30:07 +02:00
trainer_template.py
Revert "[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. ( #19758 )" ( #19806 )
2021-10-27 23:30:07 +02:00