..
a3c
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
ars
[RLlib] Custom view requirements (e.g. for prev-n-obs) work with compute_single_action
and compute_actions_from_input_dict
. ( #18921 )
2021-09-30 15:03:37 +02:00
cql
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
ddpg
[RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. ( #18937 )
2021-10-04 13:29:00 +02:00
dqn
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
dreamer
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
es
[RLlib] DDPPO fixes and benchmarks. ( #18390 )
2021-09-08 19:39:01 +02:00
impala
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
maml
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
marwil
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
mbmpo
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
pg
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). ( #19693 )
2021-10-25 15:00:00 +02:00
ppo
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
qmix
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
sac
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
slateq
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
tests
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. ( #18879 )
2021-09-30 16:39:05 +02:00
__init__.py
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. ( #15815 )
2021-05-18 13:23:00 +02:00
callbacks.py
[RLlib] Add policies
arg to callback: on_episode_step
(already exists in all other episode-related callbacks) ( #18119 )
2021-08-27 16:12:19 +02:00
mock.py
[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). ( #17331 )
2021-07-26 10:52:55 -04:00
registry.py
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. ( #17530 )
2021-08-03 18:30:02 -04:00
trainer.py
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00
trainer_template.py
[RLlib] Unify the way we create local replay buffer for all agents ( #19627 )
2021-10-26 20:56:02 +02:00