.. |
a2c
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
a3c
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
alpha_star
|
[RLlib] Move IMPALA and APPO back to exec plan (for now; due to unresolved learning/performance issues). (#25851)
|
2022-06-29 08:41:47 +02:00 |
alpha_zero
|
[RLlib] Make QMix use the ReplayBufferAPI (#25560)
|
2022-06-23 22:55:22 -07:00 |
apex_ddpg
|
[RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924)
|
2022-06-20 19:53:47 +02:00 |
apex_dqn
|
[RLlib] Eval WorkerSet crashes when trying to re-add a failed worker (eval set does not have local worker). (#26134)
|
2022-06-30 13:25:22 +02:00 |
appo
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
ars
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
bandit
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
bc
|
[RLlib]: Move OPE to evaluation config (#25911)
|
2022-07-12 11:04:34 -07:00 |
cql
|
[RLlib]: Move OPE to evaluation config (#25911)
|
2022-07-12 11:04:34 -07:00 |
crr
|
[RLlib] Make Dataset reader default reader and enable CRR to use dataset (#26304)
|
2022-07-08 12:43:35 -07:00 |
ddpg
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
ddppo
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
dqn
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
dreamer
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
es
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
impala
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
maddpg
|
[RLlib] Save serialized PolicySpec. Extract num_gpus related logics into a util function. (#25954)
|
2022-06-30 11:38:21 +02:00 |
maml
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
marwil
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
mbmpo
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
pg
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
ppo
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
qmix
|
[RLlib] Make QMix use the ReplayBufferAPI (#25560)
|
2022-06-23 22:55:22 -07:00 |
r2d2
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
sac
|
[RLlib] Migrating DDPG to PolicyV2. (#26054)
|
2022-06-28 15:52:56 +02:00 |
simple_q
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
slateq
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
td3
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
tests
|
[RLlib] Eval WorkerSet crashes when trying to re-add a failed worker (eval set does not have local worker). (#26134)
|
2022-06-30 13:25:22 +02:00 |
__init__.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
algorithm.py
|
[RLlib]: Move OPE to evaluation config (#25911)
|
2022-07-12 11:04:34 -07:00 |
algorithm_config.py
|
[RLlib]: Move OPE to evaluation config (#25911)
|
2022-07-12 11:04:34 -07:00 |
callbacks.py
|
[RLlib] Fix all the erroneous on_trainer_init warning. (#26433)
|
2022-07-13 18:56:01 +02:00 |
mock.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
registry.py
|
[RLlib] Fixes logging of all of RLlib's Algorithm names as warning messages. (#25840)
|
2022-06-17 08:41:18 +02:00 |