.. |
a2c
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
|
2022-06-14 13:59:15 -07:00 |
a3c
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
alpha_star
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
alpha_zero
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
apex_ddpg
|
[RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848)
|
2022-06-17 14:10:36 +02:00 |
apex_dqn
|
[RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848)
|
2022-06-17 14:10:36 +02:00 |
appo
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
ars
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
bandit
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
bc
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
cql
|
[RLlib] Move offline input into replay buffer using rollout ops in CQL. (#25629)
|
2022-06-17 17:08:55 +02:00 |
crr
|
[RLlib] CRR documentation. (#25667)
|
2022-06-14 12:45:36 +02:00 |
ddpg
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
ddppo
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
|
2022-06-14 13:59:15 -07:00 |
dqn
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
dreamer
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
|
2022-06-14 13:59:15 -07:00 |
es
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
impala
|
[RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848)
|
2022-06-17 14:10:36 +02:00 |
maddpg
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
|
2022-06-14 13:59:15 -07:00 |
maml
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
marwil
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
mbmpo
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
pg
|
Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776)
|
2022-06-14 13:59:15 -07:00 |
ppo
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
qmix
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
r2d2
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
sac
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
simple_q
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
slateq
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
td3
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
tests
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
__init__.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
algorithm.py
|
[tune] Refactor Syncer / deprecate Sync client (#25655)
|
2022-06-14 14:46:30 +02:00 |
algorithm_config.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
callbacks.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
mock.py
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
registry.py
|
[RLlib] Fixes logging of all of RLlib's Algorithm names as warning messages. (#25840)
|
2022-06-17 08:41:18 +02:00 |