ray/rllib/algorithms
2022-06-20 15:54:00 +02:00
a2c Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
a3c [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
alpha_star [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
alpha_zero [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
apex_ddpg [RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848) 2022-06-17 14:10:36 +02:00
apex_dqn [RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848) 2022-06-17 14:10:36 +02:00
appo [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
ars [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
bandit [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
bc [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
cql [RLlib] Move offline input into replay buffer using rollout ops in CQL. (#25629) 2022-06-17 17:08:55 +02:00
crr [RLlib] CRR documentation. (#25667) 2022-06-14 12:45:36 +02:00
ddpg [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
ddppo Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
dqn [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
dreamer Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
es [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
impala [RLlib] IMPALA/APPO multi-agent mix-in-buffer fixes (plus MA learning tests). (#25848) 2022-06-17 14:10:36 +02:00
maddpg Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
maml [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
marwil [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
mbmpo [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
pg Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
ppo [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
qmix [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
r2d2 [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
sac [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
simple_q [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
slateq [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
td3 [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
tests [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
__init__.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
algorithm.py [tune] Refactor Syncer / deprecate Sync client (#25655) 2022-06-14 14:46:30 +02:00
algorithm_config.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
callbacks.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
mock.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
registry.py [RLlib] Fixes logging of all of RLlib's Algorithm names as warning messages. (#25840) 2022-06-17 08:41:18 +02:00