ray/rllib/algorithms
2022-06-22 09:43:10 +02:00
a2c Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
a3c [RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924) 2022-06-20 19:53:47 +02:00
alpha_star [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
alpha_zero [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
apex_ddpg [RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924) 2022-06-20 19:53:47 +02:00
apex_dqn [RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924) 2022-06-20 19:53:47 +02:00
appo [RLlib] Aggregate Impala learner info. (#25856) 2022-06-22 09:43:10 +02:00
ars [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
bandit [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
bc [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
cql [RLlib] Move offline input into replay buffer using rollout ops in CQL. (#25629) 2022-06-17 17:08:55 +02:00
crr [RLlib] Add compute log likelihoods test for CRR. (#25905) 2022-06-21 16:06:10 +02:00
ddpg [RLlib] Fix DDPG test ignoring framework_iterator-modified config. (#25913) 2022-06-21 16:17:42 +02:00
ddppo Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
dqn [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
dreamer Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
es [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
impala [RLlib] Aggregate Impala learner info. (#25856) 2022-06-22 09:43:10 +02:00
maddpg Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
maml [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
marwil [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
mbmpo [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
pg Revert "[RLlib] Remove execution plan code no longer used by RLlib. (#25624)" (#25776) 2022-06-14 13:59:15 -07:00
ppo [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
qmix [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
r2d2 [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
sac [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
simple_q [RLlib] SimpleQ PolicyV2 (sub-classing). (#25871) 2022-06-17 20:12:16 +02:00
slateq [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
td3 [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
tests [RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924) 2022-06-20 19:53:47 +02:00
__init__.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
algorithm.py [RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924) 2022-06-20 19:53:47 +02:00
algorithm_config.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
callbacks.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
mock.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
registry.py [RLlib] Fixes logging of all of RLlib's Algorithm names as warning messages. (#25840) 2022-06-17 08:41:18 +02:00