ray/rllib/algorithms
2022-08-17 00:28:00 -07:00
..
a2c [RLlib] Fix A2C release tests (#27314) 2022-08-02 10:44:52 -07:00
a3c [RLlib] Revert 41c9ef70. (#27243) 2022-07-29 11:05:15 -07:00
alpha_star [RLlib] Move IMPALA and APPO back to exec plan (for now; due to unresolved learning/performance issues). (#25851) 2022-06-29 08:41:47 +02:00
alpha_zero [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
apex_ddpg [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
apex_dqn [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
appo [RLlib] Unify gnorm mixin for tf and torch policies. (#26102) 2022-07-24 15:31:09 +02:00
ars [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
bandit [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
bc [RLlib]: Raise deprecation warning in MARWIL OPE methods. (#26893) 2022-07-23 13:55:40 +02:00
cql [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
crr [RLlib] CRR framework torch by default. (#27161) 2022-08-09 16:53:00 +02:00
ddpg [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
ddppo [RLlib] Cleanup some deprecated metric keys and classes. (#26036) 2022-06-23 21:30:01 +02:00
dqn [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
dreamer [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
dt [RLlib] Add DTTorchPolicy (#27889) 2022-08-17 00:28:00 -07:00
es [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
impala [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
maddpg [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
maml [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
marwil [RLlib] Deflake MARWIL and BC and remove memory leak from torch MARWIL policy (#27406) 2022-08-03 16:53:12 -07:00
mbmpo [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
pg [RLlib] Fix a bunch of issues related to connectors. (#26510) 2022-07-13 18:55:20 +02:00
ppo [RLlib] Fix dqn reproducibility (#27459) 2022-08-09 15:56:44 -07:00
qmix [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
r2d2 [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
sac [RLlib] Fix SAC config parameter that is not used. (#27741) 2022-08-11 18:57:55 +02:00
simple_q [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
slateq Fix grammer of error message (#27900) 2022-08-16 11:26:03 -07:00
td3 [RLlib] Move learning_starts logic from buffers into training_step(). (#26032) 2022-08-11 13:07:30 +02:00
tests [RLlib] Eval workers use async req manager. (#27390) 2022-08-16 12:05:55 +02:00
__init__.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
algorithm.py [RLlib] Eval workers use async req manager. (#27390) 2022-08-16 12:05:55 +02:00
algorithm_config.py [RLlib] Eval workers use async req manager. (#27390) 2022-08-16 12:05:55 +02:00
callbacks.py [RLlib] more connector polishes and fixes. (#26645) 2022-07-19 08:50:28 -07:00
mock.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
registry.py [RLlib] Try to checkpoint a durable policy name (#27016) 2022-07-27 00:01:14 -07:00