ray/rllib/algorithms
2022-06-01 11:27:54 -07:00
a2c [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) 2022-06-01 09:29:16 +02:00
a3c [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) 2022-06-01 09:29:16 +02:00
alpha_star [RLlib] MB-MPO TrainerConfig objects. (#25278) 2022-05-30 17:33:01 +02:00
alpha_zero [RLlib] AlphaZero TrainerConfig objects. (#25256) 2022-05-30 15:37:58 +02:00
ars [RLlib] Upgrade gym 0.23 (#24171) 2022-05-23 08:18:44 +02:00
bandit [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) 2022-05-19 18:30:42 +02:00
cql Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
ddpg Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
dqn Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
dreamer Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
es [RLlib] Upgrade gym 0.23 (#24171) 2022-05-23 08:18:44 +02:00
maddpg [RLlib] MA-DDPG TrainerConfig objects. (#25255) 2022-05-30 15:38:24 +02:00
maml [RLlib] MB-MPO TrainerConfig objects. (#25278) 2022-05-30 17:33:01 +02:00
marwil [RLlib]: Rename input_evaluation to off_policy_estimation_methods. (#25107) 2022-05-27 13:14:54 +02:00
mbmpo Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
pg Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
qmix [RLlib] SAC, RNNSAC, and CQL TrainerConfig objects (#25059) 2022-05-22 19:58:47 +02:00
sac Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
slateq Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
__init__.py [RLlib] Moved agents.es to algorithms.es (#24511) 2022-05-06 14:54:22 +02:00