Name | Last commit | Date
a2c | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
a3c | [RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314) | 2022-06-01 09:29:16 +02:00
alpha_star | [RLlib] MB-MPO TrainerConfig objects. (#25278) | 2022-05-30 17:33:01 +02:00
alpha_zero | [RLlib] AlphaZero TrainerConfig objects. (#25256) | 2022-05-30 15:37:58 +02:00
ars | [RLlib] Upgrade gym 0.23 (#24171) | 2022-05-23 08:18:44 +02:00
bandit | [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) | 2022-05-19 18:30:42 +02:00
cql | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
ddpg | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
dqn | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
dreamer | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
es | [RLlib] Upgrade gym 0.23 (#24171) | 2022-05-23 08:18:44 +02:00
maddpg | [RLlib] MA-DDPG TrainerConfig objects. (#25255) | 2022-05-30 15:38:24 +02:00
maml | [RLlib] MB-MPO TrainerConfig objects. (#25278) | 2022-05-30 17:33:01 +02:00
marwil | [RLlib]: Rename input_evaluation to off_policy_estimation_methods. (#25107) | 2022-05-27 13:14:54 +02:00
mbmpo | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
pg | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
qmix | [RLlib] SAC, RNNSAC, and CQL TrainerConfig objects (#25059) | 2022-05-22 19:58:47 +02:00
sac | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
slateq | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
__init__.py | [RLlib] Moved agents.es to algorithms.es (#24511) | 2022-05-06 14:54:22 +02:00