.. |
a2c
|
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). (#25314)
|
2022-06-01 09:29:16 +02:00 |
a3c
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
alpha_star
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
alpha_zero
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
apex_ddpg
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
apex_dqn
|
[tune] Custom resources per worker added to default_resource_request (#24463)
|
2022-06-06 22:41:02 +01:00 |
appo
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
ars
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
bandit
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
bc
|
[RLlib]: Doubly Robust Off-Policy Evaluation. (#25056)
|
2022-06-07 12:52:19 +02:00 |
cql
|
[RLlib]: Doubly Robust Off-Policy Evaluation. (#25056)
|
2022-06-07 12:52:19 +02:00 |
crr
|
[RLlib] CRR Tests fixes. (#25586)
|
2022-06-08 19:18:55 +02:00 |
ddpg
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
ddppo
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
dqn
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
dreamer
|
[RLlib] Dreamer Policy sub-classing schema. (#25585)
|
2022-06-09 17:14:15 +02:00 |
es
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
impala
|
[tune] Custom resources per worker added to default_resource_request (#24463)
|
2022-06-06 22:41:02 +01:00 |
maddpg
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
maml
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
marwil
|
[RLlib]: Doubly Robust Off-Policy Evaluation. (#25056)
|
2022-06-07 12:52:19 +02:00 |
mbmpo
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
pg
|
[RLlib] PG policy subclassing conversion. (#25288)
|
2022-06-06 13:07:47 +02:00 |
ppo
|
[RLlib] PG policy subclassing conversion. (#25288)
|
2022-06-06 13:07:47 +02:00 |
qmix
|
[RLlib] Issue 4965: Fixes PyTorch grad clipping logic and adds grad clipping to QMIX. (#25584)
|
2022-06-08 19:40:57 +02:00 |
r2d2
|
[RLlib] Better default values for training_intensity and target_network_update_freq for R2D2. (#25510)
|
2022-06-07 10:29:56 +02:00 |
sac
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
simple_q
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
slateq
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
td3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
__init__.py
|
[RLlib] Moved agents.es to algorithms.es (#24511)
|
2022-05-06 14:54:22 +02:00 |