.. |
a3c
|
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
2020-11-12 16:27:34 +01:00 |
ars
|
[RLlib] Trajectory view API: enable by default for ES and ARS (#11826)
|
2020-11-12 10:33:10 -08:00 |
ddpg
|
[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945)
|
2020-11-29 12:31:24 +01:00 |
dqn
|
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
2020-11-27 16:25:47 -08:00 |
dreamer
|
[rllib] Forgot to pass ioctx to child json readers (#11839)
|
2020-11-05 22:07:57 -08:00 |
es
|
[RLlib] Trajectory view API: enable by default for ES and ARS (#11826)
|
2020-11-12 10:33:10 -08:00 |
impala
|
[RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397)
|
2020-11-25 11:27:46 -08:00 |
maml
|
[RLLib] MAML extension for all models except RNNs (#11337)
|
2020-11-12 16:51:40 -08:00 |
marwil
|
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
2020-11-27 16:25:47 -08:00 |
mbmpo
|
MBMPO Cartpole (#11832)
|
2020-11-12 10:30:41 -08:00 |
pg
|
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
2020-11-12 16:27:34 +01:00 |
ppo
|
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
2020-11-27 16:25:47 -08:00 |
qmix
|
[RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609)
|
2020-10-27 10:00:24 +01:00 |
sac
|
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
2020-11-27 16:25:47 -08:00 |
slateq
|
[RLlib] Implement the SlateQ algorithm (#11450)
|
2020-11-03 09:52:04 +01:00 |
__init__.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
callbacks.py
|
[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)
|
2020-11-19 19:01:14 +01:00 |
mock.py
|
[tune] Use public methods for trainable (#9184)
|
2020-07-01 11:00:00 -07:00 |
registry.py
|
[RLlib] Implement the SlateQ algorithm (#11450)
|
2020-11-03 09:52:04 +01:00 |
trainer.py
|
[RLLib] Fix HyperOptSearch tuple to list conversion (#12462)
|
2020-11-28 10:07:54 -08:00 |
trainer_template.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |