a3c | [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) | 2020-10-01 16:57:10 +02:00
ars | [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) | 2020-09-06 10:58:00 +02:00
ddpg | fix incorrect critic loss in TD3 (#10775) | 2020-09-20 20:01:51 -07:00
dqn | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00
dreamer | [RLlib] Dreamer (#10172) | 2020-08-26 13:24:05 +02:00
es | [rllib] Fix torch TD error, IMPALA LR updates (#9477) | 2020-07-23 12:50:25 -07:00
impala | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00
maml | Cartpole MAML + Discrete (#11028) | 2020-10-02 12:56:34 +02:00
marwil | add large data warning (#10957) | 2020-09-23 15:46:06 -07:00
mbmpo | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00
pg | [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) | 2020-10-01 16:57:10 +02:00
ppo | [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) | 2020-10-01 16:57:10 +02:00
qmix | add large data warning (#10957) | 2020-09-23 15:46:06 -07:00
sac | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00
__init__.py | [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) | 2020-09-06 10:58:00 +02:00
callbacks.py | [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) | 2020-09-03 17:27:05 +02:00
mock.py | [tune] Use public methods for trainable (#9184) | 2020-07-01 11:00:00 -07:00
registry.py | [RLlib] Behavioral Cloning (from MARWIL). (#10619) | 2020-09-09 17:33:21 +02:00
trainer.py | [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) | 2020-10-01 16:57:10 +02:00
trainer_template.py | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00