Chace Ashcraft
|
ebeee1d59a
|
[RLlib] Pytorch MAML fix for more than two workers with discrete actions (#13835)
|
2021-02-08 12:06:02 +01:00 |
|
Sven Mika
|
2e3655e8a9
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
|
Sven Mika
|
99ae7bae05
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
|
Michael Luo
|
59bc1e6c09
|
[RLLib] MAML extension for all models except RNNs (#11337)
|
2020-11-12 16:51:40 -08:00 |
|
Sven Mika
|
62c7ab5182
|
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
2020-11-12 16:27:34 +01:00 |
|
Michael Luo
|
8e613652af
|
[RLLib] MBMPO Fixes (#10296)
|
2020-09-09 09:34:34 +02:00 |
|
Michael Luo
|
4d7bd8c892
|
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)
|
2020-08-02 18:12:09 +02:00 |
|
Michael Luo
|
851d02463b
|
[Doc] RLlib Algorithms Documentation: MAML + PyTorch MAML (#9189)
|
2020-07-03 11:05:15 -07:00 |
|