Sven Mika
|
ef18893fb5
|
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
|
2020-09-02 14:03:01 +02:00 |
|
Michael Luo
|
4d7bd8c892
|
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)
|
2020-08-02 18:12:09 +02:00 |
|
Michael Luo
|
94fcd43593
|
[rllib] MAML Transform (#9463)
* MAML Transform
* Moved Inner Adapt to Method in Execution Plan
|
2020-07-16 11:11:33 -07:00 |
|
Michael Luo
|
851d02463b
|
[Doc] RLlib Algorithms Documentation: MAML + PyTorch MAML (#9189)
|
2020-07-03 11:05:15 -07:00 |
|
Sven Mika
|
b4c0b942fe
|
[RLlib] Remove requirement for dataclasses in rllib (not supported in py3.5) (#9237)
|
2020-07-01 17:31:44 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Michael Luo
|
cf0894d396
|
[rllib] MAML Agent (#8862)
* Halfway done with transferring MAML to new Ray
* MAML Beta Out
* Debugging MAML atm
* Distributed Execution
* Pendulum Mass Working
* All experiments complete
* Cleaned up codebase
* Travis CI
* Travis CI
* Tests
* Merged conflicts
* Fixed variance bug conflict
* Comment resolved
* Apply suggestions from code review
fixed test_maml
* Update rllib/agents/maml/tests/test_maml.py
* asdf
* Fix testing
Co-authored-by: Sven Mika <sven@anyscale.io>
|
2020-06-23 09:48:23 -07:00 |
|