Artur Niederfahrenhorst
|
9a64bd4e9b
|
[RLlib] Simple-Q uses training iteration fn (instead of execution_plan); ReplayBuffer API for Simple-Q (#22842)
|
2022-03-29 14:44:40 +02:00 |
|
Sven Mika
|
93c0a5549b
|
[RLlib] Deprecate vf_share_layers in top-level PPO/MAML/MB-MPO configs. (#13397)
|
2021-01-19 09:51:35 +01:00 |
|
Edward Oakes
|
32d159a2ed
|
Fix release directory & RELEASE_PROCESS.md (#12269)
|
2020-11-23 14:28:59 -06:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Eric Liang
|
dd70720578
|
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
* bulk rename
* deprecation warn
* update doc
* update fig
* line length
* rename
* make pytest comptaible
* fix test
* fi sys
* rename
* wip
* fix more
* lint
* update svg
* comments
* lint
* fix use of batch steps
|
2020-03-14 12:05:04 -07:00 |
|
Sven Mika
|
6043ce710d
|
Fix old exploration configs. (#7240)
|
2020-02-20 08:39:16 -08:00 |
|
Sven Mika
|
6e1c3ea824
|
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
2020-02-10 15:22:07 -08:00 |
|
Sven Mika
|
c957ed58ed
|
[RLlib] Implement PPO torch version. (#6826)
|
2020-01-20 23:06:50 -08:00 |
|
Eric Liang
|
a229bdf272
|
[rllib] Deprecate custom preprocessors (#6833)
* deprecation warnings
* add log warn
* fix test
|
2020-01-18 23:30:09 -08:00 |
|
Eric Liang
|
b924299833
|
Add large scale regression test for RLlib (#6093)
|
2019-11-13 12:22:55 -08:00 |
|