Sven Mika
|
7f2b3c0824
|
[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). (#17742)
|
2021-08-11 18:30:21 +02:00 |
|
Sven Mika
|
811d71b368
|
[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657)
|
2021-08-11 12:44:35 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
2900a06dd7
|
[RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427)
|
2021-06-18 17:27:29 +02:00 |
|
Sven Mika
|
839fc59224
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
|
Sven Mika
|
469f5227da
|
[RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814)
|
2021-05-16 15:21:06 +02:00 |
|
Sven Mika
|
c4a3e1589b
|
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761)
|
2021-05-13 09:17:23 +02:00 |
|
Michael Luo
|
4cbe13cdfd
|
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
Co-authored-by: Sven Mika <sven@anyscale.io>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
|
2021-05-04 19:06:19 +02:00 |
|
Michael Luo
|
ec2c10309b
|
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
2021-02-22 17:30:18 +01:00 |
|
Sven Mika
|
2e3655e8a9
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
|
Michael Luo
|
42cd414e5b
|
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
2020-12-30 10:11:57 -05:00 |
|