Sven Mika
|
839fc59224
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
|
Michael Luo
|
4cbe13cdfd
|
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
Co-authored-by: Sven Mika <sven@anyscale.io>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
|
2021-05-04 19:06:19 +02:00 |
|
Michael Luo
|
ec2c10309b
|
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
2021-02-22 17:30:18 +01:00 |
|
Michael Luo
|
587f207c2f
|
[RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550)
|
2021-01-21 16:43:55 +01:00 |
|
Michael Luo
|
42cd414e5b
|
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
2020-12-30 10:11:57 -05:00 |
|