Carlo Grisetti
|
a8286c55af
|
[RLLib] Fix deprecated convert_to_non_torch_type (#20751)
|
2021-12-09 14:42:12 +01:00 |
|
Jun Gong
|
2317c693cf
|
[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)
|
2021-12-02 13:11:26 +01:00 |
|
Sven Mika
|
cf21c634a3
|
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
2021-11-03 10:00:46 +01:00 |
|
Sven Mika
|
b4300dd532
|
[RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. (#18937)
|
2021-10-04 13:29:00 +02:00 |
|
Sven Mika
|
ed85f59194
|
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
2021-09-30 16:39:05 +02:00 |
|
Sven Mika
|
7f2b3c0824
|
[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). (#17742)
|
2021-08-11 18:30:21 +02:00 |
|
Sven Mika
|
811d71b368
|
[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657)
|
2021-08-11 12:44:35 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
2900a06dd7
|
[RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427)
|
2021-06-18 17:27:29 +02:00 |
|
Sven Mika
|
839fc59224
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
|
Sven Mika
|
469f5227da
|
[RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814)
|
2021-05-16 15:21:06 +02:00 |
|
Sven Mika
|
c4a3e1589b
|
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761)
|
2021-05-13 09:17:23 +02:00 |
|
Michael Luo
|
4cbe13cdfd
|
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
Co-authored-by: Sven Mika <sven@anyscale.io>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
|
2021-05-04 19:06:19 +02:00 |
|
Michael Luo
|
ec2c10309b
|
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
2021-02-22 17:30:18 +01:00 |
|
Sven Mika
|
2e3655e8a9
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
|
Michael Luo
|
42cd414e5b
|
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
2020-12-30 10:11:57 -05:00 |
|