ray/rllib/agents/marwil
Michael Luo 4cbe13cdfd
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
Co-authored-by: Sven Mika <sven@anyscale.io>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-05-04 19:06:19 +02:00
..
tests [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 2021-05-04 19:06:19 +02:00
__init__.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
bc.py [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) 2020-12-27 09:46:03 -05:00
marwil.py [RLlib] Issue 14523: Torch + py3.8 leads to GPU device error. (#15014) 2021-03-30 21:43:11 +02:00
marwil_tf_policy.py [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) 2021-04-27 10:44:54 +02:00
marwil_torch_policy.py [RLlib] Issue 14523: Torch + py3.8 leads to GPU device error. (#15014) 2021-03-30 21:43:11 +02:00