mirror of https://github.com/vale981/ray synced 2025-03-13 22:56:38 -04:00
ray/rllib/tuned_examples/cql
gjoliver e7f9e8ceec
[RLlib] Report total_train_steps correctly for offline agents like CQL. ()
* Fix trainer timestep reporting for offline agents like CQL.

* wip.

* extend timesteps_total to 200K for learning_tests_pendulum_cql test

Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-11-22 21:46:45 +01:00
halfcheetah-bc.yaml [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. () 2021-05-04 19:06:19 +02:00
halfcheetah-cql.yaml [RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. () 2021-09-07 11:48:41 +02:00
hopper-bc.yaml [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. () 2021-05-04 19:06:19 +02:00
hopper-cql.yaml [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. () 2021-05-04 19:06:19 +02:00
pendulum-cql.yaml [RLlib] Report total_train_steps correctly for offline agents like CQL. () 2021-11-22 21:46:45 +01:00