ray/rllib/tuned_examples/cql at 092598774a434e5602c8c7b6cc24ed8831f8b776 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-13 22:56:38 -04:00

History

gjoliver e7f9e8ceec [RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541 ) * Fix trainer timestep reporting for offline agents like CQL. * wip. * extend timesteps_total to 200K for learning_tests_pendulum_cql test Co-authored-by: sven1977 <svenmika1977@gmail.com>		2021-11-22 21:46:45 +01:00
..
halfcheetah-bc.yaml	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 )	2021-05-04 19:06:19 +02:00
halfcheetah-cql.yaml	[RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381 )	2021-09-07 11:48:41 +02:00
hopper-bc.yaml	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 )	2021-05-04 19:06:19 +02:00
hopper-cql.yaml	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 )	2021-05-04 19:06:19 +02:00
pendulum-cql.yaml	[RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541 )	2021-11-22 21:46:45 +01:00