ray/rllib/agents/slateq
2022-05-02 12:51:14 +02:00
..
tests [RLlib] SlateQ: framework=tf fixes and SlateQ documentation update (#22543) 2022-02-23 13:03:45 +01:00
__init__.py [RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389) 2022-02-22 09:36:44 +01:00
slateq.py [RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. (#24372) 2022-05-02 12:51:14 +02:00
slateq_tf_model.py [RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544) 2022-02-25 21:58:16 +01:00
slateq_tf_policy.py [RLlib] SlateQ training iteration function. (#24151) 2022-04-29 18:38:17 +02:00
slateq_torch_model.py [RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544) 2022-02-25 21:58:16 +01:00
slateq_torch_policy.py [RLlib] SlateQ training iteration function. (#24151) 2022-04-29 18:38:17 +02:00