ray/rllib/agents/slateq
2022-05-17 13:43:49 +02:00
tests [RLlib] SlateQ config objects. (#24577) 2022-05-10 20:07:18 +02:00
__init__.py [RLlib] SlateQ config objects. (#24577) 2022-05-10 20:07:18 +02:00
slateq.py [RLlib] Replay Buffer API and Ape-X. (#24506) 2022-05-17 13:43:49 +02:00
slateq_tf_model.py [RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544) 2022-02-25 21:58:16 +01:00
slateq_tf_policy.py [RLlib] SlateQ + tf; release test fixes, related to TD-error not properly being formatted. (#24521) 2022-05-06 08:50:30 +02:00
slateq_torch_model.py [RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544) 2022-02-25 21:58:16 +01:00
slateq_torch_policy.py [RLlib] SlateQ fixes: Release learning tests wrong yaml structure + TD-error torch issue (#24429) 2022-05-04 13:37:14 +02:00