ray/rllib/agents/slateq
2021-11-19 11:57:37 +01:00
..
__init__.py [RLlib] Implement the SlateQ algorithm (#11450) 2020-11-03 09:52:04 +01:00
slateq.py [RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552) 2021-11-19 11:57:37 +01:00
slateq_torch_policy.py [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00