ray/rllib/tuned_examples/slateq at e643b75129fa7b71c6aa62007665ab679cdc610e - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-05 18:11:42 -05:00

History

Artur Niederfahrenhorst 0dceddb912 [RLlib] Move learning_starts logic from buffers into `training_step()`. (#26032 )		2022-08-11 13:07:30 +02:00
..
interest-evolution-10-candidates-recsim-env-slateq-fake-gpus.yaml	[RLlib] Move learning_starts logic from buffers into `training_step()`. (#26032 )	2022-08-11 13:07:30 +02:00
interest-evolution-10-candidates-recsim-env-slateq.yaml	[RLlib] Move learning_starts logic from buffers into `training_step()`. (#26032 )	2022-08-11 13:07:30 +02:00
interest-evolution-50-candidates-recsim-env-slateq.yaml	[RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544 )	2022-02-25 21:58:16 +01:00
long-term-satisfaction-recsim-env-slateq.yaml	[RLlib] SlateQ (tf GPU + multi-GPU) + Bandit fixes (#23276 )	2022-03-18 13:45:16 +01:00
parametric-item-reco-env-slateq.yaml	[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389 )	2022-02-22 09:36:44 +01:00
recomm-sys001-slateq.yaml	[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389 )	2022-02-22 09:36:44 +01:00