mirror of https://github.com/vale981/ray synced 2025-03-12 22:26:39 -04:00
ray/rllib/algorithms/bandit
2022-06-11 15:10:39 +02:00
tests [RLlib] Trainer to Algorithm renaming. () 2022-06-11 15:10:39 +02:00
__init__.py [RLlib] Move all remaining algos into algorithms directory. () 2022-06-04 07:35:24 +02:00
bandit.py [RLlib] Trainer to Algorithm renaming. () 2022-06-11 15:10:39 +02:00
bandit_tf_model.py [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits () 2022-05-19 18:30:42 +02:00
bandit_tf_policy.py [RLlib] Trainer to Algorithm renaming. () 2022-06-11 15:10:39 +02:00
bandit_torch_model.py [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits () 2022-05-19 18:30:42 +02:00
bandit_torch_policy.py [RLlib] Bandits (torch) Policy sub-class. () 2022-06-02 15:16:51 +02:00