ray/rllib/contrib/bandits/agents
2021-09-30 16:39:05 +02:00
..
__init__.py Contextual Bandit algorithms (WIP) (#7642) 2020-03-26 13:41:16 -07:00
lin_ts.py [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) 2020-05-27 16:19:13 +02:00
lin_ucb.py [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) 2020-05-27 16:19:13 +02:00
policy.py [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 2021-09-30 16:39:05 +02:00