ray/rllib/contrib/bandits/agents
2021-12-04 22:05:26 +01:00
__init__.py | Contextual Bandit algorithms (WIP) (#7642) | 2020-03-26 13:41:16 -07:00
lin_ts.py | [RLlib] Sub-class Trainer (instead of build_trainer()): All remaining classes; soft-deprecate build_trainer. (#20725) | 2021-12-04 22:05:26 +01:00
lin_ucb.py | [RLlib] Sub-class Trainer (instead of build_trainer()): All remaining classes; soft-deprecate build_trainer. (#20725) | 2021-12-04 22:05:26 +01:00
policy.py | [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) | 2021-09-30 16:39:05 +02:00