ray/rllib/contrib/bandits/agents
2020-05-21 10:16:18 -07:00
__init__.py  Contextual Bandit algorithms (WIP) (#7642)   2020-03-26 13:41:16 -07:00
lin_ts.py    [rllib] Deprecate policy optimizers (#8345)  2020-05-21 10:16:18 -07:00
lin_ucb.py   [rllib] Deprecate policy optimizers (#8345)  2020-05-21 10:16:18 -07:00
policy.py    Contextual Bandit algorithms (WIP) (#7642)   2020-03-26 13:41:16 -07:00