ray/rllib/agents/bandit
2022-05-02 21:14:08 +02:00
..
tests [RLlib] Removed deprecated code with error=True (#23916) 2022-04-15 13:51:12 +02:00
__init__.py [RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; (#21773) 2022-01-27 13:58:12 +01:00
bandit.py [RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting. (#24372) 2022-05-02 12:51:14 +02:00
bandit_tf_model.py [RLlib] Change type to tensortype for cql policies. (#23438) 2022-03-24 12:32:29 +01:00
bandit_tf_policy.py [RLlib] Issue 24075: Better error message for Bandit MultiDiscrete (suggest using our wrapper). (#24385) 2022-05-02 21:14:08 +02:00
bandit_torch_model.py [RLlib] TF2 Bandit Agent (#22838) 2022-03-21 16:55:55 +01:00
bandit_torch_policy.py [RLlib] Issue 24075: Better error message for Bandit MultiDiscrete (suggest using our wrapper). (#24385) 2022-05-02 21:14:08 +02:00