ray/rllib/contrib/alpha_zero/core
2021-12-04 22:05:26 +01:00
__init__.py [RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) 2020-02-15 14:50:44 -08:00
alpha_zero_policy.py Revert "Revert [RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2021-11-16 14:49:41 +01:00
alpha_zero_trainer.py [RLlib] Sub-class Trainer (instead of build_trainer()): All remaining classes; soft-deprecate build_trainer. (#20725) 2021-12-04 22:05:26 +01:00
mcts.py Remove (object) from class declarations. (#6658) 2020-01-02 17:42:13 -08:00
ranked_rewards.py AlphaZero and Ranked reward implementation (#6385) 2019-12-07 12:08:40 -08:00