ray/rllib/contrib/alpha_zero/core at ed85f591949c6a989de30c3686de744860add100 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 21:06:39 -04:00

History

Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )		2021-09-30 16:39:05 +02:00
..
__init__.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178 )	2020-02-15 14:50:44 -08:00
alpha_zero_policy.py	[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879 )	2021-09-30 16:39:05 +02:00
alpha_zero_trainer.py	[RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420 )	2020-12-08 16:41:45 -08:00
mcts.py	Remove (object) from class declarations. (#6658 )	2020-01-02 17:42:13 -08:00
ranked_rewards.py	AlphaZero and Ranked reward implementation (#6385 )	2019-12-07 12:08:40 -08:00