hiro/ray
Mirror of https://github.com/vale981/ray, synced 2025-03-08 19:41:38 -05:00
a43193b9e5
ray/rllib/contrib/bandits
Sven Mika
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
2021-09-30 16:39:05 +02:00
agents
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
2021-09-30 16:39:05 +02:00
envs
[RLlib] Unity3d soccer benchmarks (#8834)
2020-06-11 14:29:57 +02:00
examples
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
2021-09-30 16:39:05 +02:00
models
[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.
2021-06-15 13:30:31 +02:00
__init__.py
Contextual Bandit algorithms (WIP) (#7642)
2020-03-26 13:41:16 -07:00
exploration.py
[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.
2021-06-15 13:30:31 +02:00