This website requires JavaScript.
Explore
Help
Sign in
hiro
/
ray
Watch
1
Star
0
Fork
You've already forked ray
0
mirror of
https://github.com/vale981/ray
synced
2025-03-14 15:16:38 -04:00
Code
Issues
Projects
Releases
Packages
Wiki
Activity
Actions
b4790900f5
ray
/
rllib
/
contrib
/
bandits
/
agents
History
Sven Mika
b4790900f5
[RLlib] Sub-class
Trainer
(instead of
build_trainer()
): All remaining classes; soft-deprecate
build_trainer
. (
#20725
)
2021-12-04 22:05:26 +01:00
..
__init__.py
Contextual Bandit algorithms (WIP) (
#7642
)
2020-03-26 13:41:16 -07:00
lin_ts.py
[RLlib] Sub-class
Trainer
(instead of
build_trainer()
): All remaining classes; soft-deprecate
build_trainer
. (
#20725
)
2021-12-04 22:05:26 +01:00
lin_ucb.py
[RLlib] Sub-class
Trainer
(instead of
build_trainer()
): All remaining classes; soft-deprecate
build_trainer
. (
#20725
)
2021-12-04 22:05:26 +01:00
policy.py
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (
#18879
)
2021-09-30 16:39:05 +02:00