Sven Mika
|
a5831f9429
|
[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.
|
2021-06-15 13:30:31 +02:00 |
|
Sven Mika
|
7008902cff
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
|
Sven Mika
|
ad695a818b
|
Bug fix in the contextual bandit's linear_regression.py model. (#8815)
|
2020-06-06 22:47:42 +02:00 |
|
Sven Mika
|
d8a081a185
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
|
Sven Mika
|
c593fb09b7
|
[RLlib] Remove all f-strings to keep py3.5 compatibility.
|
2020-04-30 11:10:16 -07:00 |
|
Sven Mika
|
bf25aee392
|
[RLlib] Deprecate all Model(v1) usage. (#8146)
Deprecate all Model(v1) usage.
|
2020-04-29 12:12:59 +02:00 |
|
Saurabh Gupta
|
6ddf84b019
|
Contextual Bandit algorithms (WIP) (#7642)
|
2020-03-26 13:41:16 -07:00 |
|