konichuvak
|
13c2e13120
|
fixing polynomial schedule horizon (#7795)
|
2020-05-27 10:59:28 +02:00 |
|
Sven Mika
|
1d4823c0ec
|
[RLlib] Add testing framework_iterator. (#7852)
* Add testing framework_iterator.
* LINT.
* WIP.
* Fix and LINT.
* LINT fix.
|
2020-04-03 12:24:25 -07:00 |
|
Sven Mika
|
20ef4a8603
|
[RLlib] Cleanup/unify all test cases. (#7533)
|
2020-03-11 20:39:47 -07:00 |
|
Sven Mika
|
d537e9f0d8
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
|
Sven Mika
|
6e1c3ea824
|
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
2020-02-10 15:22:07 -08:00 |
|
Sven Mika
|
136ada5fb9
|
[RLlib] Experiment with py_func as a means to further unify tf and torch (Schedule classes). (#6951)
|
2020-01-30 11:27:57 -08:00 |
|
Sven Mika
|
4c97348cb6
|
[RLlib] Schedule-classes multi-framework support. (#6926)
|
2020-01-28 11:07:55 -08:00 |
|