Sven Mika | dfc116ea27 | [RLlib] Discussion 681: Metrics prepends newest episodes instead of appending. (#15236) | 2021-04-11 15:31:43 +02:00
Chris Bamford | cd89f0dc55 | [RLLib] Episode media logging support (#14767) | 2021-03-19 09:17:09 +01:00
Sven Mika | c3a15ecc0f | [RLlib] Issue #13802: Enhance metrics for multiagent->count_steps_by=agent_steps setting. (#14033) | 2021-03-18 20:27:41 +01:00
Sven Mika | eb0038612f | [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) | 2021-02-08 15:02:19 +01:00
Michael Luo | a2d1215200 | [RLlib] Execution Annotation (#13036) | 2020-12-24 09:30:33 -05:00
Edward Oakes | cde711aaf1 | Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886) This reverts commit becca1424d. | 2020-12-15 11:03:02 -08:00
Michael Luo | becca1424d | [RLLib] Execution-Folder Type Annotations (#12760) | 2020-12-14 19:16:44 +01:00
Sven Mika | 805dad3bc4 | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00
Sven Mika | ef18893fb5 | [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) | 2020-09-02 14:03:01 +02:00
Eric Liang | 9a83908c46 | [rllib] Deprecate policy optimizers (#8345) | 2020-05-21 10:16:18 -07:00
Eric Liang | b14cc16616 | [rllib] Enable functional execution workflow API by default (#8221) | 2020-05-05 12:36:42 -07:00
Eric Liang | 2298f6fb40 | [rllib] Port DQN/Ape-X to training workflow api (#8077) | 2020-04-23 12:39:19 -07:00
Eric Liang | 31b40b00f6 | [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) | 2020-04-10 00:56:08 -07:00