| Author | Commit | Message | Date |
| --- | --- | --- | --- |
| Joseph Suarez | 8136d2912b | [RLlib] Add policies arg to callback: on_episode_step (already exists in all other episode-related callbacks) (#18119) | 2021-08-27 16:12:19 +02:00 |
| Amog Kamsetty | cb74053ee5 | Retry remove gpustat dependency (#17115): remove gpustat; move psutil imports | 2021-07-19 11:14:10 -07:00 |
| Chris Bamford | fd1a97e39f | [RLlib] Memory leak docs (#15908) | 2021-06-10 18:10:21 +02:00 |
| Chris Bamford | 0be83d9a95 | [RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815) | 2021-05-18 13:23:00 +02:00 |
| Sven Mika | eb0038612f | [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) | 2021-02-08 15:02:19 +01:00 |
| Sven Mika | b2bcab711d | [RLlib] Attention Nets: tf (#12753) | 2020-12-20 20:22:32 -05:00 |
| Sven Mika | dab241dcc6 | [RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063) | 2020-11-19 19:01:14 +01:00 |
| Sven Mika | 6da4342822 | [RLlib] Add on_learn_on_batch (Policy) callback to DefaultCallbacks. (#12070) | 2020-11-18 15:39:23 +01:00 |
| Sven Mika | 715ee8dfc9 | [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) | 2020-09-03 17:27:05 +02:00 |
| Sven Mika | e968b52cb7 | [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) | 2020-08-21 12:35:16 +02:00 |
| Sven Mika | 2256047876 | [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) | 2020-08-15 13:24:22 +02:00 |
| Eric Liang | 1e0e1a45e6 | [rllib] Add type annotations for evaluation/, env/ packages (#9003) | 2020-06-19 13:09:05 -07:00 |
| Eric Liang | f48da50e1c | [rllib] observation function api for multi-agent (#8236) | 2020-05-04 22:13:49 -07:00 |
| roireshef | dbcad35022 | [RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972) | 2020-04-16 16:06:42 -07:00 |