Sven Mika
|
b2bcab711d
|
[RLlib] Attention Nets: tf (#12753)
|
2020-12-20 20:22:32 -05:00 |
|
Sven Mika
|
dab241dcc6
|
[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)
|
2020-11-19 19:01:14 +01:00 |
|
Sven Mika
|
6da4342822
|
[RLlib] Add on_learn_on_batch (Policy) callback to DefaultCallbacks. (#12070)
|
2020-11-18 15:39:23 +01:00 |
|
Sven Mika
|
715ee8dfc9
|
[RLlib] Issue 10469: Callbacks should receive env idx ... (#10477)
|
2020-09-03 17:27:05 +02:00 |
|
Sven Mika
|
e968b52cb7
|
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
2020-08-21 12:35:16 +02:00 |
|
Sven Mika
|
2256047876
|
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
2020-08-15 13:24:22 +02:00 |
|
Eric Liang
|
1e0e1a45e6
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
|
Eric Liang
|
f48da50e1c
|
[rllib] observation function api for multi-agent (#8236)
|
2020-05-04 22:13:49 -07:00 |
|
roireshef
|
dbcad35022
|
[RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972)
|
2020-04-16 16:06:42 -07:00 |
|