Avnish Narayan | 393cf4d8f7 | [RLlib] Fix action_sampler_fn call in TorchPolicyV2 (obs_batch instead of input_dict arg). (#25877) | 2022-06-17 08:39:39 +02:00
Sven Mika | 130b7eeaba | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00
Artur Niederfahrenhorst | 7495e9c89c | [RLlib] Dreamer Policy sub-classing schema. (#25585) | 2022-06-09 17:14:15 +02:00
Eric Liang | 905258dbc1 | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00
kourosh hakhamaneshi | 9684ea3af6 | [RLlib] Fix TorchPolicyV2 bug. (#25203) | 2022-05-26 20:49:26 +02:00
Jun Gong | d5a6d46049 | [RLlib] Migrate MAML, MB-MPO, MARWIL, and BC to use Policy sub-classing implementation. (#24914) | 2022-05-20 14:10:59 +02:00
Jun Gong | bc3a1d35cf | [RLlib] Introduce new policy base classes. (#24742) | 2022-05-13 21:48:30 +02:00