Sven Mika
|
d629292d63
|
[RLlib] Add grad_clip config option to MARWIL and stabilize grad clipping against inf global_norms. (#13634)
|
2021-01-22 19:36:02 +01:00 |
|
Sven Mika
|
a65ee92b69
|
[RLlib] MARWIL loss function test case and cleanup. (#13455)
|
2021-01-19 09:51:05 +01:00 |
|
Sven Mika
|
c524f86785
|
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064)
|
2020-12-27 09:46:03 -05:00 |
|
Barak Michener
|
8e76796fd0
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
|
Sven Mika
|
fcdf410ae1
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
4fd8977eaf
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
* WIP.
* Fixes.
* LINT.
* Fixes.
* Fixes and LINT.
* WIP.
|
2020-06-25 19:01:32 +02:00 |
|
Sven Mika
|
7008902cff
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
|
Sven Mika
|
c2cb5c2214
|
[RLlib] MARWIL torch. (#7836)
* WIP.
* WIP.
* LINT.
* Fix MARWIL so it can run with eager-mode.
* LINT.
|
2020-04-06 16:38:50 -07:00 |
|