Sven Mika | 0b308719f8 | [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) | 2021-11-01 21:46:02 +01:00
Sven Mika | 9c73871da0 | [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) | 2021-10-29 12:03:56 +02:00
Sven Mika | b213565783 | [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) | 2021-10-25 15:00:00 +02:00
Sven Mika | e2be41b407 | [RLlib] MARWIL + BC: Various fixes and enhancements. (#16218) | 2021-06-03 22:29:00 +02:00
Sven Mika | f6302d81be | [RLlib] Discussion 2210: BC algo broken, if "advantages" missing in offline data. (#16019) | 2021-05-25 08:47:17 +02:00
Sven Mika | eaa7f6696d | [RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. (#15898) | 2021-05-19 15:44:11 -07:00
Michael Luo | 474f04e322 | [RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707) | 2021-05-19 16:32:29 +02:00
Sven Mika | 839fc59224 | [RLlib] CQL TensorFlow support (#15841) | 2021-05-18 11:10:46 +02:00
Sven Mika | bb8a286cbc | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00
Sven Mika | 8b3554e37e | [RLlib] Remove all (already soft-deprecated) SampleBatch.data from code. (#15335) | 2021-04-15 19:19:51 +02:00
Sven Mika | 04bc0a9828 | [RLlib] Remove all non-trajectory view API code. (#14860) | 2021-03-23 09:50:18 -07:00
Sven Mika | d629292d63 | [RLlib] Add grad_clip config option to MARWIL and stabilize grad clipping against inf global_norms. (#13634) | 2021-01-22 19:36:02 +01:00
Sven Mika | a65ee92b69 | [RLlib] MARWIL loss function test case and cleanup. (#13455) | 2021-01-19 09:51:05 +01:00
Sven Mika | c524f86785 | [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) | 2020-12-27 09:46:03 -05:00
Barak Michener | 8e76796fd0 | ci: Redo format.sh --all script & backfill lint fixes (#9956) | 2020-08-07 16:49:49 -07:00
Sven Mika | fcdf410ae1 | [RLlib] Tf2.x native. (#8752) | 2020-07-11 22:06:35 +02:00
Sven Mika | 43043ee4d5 | [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136) | 2020-06-30 10:13:20 +02:00
  * WIP.
  * Fixes.
  * LINT.
  * WIP.
  * WIP.
  * Fixes.
  * Fixes.
  * Fixes.
  * Fixes.
  * WIP.
  * Fixes.
  * Test
  * Fix.
  * Fixes and LINT.
  * Fixes and LINT.
  * LINT.
Sven Mika | 4fd8977eaf | [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) | 2020-06-25 19:01:32 +02:00
  * WIP.
  * Fixes.
  * LINT.
  * Fixes.
  * Fixes and LINT.
  * WIP.
Sven Mika | 7008902cff | [RLlib] Minor rllib.utils cleanup. (#8932) | 2020-06-16 08:52:20 +02:00
Sven Mika | c2cb5c2214 | [RLlib] MARWIL torch. (#7836) | 2020-04-06 16:38:50 -07:00
  * WIP.
  * WIP.
  * LINT.
  * Fix MARWIL so it can run with eager-mode.
  * LINT.