Sven Mika
|
9eba1871bb
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
|
Sven Mika
|
8726521604
|
[RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091)
|
2020-12-30 22:30:52 -05:00 |
|
Sven Mika
|
b2bcab711d
|
[RLlib] Attention Nets: tf (#12753)
|
2020-12-20 20:22:32 -05:00 |
|
Michael Luo
|
59ccbc0fc7
|
[RLlib] Model Annotations: Tensorflow (#11964)
|
2020-11-12 12:18:50 +01:00 |
|
Sven Mika
|
d3bc20b727
|
[RLlib] ConvTranspose2D module (#11231)
|
2020-10-12 15:00:42 +02:00 |
|
Barak Michener
|
8e76796fd0
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
|
Sven Mika
|
01125b8fcf
|
[RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). (#9304)
|
2020-07-09 10:44:10 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
4fd8977eaf
|
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
* WIP.
* Fixes.
* LINT.
* Fixes.
* Fixes and LINT.
* WIP.
|
2020-06-25 19:01:32 +02:00 |
|
Tanay Wakhare
|
f77c638d6d
|
Pytorch AttentionNet (#9088)
|
2020-06-23 20:42:30 +02:00 |
|
Sven Mika
|
796a834c48
|
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
2020-05-18 17:26:40 +02:00 |
|