Sven Mika
|
f82880eda1
|
Revert "Revert [RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
This reverts commit 90dc5460d4 .
|
2021-11-16 14:49:41 +01:00 |
|
Amog Kamsetty
|
90dc5460d4
|
Revert "[RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399)
This reverts commit 5b1c8e46e1 .
|
2021-11-15 16:11:35 -08:00 |
|
Sven Mika
|
5b1c8e46e1
|
[RLlib] POC: Deprecate build_policy (policy template) for torch only; PPOTorchPolicy (#20061)
|
2021-11-15 10:41:54 +01:00 |
|
Sven Mika
|
82465f9342
|
[RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254)
|
2021-08-31 22:03:23 +02:00 |
|
Sven Mika
|
8248ba531b
|
[RLlib] Redo #17410: Example script: Remote worker envs with inference done on main node. (#17960)
|
2021-08-20 08:02:18 +02:00 |
|
Alex Wu
|
318ba6fae0
|
Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)" (#17951)
This reverts commit 8fc16b9a18 .
|
2021-08-19 07:55:10 -07:00 |
|
Sven Mika
|
8fc16b9a18
|
[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)
|
2021-08-19 12:14:50 +02:00 |
|
Stefan Schneider
|
489febc6b2
|
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038)
|
2021-07-26 22:25:48 -04:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
d2c755ccef
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
|
Sven Mika
|
e973b726c2
|
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273)
|
2021-04-30 19:26:30 +02:00 |
|
Sven Mika
|
e961d2f4b2
|
[RLlib] Improve example scripts for attention nets, CartPole LSTM, and custom RNN-models. (#15329)
|
2021-04-15 16:11:34 +02:00 |
|
Sven Mika
|
e98808ce11
|
[RLlib] Fix 2 flakey test cases. (#14892)
|
2021-03-29 17:20:29 +02:00 |
|
Sven Mika
|
78c64ca151
|
[RLlib] Attention net example script: Clarifications on how to use with Trainer.compute_action. (#14864)
|
2021-03-23 19:33:01 +01:00 |
|
Sven Mika
|
9eba1871bb
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
|
Sven Mika
|
d5604eaba3
|
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)
|
2020-12-21 18:38:34 -08:00 |
|
Sven Mika
|
b2bcab711d
|
[RLlib] Attention Nets: tf (#12753)
|
2020-12-20 20:22:32 -05:00 |
|
Sven Mika
|
c17169dc11
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
7008902cff
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
|
Sven Mika
|
d8a081a185
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
0422e9c5a8
|
[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624)
|
2020-05-27 10:19:47 +02:00 |
|
Sven Mika
|
796a834c48
|
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
2020-05-18 17:26:40 +02:00 |
|