Sven Mika
|
5d50d37f45
|
[RLlib] Issue 13330: No TF installed causes crash in ModelCatalog.get_action_shape() (#13332)
|
2021-01-11 13:19:46 +01:00 |
|
Sven Mika
|
9dd9f72111
|
[RLlib] Add more detailed Documentation on Model building API (#13261)
|
2021-01-09 12:38:29 +01:00 |
|
Sven Mika
|
6f342a2221
|
[RLlib] Preparatory PR for: Documentation on Model Building. (#13260)
|
2021-01-08 10:56:09 +01:00 |
|
Sven Mika
|
a5b39ef8e2
|
[RLlib] Fix missing "info_batch" arg (None) in compute_actions calls. (#13237)
|
2021-01-07 21:25:02 +01:00 |
|
Sven Mika
|
bcaff63909
|
[RLlib] SquashedGaussians should throw error when entropy or kl are called. (#13126)
|
2021-01-07 15:07:35 +01:00 |
|
Basu Jindal
|
4e569ee20b
|
Update multi_agent_independent_learning.py (#13196)
pettingzoo.utils.error.DeprecatedEnv: waterworld_v0 is now depreciated, use waterworld_v2 instead
|
2021-01-05 13:44:54 -08:00 |
|
Sven Mika
|
9eba1871bb
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
|
Sven Mika
|
8726521604
|
[RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091)
|
2020-12-30 22:30:52 -05:00 |
|
Sven Mika
|
391cdfae8c
|
[RLlib] Trajectory view API docs. (#12718)
|
2020-12-30 17:32:21 -08:00 |
|
Sven Mika
|
28ac4243f4
|
[RLlib] Deflake test case: 2-step game MADDPG. (#13121)
|
2020-12-30 18:37:37 -05:00 |
|
Michael Luo
|
42cd414e5b
|
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
2020-12-30 10:11:57 -05:00 |
|
Michael Luo
|
eae7a1f433
|
[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035)
|
2020-12-29 18:45:55 -05:00 |
|
Sven Mika
|
d811d65920
|
[RLlib] run_regression_tests.py: --framework flag (instead of --torch). (#13097)
|
2020-12-29 15:27:59 -05:00 |
|
Sven Mika
|
c524f86785
|
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064)
|
2020-12-27 09:46:03 -05:00 |
|
Sven Mika
|
a5318961de
|
[RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083)
|
2020-12-26 20:14:36 -05:00 |
|
Sven Mika
|
99ae7bae05
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
|
Michael Luo
|
4bcd475671
|
[RLlib] Improved Documentation for PPO, DDPG, and SAC (#12943)
|
2020-12-24 09:31:35 -05:00 |
|
Michael Luo
|
a2d1215200
|
[RLlib] Execution Annotation (#13036)
|
2020-12-24 09:30:33 -05:00 |
|
Corey Lowman
|
668ea0bc26
|
Fix typo RMSProp -> RMSprop (#13063)
|
2020-12-23 13:37:46 -08:00 |
|
Sven Mika
|
1e74187179
|
[RLlib] TorchPolicies: Accessing "infos" dict in train_batch causes TypeError. (#13039)
|
2020-12-23 11:30:50 -05:00 |
|
Sven Mika
|
670d083a56
|
[RLlib] Fix broken unity3d_env import in example server script. (#13040)
|
2020-12-23 11:29:58 -05:00 |
|
Sven Mika
|
01faeabc17
|
[RLlib] Issue 12789: RLlib throws the warning "The given NumPy array is not writeable" (#12793)
|
2020-12-22 09:28:07 -05:00 |
|
Sven Mika
|
d5604eaba3
|
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)
|
2020-12-21 18:38:34 -08:00 |
|
roireshef
|
ef95db51e1
|
[RLlib] Arbitrary input to value() when not using GAE (#12941)
|
2020-12-21 12:19:33 -05:00 |
|
Sven Mika
|
b2bcab711d
|
[RLlib] Attention Nets: tf (#12753)
|
2020-12-20 20:22:32 -05:00 |
|
Sven Mika
|
407a3523f3
|
[RLlib] eval_workers after restore not generated in Trainer due to unintuitive config handling. (#12844)
|
2020-12-20 09:37:31 -05:00 |
|
Sven Mika
|
124c8318a8
|
[RLlib] Fix broken test_distributions.py (test_categorical) (#12915)
|
2020-12-17 17:44:26 -06:00 |
|
Edward Oakes
|
aedcf0c9d9
|
Disable test_distributions (#12919)
|
2020-12-16 14:17:49 -08:00 |
|
Edward Oakes
|
cde711aaf1
|
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
This reverts commit becca1424d.
|
2020-12-15 11:03:02 -08:00 |
|
Michael Luo
|
becca1424d
|
[RLLib] Execution-Folder Type Annotations (#12760)
|
2020-12-14 19:16:44 +01:00 |
|
Sven Mika
|
3c808835a5
|
[RLlib] Issue 12831: AttributeError: 'NoneType' object has no attribute 'id' when using custom Atari env. (#12832)
|
2020-12-13 16:15:54 +01:00 |
|
Sven Mika
|
abb1eefdc2
|
[RLlib] Issue 12483: Discrete observation space error: "ValueError: ('Observation ({}) outside given space ..." when doing Trainer.compute_action. (#12787)
|
2020-12-11 22:43:30 +01:00 |
|
Sven Mika
|
74c98ac38e
|
[RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786)
|
2020-12-11 16:13:38 +01:00 |
|
Sven Mika
|
a082ea18b8
|
[RLlib] Issue 12212: "TFEagerPolicy has no attribute action_sampler_fn.
|
2020-12-11 12:57:33 +01:00 |
|
Sven Mika
|
deb33bce84
|
[RLlib] Add DQN SoftQ learning test case. (#12712)
|
2020-12-10 14:55:19 +01:00 |
|
Sven Mika
|
ea25482f6a
|
WIP. (#12706)
|
2020-12-09 11:49:21 -08:00 |
|
Sven Mika
|
f6241302a8
|
[RLlib] Fix issue 12678: MultiAgentBatch has no attribute total. (#12704)
|
2020-12-09 16:41:13 +01:00 |
|
Sven Mika
|
28108c905b
|
[RLlib] Tf-eager policy bug fix: Duplicate model call in compute_gradients. (#12682)
|
2020-12-09 08:03:58 +01:00 |
|
Sven Mika
|
e40b14d255
|
[RLlib] Batch-size for truncate_episode batch_mode should be configurable in agent-steps (rather than env-steps), if needed. (#12420)
|
2020-12-08 16:41:45 -08:00 |
|
Felipe Antunes
|
4c0f0ce3a9
|
[RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619)
|
2020-12-08 12:39:40 +01:00 |
|
Sven Mika
|
340b1e99fc
|
[RLlib] Fix JAX import bug. (#12621)
|
2020-12-07 11:05:08 -08:00 |
|
Sven Mika
|
99c81c6795
|
[RLlib] Attention Net prep PR #3. (#12450)
|
2020-12-07 13:08:17 +01:00 |
|
Kai Fricke
|
219c445648
|
[tune] verbosity refactor second attempt (#12571)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
|
2020-12-04 13:56:26 -08:00 |
|
Sven Mika
|
3f4bc16276
|
[RLlib] Add a minimal JAX ModelV2 (FCNet) to RLlib. (#12502)
|
2020-12-03 15:51:30 +01:00 |
|
Sven Mika
|
19c8033df2
|
[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366)
* WIP.
* LINT and fixes.
MB-MPO and MAML not working yet.
* wip
* update
* update
* remove
* remove dep
* higher
* Update requirements_rllib.txt
* Update requirements_rllib.txt
* relpos
* no mbmpo
Co-authored-by: Eric Liang <ekhliang@gmail.com>
|
2020-12-01 17:41:10 -08:00 |
|
Sven Mika
|
9021f15b2a
|
[RLlib] Fix setup-dev.py error when creating a softlink for new_dashboard. (#12442)
|
2020-12-01 11:46:59 +01:00 |
|
Sven Mika
|
3ad9365e1d
|
[RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449)
|
2020-12-01 08:21:45 +01:00 |
|
Amog Kamsetty
|
f9a99f20dd
|
Revert "Re-Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12478)" (#12515)
This reverts commit 3f22448834.
|
2020-11-30 19:05:55 -08:00 |
|
Siyuan (Ryans) Zhuang
|
3f22448834
|
Re-Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12478)
* [Core] zero-copy serializer for pytorch (#12344)
* zero-copy serializer for pytorch
* address possible bottleneck
* add tests & device support
(cherry picked from commit 0a505ca83d)
* add environmental variables
* update doc
|
2020-11-30 11:43:03 -08:00 |
|
Sven Mika
|
bb03e2499b
|
[RLlib] PyBullet Env native support via env str-specifier (if installed). (#12209)
|
2020-11-30 12:41:24 +01:00 |
|