Jan Blumenkamp
964689b280
[RLlib] Fix bug in ModelCatalog when using custom action distribution ( #12846 )
...
* return tuple returned from _get_multi_action_distribution when using custom action dict
* Always return dst_class and required_model_output_shape in _get_multi_action_distribution
* pass model config to _get_multi_action_distribution
2021-01-25 12:42:39 +01:00
Sven Mika
9423930bcc
[RLlib] MAML: Add cartpole mass test for PyTorch. ( #13679 )
2021-01-25 12:32:41 +01:00
Sven Mika
d629292d63
[RLlib] Add grad_clip config option to MARWIL and stabilize grad clipping against inf global_norms. ( #13634 )
2021-01-22 19:36:02 +01:00
Michael Luo
587f207c2f
[RLlib] Support for D4RL + Semi-working CQL Benchmark ( #13550 )
2021-01-21 16:43:55 +01:00
Saeid
d11e62f9e6
[RLlib] Fix problem in preprocessing nested MultiDiscrete ( #13308 )
2021-01-21 16:36:11 +01:00
Sven Mika
daf0bef285
[RLlib] Dreamer: Fix broken import and add compilation test case. ( #13553 )
2021-01-21 16:30:26 +01:00
Sven Mika
2e3655e8a9
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. ( #13238 )
2021-01-19 14:22:36 +01:00
Sven Mika
e74947cc94
[RLlib] Env directory cleanup and tests. ( #13082 )
2021-01-19 10:09:39 +01:00
Sven Mika
93c0a5549b
[RLlib] Deprecate vf_share_layers
in top-level PPO/MAML/MB-MPO configs. ( #13397 )
2021-01-19 09:51:35 +01:00
Sven Mika
a65ee92b69
[RLlib] MARWIL loss function test case and cleanup. ( #13455 )
2021-01-19 09:51:05 +01:00
Sven Mika
1f00f834ac
[RLlib] Solve PyTorch/TF-eager A3C async race condition between calling model and its value function. ( #13467 )
2021-01-18 10:29:03 -08:00
Sven Mika
d98235cc84
[RLlib] Deflake 2x remote & local inference tests (external env). ( #13459 )
2021-01-14 20:44:26 +01:00
Sven Mika
56878221ed
[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). ( #13363 )
2021-01-14 14:44:33 +01:00
Sven Mika
d49c3fae0b
[RLlib] Trajectory View API: Atari framestacking. ( #13315 )
2021-01-13 08:53:34 +01:00
Maltimore
3a3e4aed86
[RLlib] Add __len__()
method to SampleBatch ( #13371 )
2021-01-12 20:15:23 +01:00
Kai Fricke
25f10a947a
Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. ( #13339 )" ( #13361 )
...
This reverts commit e2b2abb88b
.
2021-01-12 12:33:57 +01:00
Sven Mika
e2b2abb88b
[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. ( #13339 )
2021-01-11 22:42:30 +01:00
Sven Mika
5d50d37f45
[RLlib] Issue 13330: No TF installed causes crash in ModelCatalog.get_action_shape()
( #13332 )
2021-01-11 13:19:46 +01:00
Sven Mika
9dd9f72111
[RLlib] Add more detailed Documentation on Model building API ( #13261 )
2021-01-09 12:38:29 +01:00
Sven Mika
6f342a2221
[RLlib] Preparatory PR for: Documentation on Model Building. ( #13260 )
2021-01-08 10:56:09 +01:00
Sven Mika
a5b39ef8e2
[RLlib] Fix missing "info_batch" arg (None) in compute_actions
calls. ( #13237 )
2021-01-07 21:25:02 +01:00
Sven Mika
bcaff63909
[RLlib] SquashedGaussians should throw error when entropy or kl are called. ( #13126 )
2021-01-07 15:07:35 +01:00
Basu Jindal
4e569ee20b
Update multi_agent_independent_learning.py ( #13196 )
...
pettingzoo.utils.error.DeprecatedEnv: waterworld_v0 is now depreciated, use waterworld_v2 instead
2021-01-05 13:44:54 -08:00
Sven Mika
9eba1871bb
[RLlib] Support easy use_attention=True
flag for using the GTrXL model. ( #11698 )
2021-01-01 14:06:23 -05:00
Sven Mika
8726521604
[RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). ( #13091 )
2020-12-30 22:30:52 -05:00
Sven Mika
391cdfae8c
[RLlib] Trajectory view API docs. ( #12718 )
2020-12-30 17:32:21 -08:00
Sven Mika
28ac4243f4
[RLlib] Deflake test case: 2-step game MADDPG. ( #13121 )
2020-12-30 18:37:37 -05:00
Michael Luo
42cd414e5b
[RLlib] New Offline RL Algorithm: CQL (based on SAC) ( #13118 )
2020-12-30 10:11:57 -05:00
Michael Luo
eae7a1f433
[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents ( #13035 )
2020-12-29 18:45:55 -05:00
Sven Mika
d811d65920
[RLlib] run_regression_tests.py: --framework flag (instead of --torch). ( #13097 )
2020-12-29 15:27:59 -05:00
Sven Mika
c524f86785
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. ( #13064 )
2020-12-27 09:46:03 -05:00
Sven Mika
a5318961de
[RLlib] Preprocessor fixes (multi-discrete) and tests. ( #13083 )
2020-12-26 20:14:36 -05:00
Sven Mika
99ae7bae05
[RLlib] JAXPolicy prep. PR #1 . ( #13077 )
2020-12-26 20:14:18 -05:00
Michael Luo
4bcd475671
[RLlib] Improved Documentation for PPO, DDPG, and SAC ( #12943 )
2020-12-24 09:31:35 -05:00
Michael Luo
a2d1215200
[RLlib] Execution Annotation ( #13036 )
2020-12-24 09:30:33 -05:00
Corey Lowman
668ea0bc26
Fix typo RMSProp -> RMSprop ( #13063 )
2020-12-23 13:37:46 -08:00
Sven Mika
1e74187179
[RLlib] TorchPolicies: Accessing "infos" dict in train_batch causes TypeError
. ( #13039 )
2020-12-23 11:30:50 -05:00
Sven Mika
670d083a56
[RLlib] Fix broken unity3d_env import in example server script. ( #13040 )
2020-12-23 11:29:58 -05:00
Sven Mika
01faeabc17
[RLlib] Issue 12789: RLlib throws the warning "The given NumPy array is not writeable" ( #12793 )
2020-12-22 09:28:07 -05:00
Sven Mika
d5604eaba3
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). ( #12029 )
2020-12-21 18:38:34 -08:00
roireshef
ef95db51e1
[RLlib] Arbitrary input to value() when not using GAE ( #12941 )
2020-12-21 12:19:33 -05:00
Sven Mika
b2bcab711d
[RLlib] Attention Nets: tf ( #12753 )
2020-12-20 20:22:32 -05:00
Sven Mika
407a3523f3
[RLlib] eval_workers after restore not generated in Trainer due to unintuitive config handling. ( #12844 )
2020-12-20 09:37:31 -05:00
Sven Mika
124c8318a8
[RLlib] Fix broken test_distributions.py (test_categorical) ( #12915 )
2020-12-17 17:44:26 -06:00
Edward Oakes
aedcf0c9d9
Disable test_distributions ( #12919 )
2020-12-16 14:17:49 -08:00
Edward Oakes
cde711aaf1
Revert "[RLLib] Execution-Folder Type Annotations ( #12760 )" ( #12886 )
...
This reverts commit becca1424d
.
2020-12-15 11:03:02 -08:00
Michael Luo
becca1424d
[RLLib] Execution-Folder Type Annotations ( #12760 )
2020-12-14 19:16:44 +01:00
Sven Mika
3c808835a5
[RLlib] Issue 12831: AttributeError: 'NoneType' object has no attribute 'id' when using custom Atari env. ( #12832 )
2020-12-13 16:15:54 +01:00
Sven Mika
abb1eefdc2
[RLlib] Issue 12483: Discrete observation space error: "ValueError: ('Observation ({}) outside given space ..." when doing Trainer.compute_action. ( #12787 )
2020-12-11 22:43:30 +01:00
Sven Mika
74c98ac38e
[RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). ( #12786 )
2020-12-11 16:13:38 +01:00