Sven Mika | d629292d63 | [RLlib] Add grad_clip config option to MARWIL and stabilize grad clipping against inf global_norms. (#13634) | 2021-01-22 19:36:02 +01:00
Michael Luo | 587f207c2f | [RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550) | 2021-01-21 16:43:55 +01:00
Saeid | d11e62f9e6 | [RLlib] Fix problem in preprocessing nested MultiDiscrete (#13308) | 2021-01-21 16:36:11 +01:00
Sven Mika | daf0bef285 | [RLlib] Dreamer: Fix broken import and add compilation test case. (#13553) | 2021-01-21 16:30:26 +01:00
Sven Mika | 2e3655e8a9 | [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) | 2021-01-19 14:22:36 +01:00
Sven Mika | e74947cc94 | [RLlib] Env directory cleanup and tests. (#13082) | 2021-01-19 10:09:39 +01:00
Sven Mika | 93c0a5549b | [RLlib] Deprecate vf_share_layers in top-level PPO/MAML/MB-MPO configs. (#13397) | 2021-01-19 09:51:35 +01:00
Sven Mika | a65ee92b69 | [RLlib] MARWIL loss function test case and cleanup. (#13455) | 2021-01-19 09:51:05 +01:00
Sven Mika | 1f00f834ac | [RLlib] Solve PyTorch/TF-eager A3C async race condition between calling model and its value function. (#13467) | 2021-01-18 10:29:03 -08:00
Sven Mika | d98235cc84 | [RLlib] Deflake 2x remote & local inference tests (external env). (#13459) | 2021-01-14 20:44:26 +01:00
Sven Mika | 56878221ed | [RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363) | 2021-01-14 14:44:33 +01:00
Sven Mika | d49c3fae0b | [RLlib] Trajectory View API: Atari framestacking. (#13315) | 2021-01-13 08:53:34 +01:00
Maltimore | 3a3e4aed86 | [RLlib] Add __len__() method to SampleBatch (#13371) | 2021-01-12 20:15:23 +01:00
Kai Fricke | 25f10a947a | Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339)" (#13361) This reverts commit e2b2abb88b. | 2021-01-12 12:33:57 +01:00
Sven Mika | e2b2abb88b | [RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339) | 2021-01-11 22:42:30 +01:00
Sven Mika | 5d50d37f45 | [RLlib] Issue 13330: No TF installed causes crash in ModelCatalog.get_action_shape() (#13332) | 2021-01-11 13:19:46 +01:00
Sven Mika | 9dd9f72111 | [RLlib] Add more detailed Documentation on Model building API (#13261) | 2021-01-09 12:38:29 +01:00
Sven Mika | 6f342a2221 | [RLlib] Preparatory PR for: Documentation on Model Building. (#13260) | 2021-01-08 10:56:09 +01:00
Sven Mika | a5b39ef8e2 | [RLlib] Fix missing "info_batch" arg (None) in compute_actions calls. (#13237) | 2021-01-07 21:25:02 +01:00
Sven Mika | bcaff63909 | [RLlib] SquashedGaussians should throw error when entropy or kl are called. (#13126) | 2021-01-07 15:07:35 +01:00
Basu Jindal | 4e569ee20b | Update multi_agent_independent_learning.py (#13196) pettingzoo.utils.error.DeprecatedEnv: waterworld_v0 is now depreciated, use waterworld_v2 instead | 2021-01-05 13:44:54 -08:00
Sven Mika | 9eba1871bb | [RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698) | 2021-01-01 14:06:23 -05:00
Sven Mika | 8726521604 | [RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091) | 2020-12-30 22:30:52 -05:00
Sven Mika | 391cdfae8c | [RLlib] Trajectory view API docs. (#12718) | 2020-12-30 17:32:21 -08:00
Sven Mika | 28ac4243f4 | [RLlib] Deflake test case: 2-step game MADDPG. (#13121) | 2020-12-30 18:37:37 -05:00
Michael Luo | 42cd414e5b | [RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118) | 2020-12-30 10:11:57 -05:00
Michael Luo | eae7a1f433 | [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) | 2020-12-29 18:45:55 -05:00
Sven Mika | d811d65920 | [RLlib] run_regression_tests.py: --framework flag (instead of --torch). (#13097) | 2020-12-29 15:27:59 -05:00
Sven Mika | c524f86785 | [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) | 2020-12-27 09:46:03 -05:00
Sven Mika | a5318961de | [RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083) | 2020-12-26 20:14:36 -05:00
Sven Mika | 99ae7bae05 | [RLlib] JAXPolicy prep. PR #1. (#13077) | 2020-12-26 20:14:18 -05:00
Michael Luo | 4bcd475671 | [RLlib] Improved Documentation for PPO, DDPG, and SAC (#12943) | 2020-12-24 09:31:35 -05:00
Michael Luo | a2d1215200 | [RLlib] Execution Annotation (#13036) | 2020-12-24 09:30:33 -05:00
Corey Lowman | 668ea0bc26 | Fix typo RMSProp -> RMSprop (#13063) | 2020-12-23 13:37:46 -08:00
Sven Mika | 1e74187179 | [RLlib] TorchPolicies: Accessing "infos" dict in train_batch causes TypeError. (#13039) | 2020-12-23 11:30:50 -05:00
Sven Mika | 670d083a56 | [RLlib] Fix broken unity3d_env import in example server script. (#13040) | 2020-12-23 11:29:58 -05:00
Sven Mika | 01faeabc17 | [RLlib] Issue 12789: RLlib throws the warning "The given NumPy array is not writeable" (#12793) | 2020-12-22 09:28:07 -05:00
Sven Mika | d5604eaba3 | [RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029) | 2020-12-21 18:38:34 -08:00
roireshef | ef95db51e1 | [RLlib] Arbitrary input to value() when not using GAE (#12941) | 2020-12-21 12:19:33 -05:00
Sven Mika | b2bcab711d | [RLlib] Attention Nets: tf (#12753) | 2020-12-20 20:22:32 -05:00
Sven Mika | 407a3523f3 | [RLlib] eval_workers after restore not generated in Trainer due to unintuitive config handling. (#12844) | 2020-12-20 09:37:31 -05:00
Sven Mika | 124c8318a8 | [RLlib] Fix broken test_distributions.py (test_categorical) (#12915) | 2020-12-17 17:44:26 -06:00
Edward Oakes | aedcf0c9d9 | Disable test_distributions (#12919) | 2020-12-16 14:17:49 -08:00
Edward Oakes | cde711aaf1 | Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886) This reverts commit becca1424d. | 2020-12-15 11:03:02 -08:00
Michael Luo | becca1424d | [RLLib] Execution-Folder Type Annotations (#12760) | 2020-12-14 19:16:44 +01:00
Sven Mika | 3c808835a5 | [RLlib] Issue 12831: AttributeError: 'NoneType' object has no attribute 'id' when using custom Atari env. (#12832) | 2020-12-13 16:15:54 +01:00
Sven Mika | abb1eefdc2 | [RLlib] Issue 12483: Discrete observation space error: "ValueError: ('Observation ({}) outside given space ..." when doing Trainer.compute_action. (#12787) | 2020-12-11 22:43:30 +01:00
Sven Mika | 74c98ac38e | [RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786) | 2020-12-11 16:13:38 +01:00
Sven Mika | a082ea18b8 | [RLlib] Issue 12212: "TFEagerPolicy has no attribute action_sampler_fn. | 2020-12-11 12:57:33 +01:00
Sven Mika | deb33bce84 | [RLlib] Add DQN SoftQ learning test case. (#12712) | 2020-12-10 14:55:19 +01:00