hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Author	SHA1	Message	Date
Chace Ashcraft	ebeee1d59a	[RLlib] Pytorch MAML fix for more than two workers with discrete actions (#13835 )	2021-02-08 12:06:02 +01:00
Sven Mika	d001af3e59	[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718 )	2021-02-08 12:05:16 +01:00
Sven Mika	9ac731558b	[RLlib] Unify fcnet initializers for the value output layer (std=1.0 in torch, but 0.01 in tf). (#13733 )	2021-02-02 18:42:49 +01:00
Sven Mika	0a0d9183fe	[RLlib] Trajectory view API example script (enhancements and tf2 support). (#13786 )	2021-02-02 18:42:18 +01:00
Stanislav Chekmenev	b9c15a2551	[RLlib] Issue #13761 : Fix get action shape (#13764 )	2021-02-02 13:13:43 +01:00
Raoul Khouri	714c367b9d	[RLlib] Trainer._validate_config idempotentcy correction (issue 13427) (#13556 )	2021-02-02 13:11:57 +01:00
Sven Mika	52c94b7ee9	[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522 )	2021-02-02 13:05:58 +01:00
Sven Mika	4bc257f4fb	[RLlib] Fix custom multi action distr (#13681 )	2021-01-28 19:28:48 +01:00
Yuri Rocha	b01b0f80aa	[RLlib] Fix multiple Unity3DEnvs trying to connect to the same custom port (#13519 )	2021-01-28 13:28:08 +01:00
cathrinS	d4ef5c5993	[RLlib] Atari-RAM-Preprocessing, unsigned observation vector results in a false preprocessed observation (#13013 )	2021-01-28 12:07:00 +01:00
Maltimore	b4702de1c2	[RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708 )	2021-01-25 12:56:00 +01:00
Jan Blumenkamp	964689b280	[RLlib] Fix bug in ModelCatalog when using custom action distribution (#12846 ) * return tuple returned from _get_multi_action_distribution when using custom action dict * Always return dst_class and required_model_output_shape in _get_multi_action_distribution * pass model config to _get_multi_action_distribution	2021-01-25 12:42:39 +01:00
Sven Mika	9423930bcc	[RLlib] MAML: Add cartpole mass test for PyTorch. (#13679 )	2021-01-25 12:32:41 +01:00
Sven Mika	d629292d63	[RLlib] Add grad_clip config option to MARWIL and stabilize grad clipping against inf global_norms. (#13634 )	2021-01-22 19:36:02 +01:00
Michael Luo	587f207c2f	[RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550 )	2021-01-21 16:43:55 +01:00
Saeid	d11e62f9e6	[RLlib] Fix problem in preprocessing nested MultiDiscrete (#13308 )	2021-01-21 16:36:11 +01:00
Sven Mika	daf0bef285	[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553 )	2021-01-21 16:30:26 +01:00
Sven Mika	2e3655e8a9	[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238 )	2021-01-19 14:22:36 +01:00
Sven Mika	e74947cc94	[RLlib] Env directory cleanup and tests. (#13082 )	2021-01-19 10:09:39 +01:00
Sven Mika	93c0a5549b	[RLlib] Deprecate `vf_share_layers` in top-level PPO/MAML/MB-MPO configs. (#13397 )	2021-01-19 09:51:35 +01:00
Sven Mika	a65ee92b69	[RLlib] MARWIL loss function test case and cleanup. (#13455 )	2021-01-19 09:51:05 +01:00
Sven Mika	1f00f834ac	[RLlib] Solve PyTorch/TF-eager A3C async race condition between calling model and its value function. (#13467 )	2021-01-18 10:29:03 -08:00
Sven Mika	d98235cc84	[RLlib] Deflake 2x remote & local inference tests (external env). (#13459 )	2021-01-14 20:44:26 +01:00
Sven Mika	56878221ed	[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363 )	2021-01-14 14:44:33 +01:00
Sven Mika	d49c3fae0b	[RLlib] Trajectory View API: Atari framestacking. (#13315 )	2021-01-13 08:53:34 +01:00
Maltimore	3a3e4aed86	[RLlib] Add `__len__()` method to SampleBatch (#13371 )	2021-01-12 20:15:23 +01:00
Kai Fricke	25f10a947a	Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )" (#13361 ) This reverts commit `e2b2abb88b`.	2021-01-12 12:33:57 +01:00
Sven Mika	e2b2abb88b	[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339 )	2021-01-11 22:42:30 +01:00
Sven Mika	5d50d37f45	[RLlib] Issue 13330: No TF installed causes crash in `ModelCatalog.get_action_shape()` (#13332 )	2021-01-11 13:19:46 +01:00
Sven Mika	9dd9f72111	[RLlib] Add more detailed Documentation on Model building API (#13261 )	2021-01-09 12:38:29 +01:00
Sven Mika	6f342a2221	[RLlib] Preparatory PR for: Documentation on Model Building. (#13260 )	2021-01-08 10:56:09 +01:00
Sven Mika	a5b39ef8e2	[RLlib] Fix missing "info_batch" arg (None) in `compute_actions` calls. (#13237 )	2021-01-07 21:25:02 +01:00
Sven Mika	bcaff63909	[RLlib] SquashedGaussians should throw error when entropy or kl are called. (#13126 )	2021-01-07 15:07:35 +01:00
Basu Jindal	4e569ee20b	Update multi_agent_independent_learning.py (#13196 ) pettingzoo.utils.error.DeprecatedEnv: waterworld_v0 is now depreciated, use waterworld_v2 instead	2021-01-05 13:44:54 -08:00
Sven Mika	9eba1871bb	[RLlib] Support easy `use_attention=True` flag for using the GTrXL model. (#11698 )	2021-01-01 14:06:23 -05:00
Sven Mika	8726521604	[RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091 )	2020-12-30 22:30:52 -05:00
Sven Mika	391cdfae8c	[RLlib] Trajectory view API docs. (#12718 )	2020-12-30 17:32:21 -08:00
Sven Mika	28ac4243f4	[RLlib] Deflake test case: 2-step game MADDPG. (#13121 )	2020-12-30 18:37:37 -05:00
Michael Luo	42cd414e5b	[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118 )	2020-12-30 10:11:57 -05:00
Michael Luo	eae7a1f433	[RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035 )	2020-12-29 18:45:55 -05:00
Sven Mika	d811d65920	[RLlib] run_regression_tests.py: --framework flag (instead of --torch). (#13097 )	2020-12-29 15:27:59 -05:00
Sven Mika	c524f86785	[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064 )	2020-12-27 09:46:03 -05:00
Sven Mika	a5318961de	[RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083 )	2020-12-26 20:14:36 -05:00
Sven Mika	99ae7bae05	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
Michael Luo	4bcd475671	[RLlib] Improved Documentation for PPO, DDPG, and SAC (#12943 )	2020-12-24 09:31:35 -05:00
Michael Luo	a2d1215200	[RLlib] Execution Annotation (#13036 )	2020-12-24 09:30:33 -05:00
Corey Lowman	668ea0bc26	Fix typo RMSProp -> RMSprop (#13063 )	2020-12-23 13:37:46 -08:00
Sven Mika	1e74187179	[RLlib] TorchPolicies: Accessing "infos" dict in train_batch causes `TypeError`. (#13039 )	2020-12-23 11:30:50 -05:00
Sven Mika	670d083a56	[RLlib] Fix broken unity3d_env import in example server script. (#13040 )	2020-12-23 11:29:58 -05:00
Sven Mika	01faeabc17	[RLlib] Issue 12789: RLlib throws the warning "The given NumPy array is not writeable" (#12793 )	2020-12-22 09:28:07 -05:00

... 4 5 6 7 8 ...

812 commits