hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	7f2b3c0824	[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). (#17742 )	2021-08-11 18:30:21 +02:00
Julius Frost	6891dee6ea	[RLlib] Better exceptions with traceback in TorchPolicy (#17690 )	2021-08-11 15:01:07 +02:00
Sven Mika	811d71b368	[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657 )	2021-08-11 12:44:35 +02:00
Sven Mika	29f20cccb6	[RLlib] Issue 17706: AttributeError: 'numpy.ndarray' object has no attribute 'items'" on certain turn-based MultiAgentEnvs with Dict obs space. (#17735 )	2021-08-11 12:33:35 +02:00
J K Terry	48e32555c8	[rllib] Update PettingZoo dependency versions (#17702 ) * update pettingzoo dependency versions * pettingzoo verison * fix tests	2021-08-11 01:19:19 -07:00
Julius Frost	9322f6aab5	[rllib] Fix classes decorated with @Deprecated to be classes instead of methods (#17666 ) * fix deprecated classes from being methods * format	2021-08-10 18:25:31 -07:00
Amog Kamsetty	0b8489dcc6	Revert "[RLlib] Add support for multi-GPU to DDPG. (#17586 )" (#17707 ) This reverts commit `0eb0e0ff58`.	2021-08-10 10:50:21 -07:00
Amog Kamsetty	77f28f1c30	Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566 )" (#17709 ) This reverts commit `3b447265d8`.	2021-08-10 10:50:01 -07:00
Sven Mika	3b447265d8	[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566 )	2021-08-05 11:41:18 -04:00
Sven Mika	0eb0e0ff58	[RLlib] Add support for multi-GPU to DDPG. (#17586 )	2021-08-05 11:39:51 -04:00
Sven Mika	3013d9b341	[RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" (#17587 )	2021-08-05 11:39:15 -04:00
Kai Fricke	5d56a8aac5	[RLlib] Fix ignoring "sample_collector" config key (#17460 )	2021-08-04 10:27:35 -04:00
Sven Mika	b539560429	[RLlib] Remove SampleBatch._get_slice_indices() deprecated logging spam (moderate). (#17567 )	2021-08-04 04:21:32 -04:00
Sven Mika	5107d16ae5	[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530 )	2021-08-03 18:30:02 -04:00
Sven Mika	924f11cd45	[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371 )	2021-08-03 11:35:49 -04:00
Sven Mika	8a844ff840	[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use `ray.get_gpu_ids()` (b/c no GPUs assigned by ray). (#17444 )	2021-08-02 17:29:59 -04:00
Kai Fricke	44d209dd5f	[tune] re-enable tensorboardx without torch installed (#17403 )	2021-07-29 10:39:38 +01:00
kk-55	a7f8dc9d77	[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664 )	2021-07-28 15:25:09 -04:00
Julius Frost	d7a5ec1830	[RLlib] SAC tuple observation space fix (#17356 )	2021-07-28 12:39:28 -04:00
Sven Mika	0d8fce8fd8	[RLlib] Discussion 2294: Custom vector env example and fix. (#16083 )	2021-07-28 10:40:04 -04:00
Rohan138	f30b444bac	[Rllib] set self._allow_unknown_config (#17335 ) Co-authored-by: Sven Mika <sven@anyscale.io>	2021-07-28 11:48:41 +01:00
Sven Mika	58da5c1c9b	[RLlib] Discussion 3001: Fix comment on internal state shape (must be [B x S=state dim]). (#17341 )	2021-07-27 21:41:53 -04:00
Sven Mika	90b21ce27e	[RLlib] De-flake 3 test cases; Fix `config.simple_optimizer` and `SampleBatch.is_training` warnings. (#17321 )	2021-07-27 14:39:06 -04:00
Stefan Schneider	489febc6b2	[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038 )	2021-07-26 22:25:48 -04:00
Julius Frost	16be091702	[RLlib] Refactor `if __name__ == "__main__"` into `main()` method in rollout/train.py for better reusability (#17315 )	2021-07-26 11:12:59 -04:00
Sven Mika	5231fdd996	[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). (#17331 )	2021-07-26 10:52:55 -04:00
Sven Mika	0c5c70b584	[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118 )	2021-07-25 16:55:51 -04:00
Chris Bamford	29768a7c01	[RLLib] (P1 regression) Fixing view requirements in compute actions (#15856 )	2021-07-25 14:25:07 -04:00
ddworak94	fba8461663	[RLlib] Add RNN-SAC agent (#16577 ) Shoutout to @ddworak94 :)	2021-07-25 10:04:52 -04:00
Sven Mika	7bc4376466	[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077 )	2021-07-22 10:59:13 -04:00
Richard Liaw	a78a2263e5	[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896 )	2021-07-22 10:55:07 -04:00
Vince Jankovics	05c9dfbbda	[RLlib] CV2 to Skimage dependency change (#16841 )	2021-07-21 22:24:18 -04:00
Julius Frost	0b1b6222bc	[rllib] Add merge_trainer_config arguments to trainer template (#17160 )	2021-07-21 15:43:06 -07:00
Sven Mika	5a313ba3d6	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169 )	2021-07-20 14:58:13 -04:00
Amog Kamsetty	cb74053ee5	Retry remove `gpustat` dependency (#17115 ) * remove gpustat * move psutil imports	2021-07-19 11:14:10 -07:00
Sven Mika	18d173b172	[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031 )	2021-07-19 13:16:03 -04:00
Sven Mika	e0640ad0dc	[RLlib] Fix seeding for ES and ARS. (#16744 )	2021-07-19 13:13:05 -04:00
Sven Mika	649580d735	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
kk-55	13094a3f1c	AttributeError: 'numpy.ndarray' object has no attribute 'get_shape' when running with framework config tf2 or tfe (#16868 )	2021-07-15 10:47:16 +01:00
Sven Mika	ce6dfc9b2d	[RLlib] Update tf1.x vs tf2.x documentation and eager example script. (#17030 )	2021-07-13 20:02:17 -04:00
Grzegorz Bartyzel	d553d4da6c	[RLlib] DQN (Rainbow): Fix torch noisy layer support and loss (#16716 )	2021-07-13 16:48:06 -04:00
Sven Mika	1fd0eb805e	[RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). (#17014 )	2021-07-13 14:01:30 -04:00
Antoine Galataud	16f1011c07	[RLlib] Issue 15910: APEX current learning rate not updated on local worker (#15911 )	2021-07-13 14:01:00 -04:00
Amog Kamsetty	38b5b6d24c	Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )" (#17036 ) This reverts commit `e4123fff27`.	2021-07-13 09:57:15 -07:00
Kai Fricke	27d80c4c88	[RLlib] ONNX export for tensorflow (1.x) and torch (#16805 )	2021-07-13 12:38:11 -04:00
Kai Fricke	3380b68b54	[RLlib] Issue 16683: Fix last infos dict (#16999 ).	2021-07-13 11:33:48 -04:00
Sven Mika	e4123fff27	[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )	2021-07-13 06:38:14 -04:00
Amog Kamsetty	df3dd81348	[rllib] skip highly flaky tests (#17010 )	2021-07-12 11:18:28 -07:00
Amog Kamsetty	bc33dc7e96	Revert "[RLlib] Fix bug in policy.py: normalize_actions=True has to call `unsquash_action`, not `normalize_action`." (#17002 ) This reverts commit `7862dd64ea`.	2021-07-12 11:09:14 -07:00
Sven Mika	55a90e670a	[RLlib] Trainer.add_policy() not working for tf, if added policy is trained afterwards. (#16927 )	2021-07-11 23:41:38 +02:00

785 commits