Commit graph

84 commits

Author | SHA1 | Message | Date
Sven Mika
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; (#18466) 2021-09-10 16:52:47 +02:00
Sven Mika
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 (#18367) 2021-09-09 08:10:42 +02:00
Sven Mika
1520c3d147
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to Trainer.add_policy() (#18428) 2021-09-09 07:10:06 +02:00
Sven Mika
9a8ca6a69d
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306) 2021-09-03 13:29:57 +02:00
gjoliver
336e79956a
[RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156) 2021-09-03 08:02:05 +02:00
Sven Mika
2357bbc0c8
[RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249) 2021-09-02 09:28:16 +02:00
Sven Mika
82465f9342
[RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254) 2021-08-31 22:03:23 +02:00
kk-55
a7f8dc9d77
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664) 2021-07-28 15:25:09 -04:00
Sven Mika
0d8fce8fd8
[RLlib] Discussion 2294: Custom vector env example and fix. (#16083) 2021-07-28 10:40:04 -04:00
Sven Mika
0c5c70b584
[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118) 2021-07-25 16:55:51 -04:00
Sven Mika
7bc4376466
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077) 2021-07-22 10:59:13 -04:00
Vince Jankovics
05c9dfbbda
[RLlib] CV2 to Skimage dependency change (#16841) 2021-07-21 22:24:18 -04:00
Kai Fricke
3380b68b54
[RLlib] Issue 16683: Fix last infos dict (#16999). 2021-07-13 11:33:48 -04:00
Kai Fricke
10fd7111b3
[rllib] Improve test learning check, fix flaky two step qmix (#16843) 2021-07-06 19:39:12 +01:00
Rodrigo de Lazcano
5072d86323
[rllib] parallel pettingzoo import (#16722) 2021-07-01 18:37:59 -07:00
Sven Mika
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 2021-06-30 12:32:11 +02:00
Sven Mika
c95dea51e9
[RLlib] External env enhancements + more examples. (#16583) 2021-06-23 09:09:01 +02:00
Sven Mika
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569) 2021-06-21 13:46:01 +02:00
Sven Mika
79a9d6d517
[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428) 2021-06-19 08:57:53 +02:00
Amog Kamsetty
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359)" (#16543)
This reverts commit e78ec370a9.
2021-06-18 12:21:49 -07:00
Sven Mika
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. (#16359) 2021-06-18 10:31:30 +02:00
Sven Mika
d89fb82bfb
[RLlib] Add simple curriculum learning API and example script. (#15740) 2021-05-16 17:35:10 +02:00
Ian Rodney
82876ecc2a
[rllib] [testing] make kill failure non fatal (#15771) 2021-05-13 12:24:49 -07:00
Sven Mika
16ddab49f5
[RLlib] Trainer._evaluate -> Trainer.evaluate; Also make evaluation possible w/o evaluation worker set. (#15591) 2021-05-12 12:16:00 +02:00
Amog Kamsetty
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 (#15527)
* formatting
* format util
* format release
* format rllib/agents
* format rllib/env
* format rllib/execution
* format rllib/evaluation
* format rllib/examples
* format rllib/policy
* format rllib utils and tests
* format streaming
* more formatting
* update requirements files
* fix rllib type checking
* updates
* update
* fix circular import
* Update python/ray/tests/test_runtime_env.py
* noqa
2021-05-03 14:23:28 -07:00
Sven Mika
7e1a191f17
[RLlib] Remove all remaining tf- and MuJoCo warnings from RLlib. (#15454) 2021-04-22 19:20:19 +02:00
Sven Mika
7ff27dfe07
[RLlib] Remove atari dependency for RLlib (in favor of detailed error message). (#15292) 2021-04-20 08:46:58 +02:00
Sven Mika
bbfa8ffec9
[RLlib] Minor release 1.3 warnings cleanups. (#15272) 2021-04-14 14:03:15 +02:00
Sven Mika
f859ebb99f
[RLlib] Fix env rendering and recording options (for non-local mode; >0 workers; +evaluation-workers). (#14796) 2021-03-23 10:06:06 +01:00
Sven Mika
ee4b6e7e3b
[RLlib] Unity3D example broken due to change in ML-Agents API. Attention-net prev-n-a/r. Attention-wrapper works with images. (#14569) 2021-03-12 18:27:25 +01:00
Maxime RICHE
9a7fbd3cdf
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208) 2021-03-09 17:26:20 +01:00
Sven Mika
8000258333
[RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
Sven Mika
eb0038612f
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) 2021-02-08 15:02:19 +01:00
Sven Mika
d001af3e59
[RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718) 2021-02-08 12:05:16 +01:00
Yuri Rocha
b01b0f80aa
[RLlib] Fix multiple Unity3DEnvs trying to connect to the same custom port (#13519) 2021-01-28 13:28:08 +01:00
Sven Mika
daf0bef285
[RLlib] Dreamer: Fix broken import and add compilation test case. (#13553) 2021-01-21 16:30:26 +01:00
Sven Mika
e74947cc94
[RLlib] Env directory cleanup and tests. (#13082) 2021-01-19 10:09:39 +01:00
Sven Mika
d98235cc84
[RLlib] Deflake 2x remote & local inference tests (external env). (#13459) 2021-01-14 20:44:26 +01:00
Sven Mika
d49c3fae0b
[RLlib] Trajectory View API: Atari framestacking. (#13315) 2021-01-13 08:53:34 +01:00
Sven Mika
3c808835a5
[RLlib] Issue 12831: AttributeError: 'NoneType' object has no attribute 'id' when using custom Atari env. (#12832) 2020-12-13 16:15:54 +01:00
Sven Mika
f6241302a8
[RLlib] Fix issue 12678: MultiAgentBatch has no attribute total. (#12704) 2020-12-09 16:41:13 +01:00
Tomasz Wrona
82852f0ed2
[RLlib] Add ResetOnExceptionWrapper with tests for unstable 3rd party envs (#12353) 2020-11-25 08:41:58 +01:00
Michael Luo
6e6c680f14
MBMPO Cartpole (#11832)
* MBMPO Cartpole Done
* Added doc
2020-11-12 10:30:41 -08:00
Benjamin Black
1999266bba
Updated pettingzoo env to accommodate API changes and fixes (#11873)
* Updated pettingzoo env to accommodate API changes and fixes
* fixed test failure
* fixed linting issue
* fixed test failure
2020-11-09 16:09:49 -08:00
heng2j
9073e6507c
WIP: Update to support the Food Collector environment (#11373)
* Update to support the Food Collector environment

Recently, I have been trying out ML-Agents with Ray and wanted to use the Food Collector environment. Since the observation and action spaces for it are not defined in unity3d_env.py, I propose these changes to add support for Food Collector. I have tried this env in the [unity3d_env_local example](https://github.com/ray-project/ray/blob/master/rllib/examples/unity3d_env_local.py). Please let me know whether this is the proper adjustment; even though it is only a few lines of code, I would like to make a proper contribution.

* Apply suggestions from code review
2020-11-04 12:29:16 +01:00
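(Context for the entry above: the commit body describes supplying observation and action spaces for an ML-Agents behavior that unity3d_env.py did not yet cover. Below is a minimal, illustrative sketch of how such spaces might be wired into an RLlib multi-agent config; the behavior name "FoodCollector", the space shapes, and the import path are assumptions for illustration, not the contents of the actual patch.)

```python
# Illustrative sketch only (assumed names/shapes); not the code from PR #11373.
from gym.spaces import Box, MultiDiscrete

from ray.rllib.env.unity3d_env import Unity3DEnv  # module path assumed for Ray ~1.x

# Hypothetical spaces for a "FoodCollector" behavior: a flat vector observation
# and a branched discrete action space (shapes are placeholders).
obs_space = Box(float("-inf"), float("inf"), shape=(53,))
action_space = MultiDiscrete([3, 3, 3, 2])

# Standard RLlib multi-agent policy spec: (policy_cls, obs_space, act_space, config).
policies = {"FoodCollector": (None, obs_space, action_space, {})}

def policy_mapping_fn(agent_id):
    # Map every agent to the single shared policy in this sketch.
    return "FoodCollector"

config = {
    "env": Unity3DEnv,
    # file_name=None connects to a Unity instance running in the editor.
    "env_config": {"file_name": None},
    "multiagent": {
        "policies": policies,
        "policy_mapping_fn": policy_mapping_fn,
    },
}
```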
desktable
5af745c90d
[RLlib] Implement the SlateQ algorithm (#11450) 2020-11-03 09:52:04 +01:00
desktable
8af9ff6dc2
[RLlib] Add MultiAgentEnv wrapper for Kaggle's football environment (#11249)
* [RLlib] Add MultiAgentEnv wrapper for Kaggle's football environment
* Add unit tests to BUILD
* Add gfootball dependency
* Revert the last two commits
2020-10-08 10:57:58 -07:00
desktable
f9621ce23c
[RLlib] Add recsim_wrapper unit test to BUILD (#11225) 2020-10-08 08:23:27 +02:00
Anes Benmerzoug
ff3e411ea2
[rllib] Fix VectorEnv's check for the info object's type (#10982) 2020-10-07 15:00:37 -07:00
Sven Mika
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00