Commit graph

126 commits

Author SHA1 Message Date
Siyuan (Ryans) Zhuang
0c74ecad12
[Lint] Cleanup incorrectly formatted strings (Part 1: RLlib). (#23128) 2022-03-15 17:34:21 +01:00
Sven Mika
3fe6f3b3eb
[RLlib] 2 bug fixes: Bandit registration not working if torch not installed. Env checker for MA envs. (#22821) 2022-03-04 19:16:30 +01:00
Daniel
8d1f1b0a64
[RLlib] Update pettingzoo==1.15.0 supersuit==3.3.3 (#22519) 2022-03-01 11:23:27 +01:00
Jun Gong
a385c9b127
[RLlib] Update bandit_envs_recommender_system (#22421) 2022-02-24 22:43:41 +01:00
Sven Mika
6522935291
[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389) 2022-02-22 09:36:44 +01:00
JYX
49d7ba3738
[RLlib] Fix typo in vector_env docstring (#22534) 2022-02-22 08:13:50 +01:00
Avnish Narayan
740def0a13
[RLlib] Put env-checker on critical path. (#22191) 2022-02-17 14:06:14 +01:00
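For context, a minimal sketch of opting out of the now-default env validation (the `disable_env_checking` key is assumed from this era's RLlib config and should be verified against the installed version):

```python
# With the env pre-checker on the critical path, env validation runs by
# default when rollout workers are created. Sketch of disabling it:
config = {
    "env": "CartPole-v1",
    "disable_env_checking": True,  # assumed config key; verify per version
}
```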
Sven Mika
1c791b71d8
[RLlib] Fix Unity3D built-in examples action bounds from -inf/inf to -1.0/1.0. (#22247) 2022-02-10 03:00:30 +01:00
Balaji Veeramani
31ed9e5d02
[CI] Replace YAPF disables with Black disables (#21982) 2022-02-08 16:29:25 -08:00
Sven Mika
8b678ddd68
[RLlib] Issue 22036: Client should handle concurrent episodes with one being training_enabled=False. (#22076) 2022-02-06 12:35:03 +01:00
Sven Mika
f6617506a2
[RLlib] Add on_sub_environment_created to DefaultCallbacks class. (#21893) 2022-02-04 22:22:47 +01:00
Sven Mika
38d75ce058
[RLlib] Cleanup SlateQ algo; add test + add target Q-net (#21827) 2022-02-04 17:01:12 +01:00
Jun Gong
9c95b9a5fa
[RLlib] Add an env wrapper so RecSim works with our Bandits agent. (#22028) 2022-02-02 12:15:38 +01:00
Balaji Veeramani
7f1bacc7dc
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
2022-01-29 18:41:57 -08:00
Sven Mika
893536ebd9
[RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; (#21773) 2022-01-27 13:58:12 +01:00
Sven Mika
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2022-01-25 14:16:58 +01:00
Sven Mika
c288b97e5f
[RLlib] Issue 21629: Video recorder env wrapper not working. Added test case. (#21670) 2022-01-24 19:38:21 +01:00
Avnish Narayan
12b087acb8
[RLlib] Base env pre-checker. (#21569) 2022-01-18 16:34:06 +01:00
Avnish Narayan
c0f1202278
[RLlib] MultiAgentEnv pre-checker (#21476) 2022-01-13 11:31:22 +01:00
Sven Mika
f94bd99ce4
[RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448) 2022-01-11 19:50:03 +01:00
Sven Mika
34cee199b1
[RLlib] from remote_vector_env import ... -> from remote_base_env import ... (avoid deprecation warning). (#21460) 2022-01-08 17:13:04 +01:00
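A minimal sketch of the import migration this commit refers to (module paths assumed from the commit message; verify against the installed Ray version):

```python
# Old import, now emitting a deprecation warning:
# from ray.rllib.env.remote_vector_env import RemoteVectorEnv

# New import:
from ray.rllib.env.remote_base_env import RemoteBaseEnv
```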
Avnish Narayan
39f8072eac
[RLlib] [MultiAgentEnv Refactor #2] Change space types for BaseEnvs and MultiAgentEnvs (#21063) 2022-01-06 14:34:20 -08:00
Sven Mika
c01245763e
[RLlib] Revert "Revert "updated pettingzoo wrappers, env versions, urls"" (#21339) 2022-01-04 18:30:26 +01:00
Kai Fricke
489e6945a6
Revert "[RLlib] Updated pettingzoo wrappers, env versions, urls (#20113)" (#21338)
This reverts commit 327eb84154.
2022-01-03 10:21:25 +00:00
Benjamin Black
327eb84154
[RLlib] Updated pettingzoo wrappers, env versions, urls (#20113) 2022-01-02 21:29:09 +01:00
Balaji Veeramani
c263008c07
[RLlib] Move __grouping_doc_end__ (#21321)
These changes are needed for two reasons.

**`__grouping_doc_end__` is in the wrong place**
If you look at the part of the Ray documentation where the tag is referenced, you'll read
> You can use the MultiAgentEnv.with_agent_groups() method to define these groups:

However, if you look at the code snippet below, you'll see the implementation of `to_base_env` in addition to the implementation of `with_agent_groups`.

To remove `to_base_env` from the code snippet, we need to move `__grouping_doc_end__`.

**Black cannot format `multi_agent_env.py`**
For some reason, Black errors while formatting `multi_agent_env.py`. However, if we move `__grouping_doc_end__` up, the issue is resolved.
2022-01-01 20:11:06 -08:00
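For reference, a minimal sketch of the `with_agent_groups()` call that the moved doc snippet documents (group names, agent ids, and spaces below are illustrative, not from the commit):

```python
from gym.spaces import Box, Discrete, Tuple

# Assuming `env` is a MultiAgentEnv with agents "agent_1" .. "agent_4":
grouped_env = env.with_agent_groups(
    groups={
        "group_1": ["agent_1", "agent_2"],
        "group_2": ["agent_3", "agent_4"],
    },
    # A group's observations/actions are tuples over its members' spaces.
    obs_space=Tuple([Box(-1.0, 1.0, (4,)), Box(-1.0, 1.0, (4,))]),
    act_space=Tuple([Discrete(2), Discrete(2)]),
)
```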
Avnish Narayan
85a368c720
[RLlib] Expand Base env API to add necessary methods for testing. (#21027) 2021-12-16 10:19:49 +01:00
Tomasz Wrona
39c202fa66
[RLlib] Allow extra keys in info in multi-agent (#20793) 2021-12-09 14:44:33 +01:00
Avnish Narayan
6996eaa986
[RLlib] Add necessary fields to Base Envs, and BaseEnv wrapper classes (#20832) 2021-12-09 14:40:40 +01:00
Avnish Narayan
b8c64480d8
[RLlib] Change return type of try_reset to MultiEnvDict (#20868) 2021-12-06 14:15:33 +01:00
Jun Gong
65bd8e29f8
[RLlib] Update a few things to get rid of the remote_vector_env deprecation warning. (#20753) 2021-12-02 13:10:44 +01:00
Avnish Narayan
74dd0e4085
[RLlib] Make to_base_env() a method of all RLlib-supported Env classes (#20811) 2021-12-01 09:01:02 +01:00
Avnish Narayan
3ddc09544d
[RLlib] Env to base env refactor (#20785) 2021-11-30 17:02:10 -08:00
Carlo Grisetti
514ed27f63
[RLlib] Fix deprecation message for rllib.env.remote_vector_env (now RemoteBaseEnv) and migrate import (#20750) 2021-11-30 18:01:21 +01:00
Sven Mika
56619b955e
[RLlib; Documentation] Some docstring cleanups; Rename RemoteVectorEnv into RemoteBaseEnv for clarity. (#20250) 2021-11-17 21:40:16 +01:00
Avnish Narayan
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
* Fix QMix, SAC, and MADDPG too.

* Unpin gym and deprecate Pendulum-v0

Many tests in RLlib depended on Pendulum-v0;
however, in gym 0.21, Pendulum-v0 was deprecated
in favor of Pendulum-v1. This may change reward
thresholds, so we will potentially have to rerun
all of the Pendulum-v1 benchmarks, or use another
environment instead. The same applies to
FrozenLake-v0 and FrozenLake-v1.

Lastly, all of the RLlib tests have
been moved to Python 3.7.

* Add gym installation based on Python version.

Pin Python <= 3.6 to gym 0.19 due to install
issues with Atari ROMs in gym 0.20

* Reformatting

* Fixing tests

* Move atari-py install conditional to req.txt

* Migrate to new ALE install method

Make parametric_actions_cartpole return float32 actions/obs

Add type conversions if obs/actions don't match space

Add utils to make elements match gym space dtypes

Co-authored-by: Jun Gong <jungong@anyscale.com>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-11-03 16:24:00 +01:00
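To illustrate the migration this commit describes, a hedged sketch of the env-id rename plus a dtype-matching helper in the spirit of the "utils to make elements match gym space dtypes" (the helper below is hypothetical, not RLlib's actual util):

```python
import gym
import numpy as np

env = gym.make("Pendulum-v1")  # Pendulum-v0 is gone as of gym 0.21

def match_space_dtype(element, space):
    """Cast an obs/action to the dtype declared by its gym space."""
    return np.asarray(element, dtype=space.dtype)

obs = env.reset()  # gym 0.21: reset() returns the observation only
obs = match_space_dtype(obs, env.observation_space)  # e.g., float32
```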
Sven Mika
902e854af2
[RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784)
* wip.

* Test: Make a change in tune to trigger the tune tests, which are otherwise not run, but nevertheless seem to fail with this PR's changes.

* remove bare_metal_policy_with_custom_view_reqs from tests
2021-10-29 10:46:52 +02:00
Sven Mika
d439fd7f17
[RLlib] TF2/eager memory leak fixes. (#19198) 2021-10-09 00:11:53 +02:00
Sven Mika
fd438d5630
[RLlib] Issue 18104: Cannot set remote_worker_envs=True for non local-mode and MultiAgentEnv. (#19133) 2021-10-07 22:39:21 +02:00
Sven Mika
05a55a9335
[RLlib] Issue 18668: Unity3D env client/server example not working (fix + add to test cases). (#18942) 2021-09-30 08:30:20 +02:00
mvindiola1
62f5da0b65
[RLlib] Add unit tests for updating episode data in base_env (#17137) 2021-09-24 16:08:11 +02:00
Sven Mika
fd13bac9b3
[RLlib] Add worker arg (optional) to policy_mapping_fn. (#18184) 2021-09-17 12:07:11 +02:00
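A minimal sketch of the extended mapping-fn signature this commit adds (argument order and the `worker_index` attribute are assumed from this era's RLlib; verify before relying on them):

```python
def policy_mapping_fn(agent_id, episode, worker, **kwargs):
    # `worker` (the RolloutWorker) is now passed in as an optional arg,
    # so the mapping can depend on worker-local state such as its index.
    return "policy_a" if worker.worker_index % 2 == 0 else "policy_b"
```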
Sven Mika
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; (#18466) 2021-09-10 16:52:47 +02:00
Sven Mika
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 (#18367) 2021-09-09 08:10:42 +02:00
Sven Mika
1520c3d147
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to Trainer.add_policy() (#18428) 2021-09-09 07:10:06 +02:00
Sven Mika
9a8ca6a69d
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306) 2021-09-03 13:29:57 +02:00
gjoliver
336e79956a
[RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156) 2021-09-03 08:02:05 +02:00
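Following this change, a custom multi-agent env subclasses `MultiAgentEnv` (and is thereby a `gym.Env`) instead of having its class type patched at runtime. A minimal sketch with illustrative spaces and dynamics:

```python
import gym
from ray.rllib.env.multi_agent_env import MultiAgentEnv

class TwoAgentEnv(MultiAgentEnv):
    def __init__(self, config=None):
        super().__init__()
        self.observation_space = gym.spaces.Discrete(2)
        self.action_space = gym.spaces.Discrete(2)

    def reset(self):
        return {"agent_1": 0, "agent_2": 0}  # per-agent obs dict

    def step(self, action_dict):
        obs = {aid: 1 for aid in action_dict}
        rewards = {aid: 1.0 for aid in action_dict}
        dones = {"__all__": True}  # end the episode for all agents
        return obs, rewards, dones, {}

assert isinstance(TwoAgentEnv(), gym.Env)  # holds after this commit
```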
Sven Mika
2357bbc0c8
[RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249) 2021-09-02 09:28:16 +02:00
Sven Mika
82465f9342
[RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254) 2021-08-31 22:03:23 +02:00
kk-55
a7f8dc9d77
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664) 2021-07-28 15:25:09 -04:00