kourosh hakhamaneshi
5030a4c1d3
[RLlib] Simplify agent collector ( #26803 )
2022-07-25 13:17:17 -07:00
Jun Gong
0bc560bd54
[RLlib] Make sure we step() after adding init_obs. ( #26827 )
2022-07-21 20:43:46 -07:00
Rohan Potdar
2b13ac85f9
[RLlib]: Make IOContext optional for DatasetReader ( #26694 )
2022-07-21 13:05:00 -07:00
Jun Gong
6b6d3017ba
[RLlib] more connector polishes and fixes. ( #26645 )
2022-07-19 08:50:28 -07:00
Rohan Potdar
38c9e1d52a
[RLlib]: Fix OPE trainables ( #26279 )
...
Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2022-07-17 14:25:53 -07:00
kourosh hakhamaneshi
569fe01096
[RLlib] improved unittests for dataset_reader and fixed bugs ( #26458 )
2022-07-17 13:38:15 -07:00
Sven Mika
4aea24c8a8
[RLlib] restart_failed_sub_environments now works for MA cases and crashes during reset(); +more tests and logging; add eval worker sub-env fault tolerance test. ( #26276 )
2022-07-15 08:55:14 +02:00
Avnish Narayan
a322ac463c
[RLlib] Make JSONReader default; users will have to use the DatasetReader for any speedups. ( #26541 )
2022-07-14 17:19:38 +02:00
Jun Gong
b383d987d1
[RLlib] Fix a bunch of issues related to connectors. ( #26510 )
2022-07-13 18:55:20 +02:00
Rohan Potdar
09ce4711fd
[RLlib]: Move OPE to evaluation config ( #25911 )
2022-07-12 11:04:34 -07:00
kourosh hakhamaneshi
be6e4c644f
[RLlib] Feature importance evaluation for offline RL ( #26412 )
2022-07-11 18:12:50 -07:00
Jun Gong
0c469e490e
[RLlib] Checkpoint and restore connectors. ( #26253 )
2022-07-09 01:06:24 -07:00
Avnish Narayan
1243ed62bf
[RLlib] Make Dataset reader default reader and enable CRR to use dataset ( #26304 )
...
Co-authored-by: avnish <avnish@avnishs-MBP.local.meter>
2022-07-08 12:43:35 -07:00
Sven Mika
f8785c49df
[RLlib] Issue 25696: Output writers not working w/ multiple workers. ( #25722 )
2022-06-30 13:25:56 +02:00
Sven Mika
ca913ff6d6
[RLlib] Eval WorkerSet crashes when trying to re-add a failed worker (eval set does not have local worker). ( #26134 )
2022-06-30 13:25:22 +02:00
Jun Gong
d83bbda281
[RLlib] Save serialized PolicySpec. Extract num_gpus related logic into a util function. ( #25954 )
2022-06-30 11:38:21 +02:00
Jun Gong
52bb8e47d4
[RLlib] EnvRunnerV2 and EpisodeV2 that support Connectors. ( #25922 )
2022-06-30 08:44:10 +02:00
simonsays1980
05d3af766c
[RLlib] Added 'episode.hist_data' to the 'atari_metrics' to ensure that custom metrics of the user are kept in postprocessing when using Atari environments. ( #25292 )
2022-06-28 16:31:57 +02:00
Artur Niederfahrenhorst
bed9083f35
[RLlib] Add timeout to filter synchronization. ( #25959 )
2022-06-24 14:37:43 +02:00
Sven Mika
59a967a3a0
[RLlib] Cleanup some deprecated metric keys and classes. ( #26036 )
2022-06-23 21:30:01 +02:00
JYX
bde46e8a88
Fix several typos in rollout_worker.py ( #26028 )
2022-06-23 11:41:53 -07:00
Eric Liang
43aa2299e6
[api] Annotate as public / move ray-core APIs to _private and add enforcement rule ( #25695 )
...
Enable checking of the ray core module, excluding serve, workflows, and tune, in ./ci/lint/check_api_annotations.py. This required moving many files to ray._private and associated fixes.
2022-06-21 15:13:29 -07:00
Rohan Potdar
28df3f34f5
[RLlib]: Off-Policy Evaluation fixes. ( #25899 )
2022-06-21 13:24:24 +02:00
Artur Niederfahrenhorst
e10876604d
[RLlib] Include SampleBatch.T column in all collected batches. ( #25926 )
2022-06-21 13:20:22 +02:00
Sven Mika
96693055bd
[RLlib] More Trainer -> Algorithm renaming cleanups. ( #25869 )
2022-06-20 15:54:00 +02:00
Yi Cheng
7b8b0f8e03
Revert "[RLlib] Remove execution plan code no longer used by RLlib. ( #25624 )" ( #25776 )
...
This reverts commit 804719876b.
2022-06-14 13:59:15 -07:00
Avnish Narayan
804719876b
[RLlib] Remove execution plan code no longer used by RLlib. ( #25624 )
2022-06-14 10:57:27 +02:00
Sven Mika
130b7eeaba
[RLlib] Trainer to Algorithm renaming. ( #25539 )
2022-06-11 15:10:39 +02:00
kourosh hakhamaneshi
b3a351925d
[RLlib] Added meaningful error for multi-agent failure of SampleCollector in case no agent steps in episode. ( #25596 )
2022-06-10 12:30:43 +02:00
Rohan Potdar
a9d8da0100
[RLlib]: Doubly Robust Off-Policy Evaluation. ( #25056 )
2022-06-07 12:52:19 +02:00
Artur Niederfahrenhorst
5133978adc
[RLlib] PG policy subclassing conversion. ( #25288 )
2022-06-06 13:07:47 +02:00
Sven Mika
b5bc2b93c3
[RLlib] Move all remaining algos into algorithms directory. ( #25366 )
2022-06-04 07:35:24 +02:00
Yi Cheng
fd0f967d2e
Revert "[RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. ( #25346 )" ( #25420 )
...
This reverts commit e4ceae19ef.
Reverts #25346
linux://python/ray/tests:test_client_library_integration never failed before this PR.
In the CI of the reverted PR it also fails (https://buildkite.com/ray-project/ray-builders-pr/builds/34079#01812442-c541-4145-af22-2a012655c128 ), so it is highly likely caused by this PR.
The test output failure also appears related (https://buildkite.com/ray-project/ray-builders-branch/builds/7923#018125c2-4812-4ead-a42f-7fddb344105b ).
2022-06-02 20:38:44 -07:00
Sven Mika
e4ceae19ef
[RLlib] Move (A/DD)?PPO and IMPALA algos to algorithms dir and rename policy and trainer classes. ( #25346 )
2022-06-02 16:47:05 +02:00
kourosh hakhamaneshi
87c9fdd0f8
[RLlib] Fix bug: WorkerSet.stop() raises an error if self._local_worker is None (e.g. in evaluation worker sets). ( #25332 )
2022-06-02 09:41:43 +02:00
Eric Liang
905258dbc1
Clean up docstyle in python modules and add LINT rule ( #25272 )
2022-06-01 11:27:54 -07:00
Sven Mika
18c03f8d93
[RLlib] A2C + A3C move to algorithms folder and re-name into A2C/A3C (from ...Trainer). ( #25314 )
2022-06-01 09:29:16 +02:00
Sven Mika
d95009a3ac
[RLlib] Vectorized envs: Gracefully handle sub-environments failing by restarting them (if configured so). ( #24967 )
2022-05-28 10:50:03 +02:00
Sven Mika
163fa81976
[RLlib] Discussion 6060 and 5120: auto-infer different agents' spaces in multi-agent env. ( #24649 )
2022-05-27 14:56:24 +02:00
Rohan Potdar
ab81c8e9ca
[RLlib]: Rename input_evaluation to off_policy_estimation_methods. ( #25107 )
2022-05-27 13:14:54 +02:00
Avnish Narayan
eaed256d68
[RLlib] Async parallel execution manager. ( #24423 )
2022-05-25 17:54:08 +02:00
Jun Gong
eaf9c941ae
[RLlib] Migrate PPO Impala and APPO policies to use sub-classing implementation. ( #25117 )
2022-05-25 14:38:03 +02:00
Eric Liang
4963dfaae0
[api] Add API stability annotations for all RLlib symbols and add to LINT ( #25060 )
2022-05-24 22:14:25 -07:00
Sven Mika
09886d7ab8
[RLlib] Upgrade gym 0.23 ( #24171 )
2022-05-23 08:18:44 +02:00
Eric Liang
55d039af32
Annotate datasources and add API annotation check script ( #24999 )
...
Why are these changes needed?
Add API stability annotations for datasource classes, and add a linter to check that all data classes have appropriate annotations.
2022-05-21 15:05:07 -07:00
Rohan Potdar
5a70b732e8
[RLlib] MARWIL and BC Config. ( #24853 )
2022-05-21 12:50:20 +02:00
kourosh hakhamaneshi
3815e52a61
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits ( #24896 )
2022-05-19 18:30:42 +02:00
Artur Niederfahrenhorst
86bc9ecce2
[RLlib] DDPG Training iteration fn & Replay Buffer API ( #24212 )
2022-05-05 09:41:38 +02:00
Sven Mika
7cca7782f1
[RLlib] OPE (off policy estimator) API. ( #24384 )
2022-05-02 21:15:50 +02:00
Sven Mika
296e2ebc46
[RLlib] Issue 24082: WorkerSet.policies_to_train (deprecated) - if still used - returns wrong values. ( #24386 )
2022-05-02 18:33:52 +02:00