hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

Author	SHA1	Message	Date
kourosh hakhamaneshi	3b3c20209b	[RLlib] Fix dqn reproducibility (#27459 )	2022-08-09 15:56:44 -07:00
Charles Sun	c358305ca6	[RLlib] DatasetReader action normalization. (#27356 )	2022-08-09 16:54:03 +02:00
Sven Mika	537f7c65c1	[RLlib] CRR framework torch by default. (#27161 )	2022-08-09 16:53:00 +02:00
kourosh hakhamaneshi	b84dd38f01	[RLlib] Add `__getitem__` to `MultiAgentBatch` to access `policy_batches`. (#27619 )	2022-08-09 16:51:26 +02:00
kourosh hakhamaneshi	98b9fa6944	[RLlib] Hotfix for connector tests (#27654 ) hot fix for rllib connector tests Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>	2022-08-08 15:12:47 -07:00
Artur Niederfahrenhorst	4fe47d069f	[RLlib] Require ApeX LR schedule test to produce learner info. (#27557 )	2022-08-08 18:19:02 +02:00
kourosh hakhamaneshi	3b2a8427af	[RLlib] Fix SampleBatch to_device(). (#27572 )	2022-08-08 18:18:33 +02:00
Jun Gong	a61095a480	[RLlib] fix bandit pre-merge tests (#27554 )	2022-08-07 17:48:29 -07:00
Jun Gong	5f07987ab1	[RLlib] Fix connector examples (#27583 )	2022-08-07 17:48:09 -07:00
Jun Gong	f8b2128f16	[RLlib] async_request_test needs to run exclusively. (#27603 )	2022-08-07 17:47:29 -07:00
Avnish Narayan	55209692ee	[RLlib] Deflake MARWIL and BC and remove memory leak from torch MARWIL policy (#27406 )	2022-08-03 16:53:12 -07:00
Rohan Potdar	5b6a58ed28	[RLlib] Add OPE Learning Tests (#27154 )	2022-08-02 17:51:38 -07:00
Avnish Narayan	00f9438101	[RLlib] Training step docs. (#27344 )	2022-08-02 23:41:45 +02:00
Jun Gong	61add8ede6	[RLlib] Fix the last cartpole-crashing premerge test. (#27315 )	2022-08-02 20:08:33 +02:00
kourosh hakhamaneshi	bda5026428	[RLlib] Fix A2C release tests (#27314 )	2022-08-02 10:44:52 -07:00
kourosh hakhamaneshi	8d848890f1	[RLlib] Fix default view_requirement in policy.py (#27255 )	2022-08-02 10:44:07 -07:00
Artur Niederfahrenhorst	a598458c46	[RLlib] Fix complex torch one-hot and flattened layers not being added to module list. (#27304 )	2022-08-01 15:52:28 +02:00
Steven Morad	d0a8e3c36f	[RLlib] User-friendly RNN sequencing. (#27087 )	2022-08-01 15:32:22 +02:00
Steven Morad	77318abfaf	[RLlib] Warn on PPO infinite KL loss term. (#26629 )	2022-08-01 12:55:26 +02:00
Jun Gong	e6e10ce4cf	[RLlib] Revert `41c9ef70`. (#27243 ) Why are these changes needed? Also: Add validation to make sure multi-gpu and micro-batch is not used together. Update A2C learning test to hit the microbatching branch. Minor comment updates.	2022-07-29 11:05:15 -07:00
Kai Fricke	1d3c167bfe	[rllib/release] Fix rllib connect test with Tuner() API (#27155 ) Currently failing because the Tune framework example does not return fitting results. Signed-off-by: Kai Fricke <kai@anyscale.com>	2022-07-28 11:08:02 +01:00
Eric Liang	a4434fac7f	[docs] Fix the remaining style violations in docstrings and add lint rule (#27033 )	2022-07-27 22:24:20 -07:00
xwjiang2010	eb69c1ca28	[air] Add annotation for Tune module. (#27060 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2022-07-27 13:53:46 -07:00
Malinda	1d789aee63	[RLlib/Serve/Release tests] Few code refactoring for better use of efficient NumPy functions. (#26284 )	2022-07-27 22:38:35 +02:00
Jun Gong	e1cf0cc982	[RLlib] Deflake cartpole crashing tests. (#27097 ) Make sure cartpole crashing tests are not flaky.	2022-07-27 12:50:34 -07:00
Jun Gong	acf2bf9b2f	[RLlib] Get rid of all these deprecation warnings. (#27085 )	2022-07-27 10:48:54 -07:00
Kai Fricke	8fda425eca	[tune/rllib] Hotfix ml_utils deprecation import error (#27095 ) The changes conflicted with a recently merged PR that refactored the package structure (#27005). Signed-off-by: Kai Fricke <kai@anyscale.com>	2022-07-27 16:11:58 +01:00
Kai Fricke	a5ea99cf95	[rfc] [tune/rllib] Fetch _progress_metrics from trainable for verbose=2 display (#26967 ) RLLibs trainables produce a large number of metrics which makethe log output with verbose=2 illegible. This PR introduces a private `_progress_metrics` property for trainables. If set, the trial progress callback will only print these metrics per default, unless overridden e.g. with a custom `TrialProgressCallback`.	2022-07-27 16:04:23 +01:00
Amog Kamsetty	862d10c162	[AIR] Remove ML code from `ray.util` (#27005 ) Removes all ML related code from `ray.util` Removes: - `ray.util.xgboost` - `ray.util.lightgbm` - `ray.util.horovod` - `ray.util.ray_lightning` Moves `ray.util.ml_utils` to other locations Closes #23900 Signed-off-by: Amog Kamsetty <amogkamsetty@yahoo.com> Signed-off-by: Kai Fricke <kai@anyscale.com> Co-authored-by: Kai Fricke <kai@anyscale.com>	2022-07-27 14:24:19 +01:00
xwjiang2010	fcf897ee72	[air] update rllib example to use Tuner API. (#26987 ) update rllib example to use Tuner API. Signed-off-by: xwjiang2010 <xwjiang2010@gmail.com>	2022-07-27 12:12:59 +01:00
Jun Gong	c7ae787cc8	[RLlib] Beef up worker failure test. (#26953 )	2022-07-27 00:10:45 -07:00
Jun Gong	a22457b548	[RLlib] Small bug fix (#27003 )	2022-07-27 00:02:18 -07:00
Jun Gong	54df8bfe42	[RLlib] Try to checkpoint a durable policy name (#27016 )	2022-07-27 00:01:14 -07:00
kourosh hakhamaneshi	8ddcf89096	[RLlib] Implemented ViewRequirementConnector (#26998 )	2022-07-26 21:52:14 -07:00
Rohan Potdar	deccf33912	[RLlib]: Add Off-Policy Estimation docs (#26809 ) Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>	2022-07-26 13:57:56 -07:00
Jun Gong	ca5e0dcaf4	[RLLib] Record framework and algorithm used by an RLlib run. (#26956 ) Automatically record framework and algorithm used by RLlib jobs. For better planning.	2022-07-25 16:16:36 -07:00
kourosh hakhamaneshi	5030a4c1d3	[RLlib] Simplify agent collector (#26803 )	2022-07-25 13:17:17 -07:00
Artur Niederfahrenhorst	b1594260ba	[RLlib] Small SlateQ example fix. (#26948 )	2022-07-25 15:12:42 +02:00
Avnish Narayan	41c9ef709a	[RLlib] Using PG when not doing microbatching kills A2C performance. (#26844 )	2022-07-25 15:11:26 +02:00
Artur Niederfahrenhorst	e9a8f7d9ae	[RLlib] Unify gnorm mixin for tf and torch policies. (#26102 )	2022-07-24 15:31:09 +02:00
Ishant Mrinal	b32c784c7f	[RLLib] RE3 exploration algorithm TF2 framework support (#25221 )	2022-07-23 18:05:01 -07:00
Rohan Potdar	a53bbe49bf	[RLlib]: Raise deprecation warning in MARWIL OPE methods. (#26893 )	2022-07-23 13:55:40 +02:00
Rohan Potdar	97bcf38ec0	[RLlib] Fix torch None conversion in `torch_utils.py::convert_to_torch_tensor`. (#26863 )	2022-07-23 13:54:57 +02:00
Rohan Potdar	69f6b843da	[RLlib] Test output length in DatasetReader with default IOContext. (#26852 )	2022-07-23 13:53:59 +02:00
Rohan Potdar	2f22262d39	[RLlib]: Fix SampleBatch.split_by_episode to use dones if episode id is not available (#26492 )	2022-07-22 16:46:05 -07:00
Steven Morad	259429bdc3	Bump gym dep to 0.24 (#26190 ) Co-authored-by: Steven Morad <smorad@anyscale.com> Co-authored-by: Avnish <avnishnarayan@gmail.com> Co-authored-by: Avnish Narayan <38871737+avnishn@users.noreply.github.com>	2022-07-22 12:37:16 -07:00
Avnish Narayan	2a0ef663c9	[rllib] Use compress observations where replay buffers and image obs are used in tuned examples (#26735 )	2022-07-22 10:10:51 -07:00
Fabian Witter	dc2ad6c8b4	[RLlib] Fix ModelCatalog for nested complex inputs (#25620 )	2022-07-22 00:45:25 -07:00
Jun Gong	6c1acd1a2f	[RLlib] Quick state buffer connector fix (#26836 )	2022-07-21 20:43:59 -07:00
Jun Gong	0bc560bd54	[RLlib] Make sure we step() after adding init_obs. (#26827 )	2022-07-21 20:43:46 -07:00

1 2 3 4 5 ...

1385 commits