hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Author	SHA1	Message	Date
Antoni Baum	ddb5572040	[Tune/CI] Fix Hyperopt notebook example (#26469 ) Fixes failing hyperopt notebook in CI (as found in #26410). The cause was a mismatch between keys in points to evaluate and the search space - now, an informative exception will be raised. Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>	2022-07-13 16:50:11 +01:00
Antoni Baum	9b2cd29511	[CI] Install Horovod in doc tests to fix notebook (#26476 ) Fixes the Horovod notebook example as found in #26410 by installing Horovod in doc tests jobs. Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>	2022-07-13 16:27:20 +01:00
Antoni Baum	67a7ffa6b4	[Tune/CI] Fix BOHB notebook example (#26473 ) Fixes the BOHB notebook example as found in #26410 Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>	2022-07-13 10:35:38 +01:00
Antoni Baum	e48d381926	[Tune/CI] Fix Tune-Pytorch-CIFAR notebook example (#26474 ) Fixes the Tune-Pytorch-CIFAR notebook example as found in #26410 Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>	2022-07-13 10:28:30 +01:00
Kai Fricke	753f5feaf4	[tune] Remove TrialCheckpoint class (#25406 ) The old user-facing TrialCheckpoint class has been deprecated in favor of `ray.ml.Checkpoint` and will be removed with this PR. The main change in this PR is to delete the old `TrialCheckpoint` class and replace remaining API calls (e.g. `checkpoint.local_path`) with the correct AIR equivalents. One issue that comes up is that with Ray client usage, checkpoint directories are not available on the local node (the client). Thus, we can't construct `Checkpoint` objects easily. (Previously, the TrialCheckpoint object held a reference to the location, even if it is not locally available). There are ongoing discussions on how to resolve this in the future. For now, we print an error when such a checkpoint is requested. Depends on #25805 Signed-off-by: Kai Fricke <kai@anyscale.com>	2022-07-11 20:08:10 +01:00
xwjiang2010	c97d65e64f	[tune] fix hebo_example. (#26439 ) Fixes a bug in the the ipython notebook.	2022-07-11 17:12:10 +01:00
Richard Liaw	5892a76a44	[air/tune] Documentation testing fixes (#26409 )	2022-07-09 19:47:21 -07:00
Antoni Baum	ea94cda1f3	[AIR] Replace `train.` with `session.` (#26303 ) This PR replaces legacy API calls to `train.` with AIR `session.` in Train code, examples and docs. Depends on https://github.com/ray-project/ray/pull/25735	2022-07-07 16:29:04 -07:00
xwjiang2010	ac831fded4	[air] update documentation to use `session.report` (#26051 ) Update documentation to use `session.report`. Next steps: 1. Update our internal caller to use `session.report`. Most importantly, CheckpointManager and DataParallelTrainer. 2. Update `get_trial_resources` to use PGF notions to incorporate the requirement of ResourceChangingScheduler. @Yard1 3. After 2 is done, change all `tune.get_trial_resources` to `session.get_trial_resources` 4. [internal implementation] remove special checkpoint handling logic from huggingface trainer. Optimize the flow for checkpoint conversion with `session.report`. Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2022-06-30 10:37:31 -07:00
Antoni Baum	128f9e5664	[AIR] Move integration logging callbacks to AIR (#26126 ) As the integration logging callbacks are commonly used with AIR Trainers, they should be moved from the tune package to the air package. The old imports will still work, but raise a deprecation warning.	2022-06-28 17:25:19 -07:00
Kai Fricke	75d08b0632	[tune/structure] Refactor `suggest` into `search` package (#26074 ) This PR renames the `suggest` package to `search` and alters the layout slightly. In the new package, the higher-level abstractions are on the top level and the search algorithms have their own subdirectories. In a future refactor, we can turn algorithms such as PBT into actual `SearchAlgorithm` classes and move them into the `search` package. The main reason to keep algorithms and searchers in the same directory is to avoid user confusion - for a user, `Bayesopt` is as much a search algorithm as e.g. `PBT`, so it doesn't make sense to split them up.	2022-06-25 14:55:30 +01:00
Amog Kamsetty	1316a2d05e	[AIR/Train] Move `ray.air.train` to `ray.train` (#25570 )	2022-06-08 21:34:18 -07:00
Kai Fricke	8affbc7be6	[tune/train] Consolidate checkpoint manager 3: Ray Tune (#24430 ) Update: This PR is now part 3 of a three PR group to consolidate the checkpoints. 1. Part 1 adds the common checkpoint management class #24771 2. Part 2 adds the integration for Ray Train #24772 3. This PR builds on #24772 and includes all changes. It moves the Ray Tune integration to use the new common checkpoint manager class. Old PR description: This PR consolidates the Ray Train and Tune checkpoint managers. These concepts previously did something very similar but in different modules. To simplify maintenance in the future, we've consolidated the common core. - This PR keeps full compatibility with the previous interfaces and implementations. This means that for now, Train and Tune will have separate CheckpointManagers that both extend the common core - This PR prepares Tune to move to a CheckpointStrategy object - In follow-up PRs, we can further unify interfacing with the common core, possibly removing any train- or tune-specific adjustments (e.g. moving to setup on init rather on runtime for Ray Train) Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2022-06-08 12:05:34 +01:00
Kai Fricke	4b9a89ad90	[air] Move python/ray/ml to python/ray/air (#25449 ) The package "ml" should be renamed to "air". Main question: Keep a `ml.py` with `from ray.air import *` for some level of backwards compatibility? I'd go for no to force people to use the new structure.	2022-06-03 21:53:44 +01:00
Kai Fricke	f0fa8e54f8	[tune] Remove DurableTrainable class (#25405 ) The DurableTrainable is deprecated (every trainable is a durable trainable). This PR removes it from the Tune library and a related example.	2022-06-03 10:16:02 +01:00
Antoni Baum	c74886a55e	[CI] Run doc notebooks in CI (#24816 ) Currently, we are not running doc notebooks in CI due to a bazel misconfiguration - we are using `glob` in a top level package in order to get the paths for the notebooks, but those are contained inside subpackages, which glob purposefully ignores. Therefore, the lists of notebooks to run are empty. This PR fixes that by: * Running the `py_test_run_all_notebooks` macro inside the relevant subpackages * Editing the `test_myst_doc.py` script to allow for recursive search for the target file, allowing to deal with mismatches between `name` and `data` arguments in `py_test_run_all_notebooks` * Setting the `allow_empty=False` flag inside `glob` calls in our macros to ensure that this oversight is caught early * Enabling detection of changes in doc folder for `*.ipynb` and `BUILD` files This PR also adds a GPU runner for doc tests, allowing one of our examples to pass - and setting the infra for more to come. Finally, a misconfigured path for one set of doc tests is also fixed.	2022-05-17 09:50:42 +01:00
Max Pumperla	cd5218f831	[docs] Tune examples better navigation, minor fixes (#24733 ) Replaces #24225 and adds example navigation Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>	2022-05-13 14:39:18 +01:00
Amog Kamsetty	a36e2a8f51	[Tune] Deprecate DistributedTrainableCreator (#24453 ) Fully deprecate DistributedTrainableCreator for Ray 2.0 Closes #24453	2022-05-10 11:06:43 -07:00
Amog Kamsetty	ae9c68e75f	[Train] Fully deprecate Ray SGD v1 (#24038 ) Ray SGD v1 has been denoted as a deprecated API for a while. This PR fully deprecates Ray SGD v1. An error will be raised if ray.util.sgd package is attempted to be imported. Closes #16435	2022-04-25 16:12:57 -07:00
Brett Göhre	9e0a59d94a	[docs] search algorithm notebook examples (#23924 ) Co-authored-by: brettskymind <brett@pathmind.com> Co-authored-by: Max Pumperla <max.pumperla@googlemail.com>	2022-04-25 11:10:58 -07:00
Brett Göhre	f5e492ea8a	[Docs] optuna notebook (#23477 )	2022-03-25 09:04:53 +01:00
Philipp Moritz	886cc4d674	Fix broken links in documentation and put linkcheck linter in place on CI (#23340 )	2022-03-18 21:02:52 -07:00
Eric Liang	c8f207f746	[docs] Core docs refactor (#23216 ) This PR makes a number of major overhauls to the Ray core docs: Add a key-concepts section for {Tasks, Actors, Objects, Placement Groups, Env Deps}. Re-org the user guide to align with key concepts. Rewrite the walkthrough to link to mini-walkthroughs in the key concept sections. Minor tweaks and additional transition material.	2022-03-17 11:26:17 -07:00
Max Pumperla	7d4296c72f	run code in browser (#22727 ) Example for running notebooks on our docs directly in the browser by connecting to a binder instance launched on demand. If this seems useful we can extend this to other examples gradually. Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>	2022-03-02 10:27:00 +01:00
Max Pumperla	372c620f58	[docs] Tune overhaul part II (#22656 ) Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2022-02-26 23:07:34 -08:00
Max Pumperla	29d94a2211	[docs] sphinx gallery removal, migrate to ipynb (#22467 )	2022-02-19 01:19:07 -08:00
Max Pumperla	d594b668bb	[docs] [tune] hyperopt notebook (#22315 )	2022-02-12 02:46:03 -08:00
Max Pumperla	5cc9355303	[Docs ] Tune docs overhaul (first part) (#22112 ) Continuing docs overhaul, tune now has: - [x] better landing page - [x] a getting started guide - [x] user guide was cut down, partially merged with FAQ, and partially integrated with tutorials - [x] the new user guide contains guides to tune features and practical integrations - [x] we rewrote some of the feature guides for clarity - [x] we got rid of sphinx-gallery for this sub-project (only data and core left), as it looks bad and is unnecessarily complicated anyway (plus, makes the build slower) - [x] sphinx-gallery examples are now moved to markdown notebook, as started in #22030. - [x] Examples are tested in the new framework, of course. There's still a lot one can do, but this is already getting too large. Will follow up with more fine-tuning next week. Co-authored-by: Antoni Baum <antoni.baum@protonmail.com> Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>	2022-02-07 15:47:03 +00:00
Dhruv Nair	3d79815cd0	Comet Integration (#20766 ) This PR adds a `CometLoggerCallback` to the Tune Integrations, allowing users to log runs from Ray to [Comet](https://www.comet.ml/site/). Co-authored-by: Michael Cullan <mjcullan@gmail.com> Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2022-01-25 11:42:00 -08:00
xwjiang2010	9af8f11191	Revert "[docs] Clean up doc structure (first part) (#21667 )" (#21763 ) This reverts commit `38e46c9fb3`.	2022-01-20 15:30:56 -08:00
Max Pumperla	38e46c9fb3	[docs] Clean up doc structure (first part) (#21667 )	2022-01-20 16:19:04 +01:00
Antoni Baum	0b14f38ac7	[tune] Multi-objective support for Optuna (#20489 ) This PR adds multi-objective support for Optuna searchers, including a test and example. Co-authored-by: gjoliver <jungong@anyscale.com>	2021-11-18 18:47:29 +00:00
Will Drevo	fa878e2d4d	Added example to user guide for cloud checkpointing (#20045 ) Co-authored-by: will <will@anyscale.com> Co-authored-by: Antoni Baum <antoni.baum@protonmail.com> Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-11-15 15:43:06 +00:00
Philipp Moritz	a64e32c53b	[docs] Fix broken links in documentation and add linkcheck to documentation (#20030 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-11-04 13:19:43 -07:00
Ryan L. Melvin	c081c68de7	[tune] Conditional search space example using hyperopt (#18130 ) Co-authored-by: Ryan Melvin <rmelvin@uabmc.edu> Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2021-08-31 17:06:22 +02:00
Antoni Baum	c40555c82b	[tune] Add define-by-run support to `OptunaSearcher` (#17464 )	2021-08-03 16:11:58 +01:00
Qingyun Wu	7678503d84	[Tune][docs]Correct reference name to CFO example (#17503 )	2021-08-02 14:46:10 +01:00
Antoni Baum	6e780ebf07	[tune] `ResourceChangingScheduler` dynamic resource allocation during tuning (#16787 )	2021-07-14 10:45:13 +01:00
Qingyun Wu	dae3ac1def	[Tune] Add new searchers from FLAML (#16329 )	2021-06-12 02:10:51 -07:00
Antoni Baum	58d7398246	[Tune] Add `HEBOSearch` Searcher (#13863 ) * HEBO first pass * Fix bad quotes * Fixes * Reproductibility * Update python/ray/tune/suggest/hebo.py Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com> * Add hebo_example.py to BUILD * Nit * Update to pypi package * Alphabetical HEBO requirement * Fix syntax error * Fix wrong space in hebo example * Move validate_warmstart to utils * Space assertion in HEBO * Comment * Apply suggestions from code review Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com> * Formatting Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>	2021-02-17 22:53:10 +01:00
architkulkarni	28cf5f91e3	[docs] change MLFlow to MLflow in docs (#13739 )	2021-01-27 16:53:15 -08:00
Lavanya Shukla	350917958c	[docs] fix wandb url (#13094 )	2020-12-28 17:19:17 -08:00
Amog Kamsetty	5d3c9c8861	[Tune] Mlflow Integration (#12840 ) Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-12-19 00:40:02 -08:00
Kai Fricke	3d72000826	[tune] Add `points_to_evaluate` to BasicVariantGenerator (#12916 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-12-17 19:16:03 -08:00
Richard Liaw	8b3f79f307	[tune] refactor and add examples (#11931 )	2020-11-14 20:43:28 -08:00
Richard Liaw	efa07d5403	Revert "Revert "[tune] PB2 (#11466 )" (#11795 )" (#11812 )	2020-11-04 20:47:12 -08:00
Amog Kamsetty	7248d5f4ae	Revert "[tune] PB2 (#11466 )" (#11795 ) This reverts commit `e7aafd7d24`.	2020-11-03 21:05:00 -08:00
Jack Parker-Holder	e7aafd7d24	[tune] PB2 (#11466 ) Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com> Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com> Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-10-27 01:03:21 -07:00
Richard Liaw	e7aa6441b7	[tune] a tiny ptl example (#11497 )	2020-10-22 18:50:34 -07:00
Kai Fricke	2f74fe5b71	[tune/docs] Add PTL example to tune docs/examples (#11474 )	2020-10-19 14:47:58 -07:00

1 2

59 commits