Max Pumperla
703c161034
[doc] Fix sklearn doc error, introduce MyST markdown parser ( #21527 )
2022-01-12 15:17:28 -08:00
Kai Fricke
236951ee4c
[tune] Introduce TrialCheckpoint class, making checkpoint down/upload easie ( #20585 )
...
This PR introduces a TrialCheckpoint class which is returned e.g. by ExperimentAnalysis.best_checkpoint. The class enables easy access to cloud storage locations (rather than just local directories before). It also comes with utilities to download, upload, and save trial checkpoints to local and cloud targets.
2021-11-22 14:16:26 +00:00
Antoni Baum
3f9ded55f7
[tune] Merge Analysis
into ExperimentAnalysis
( #20197 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-11-16 16:47:12 +00:00
Will Drevo
fa878e2d4d
Added example to user guide for cloud checkpointing ( #20045 )
...
Co-authored-by: will <will@anyscale.com>
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-11-15 15:43:06 +00:00
matthewdeng
790e22f9ad
[tune] move force_on_current_node to ml_utils ( #20211 )
2021-11-10 10:21:24 -08:00
Kai Fricke
9c2b8c8501
[tune] Deprecate DurableTrainable ( #19880 )
2021-11-08 20:56:07 +00:00
Philipp Moritz
a64e32c53b
[docs] Fix broken links in documentation and add linkcheck to documentation ( #20030 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-11-04 13:19:43 -07:00
Antoni Baum
f2773267c7
[docs] Tune doc fixes ( #19791 )
2021-10-29 11:45:29 +02:00
xwjiang2010
7ffd9cbed1
[Tune] Fix column width in doc. ( #19159 )
2021-10-07 18:16:21 +01:00
Antoni Baum
27b8633198
[docs] Remove outdated note in Tune docs ( #19110 )
2021-10-07 15:42:11 +01:00
Kai Fricke
81d3d8705e
[tune] fix docs example for tune qloguniform ( #17539 )
2021-08-03 14:48:22 +01:00
Antoni Baum
b500a651b7
[docs] Add LightGBM Tune integration to docs ( #17304 )
...
* Add LightGBM integration to docs
* Fix
2021-07-23 21:21:13 -07:00
Antoni Baum
2e37826458
[tune] Function API support for ResourceChangingScheduler
( #17150 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-21 14:14:12 -07:00
Antoni Baum
f20311f194
[tune] ResourceChangingScheduler
improvements ( #17082 )
2021-07-15 15:03:27 +01:00
Antoni Baum
6e780ebf07
[tune] ResourceChangingScheduler
dynamic resource allocation during tuning ( #16787 )
2021-07-14 10:45:13 +01:00
Antoni Baum
d71ec6e874
[docs] Add examples of new features to contribute ( #16477 )
2021-06-18 00:07:03 -07:00
Qingyun Wu
dae3ac1def
[Tune] Add new searchers from FLAML ( #16329 )
2021-06-12 02:10:51 -07:00
Kai Fricke
e8f8e9f328
[tune] Adjust searcher sample bounds to match Tune API ( #15899 )
2021-06-11 14:31:08 +01:00
Amog Kamsetty
38b657cb65
[Tune] Place remote tune.run on node running the client server ( #16034 )
...
* force placement on persistent node
* address comments
* doc
2021-05-28 18:32:57 -07:00
Kai Fricke
84b3c3376b
[tune] document scalability best practices (k8s, scalability thresholds) ( #14566 )
...
Adds a new page and table to document current scalability thresholds in Ray Tune to the documentation.
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-25 09:54:14 +01:00
Kai Fricke
43e098402a
[tune] make tune.with_parameters()
work with the class API ( #14532 )
...
* [tune] make `tune.with_parameters()` work with the class API
* Update python/ray/tune/utils/trainable.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-09 09:36:17 +01:00
Kai Fricke
b0bf44b154
[tune/docs] Add high level trial runner flow to documentation ( #14468 )
...
* [tune/docs] Add high level trial runner flow to documentation
* Apply suggestions from code review
2021-03-08 10:35:54 +01:00
Kai Fricke
4014168928
[tune] Introduce durable()
wrapper to convert trainables into durable trainables ( #14306 )
...
* [tune] Introduce `durable()` wrapper to convert trainables into durable trainables
* Fix wrong check
* Improve docs, add FAQ for tackling overhead
* Fix bugs in `tune.with_parameters`
* Update doc/source/tune/api_docs/trainable.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Update doc/source/tune/_tutorials/_faq.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-26 13:59:28 +01:00
Kai Fricke
757866ec01
[tune] enable placement groups per default ( #13906 )
...
* Refactor placement group factory object to accept placement_group arguments instead of callables
* Convert resources to pgf
* Enable placement groups per default
* Fix tests WIP
* Fix stop/resume with placement groups
* Fix progress reporter test
* Fix trial executor tests
* Check resource for trial, not resource object
* Move ENV vars into class
* Fix tests
* Sphinx
* Wait for trial start in PBT
* Revert merge errors
* Support trial reuse with placement groups
* Better check for just staged trials
* Fix trial queuing
* Wait for pg after trial termination
* Clean up PGs before tune run
* No PG settings in pbt scheduler
* Fix buffering tests
* Skip test if ray reports erroneous available resources
* Disable PG for cluster resource counting test
* Debug output for tests
* Output in-use resources for placement groups
* Don't start new trial on trial start failure
* Add docs
* Cleanup PGs once futures returned
* Fix placement group shutdown
* Use updated_queue flag
* Apply suggestions from code review
* Apply suggestions from code review
* Update docs
* Reuse placement groups independently from actors
* Do not remove placement groups for paused trials
* Only continue enqueueing trials if it didn't fail the first time
* Rename parameter
* Fix pause trial
* Code review + try_recover
* Update python/ray/tune/utils/placement_groups.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Move placement group lifecycle management
* Move total used resources to pg manager
* Update FAQ example
* Requeue trial if start was unsuccessful
* Do not cleanup pgs at start of run
* Revert "Do not cleanup pgs at start of run"
This reverts commit 933d9c4c
* Delayed PG removal
* Fix trial requeue test
* Trigger pg cleanup on status update
* Fix tests
* Fix docs
* fix-test
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-23 18:46:02 +01:00
Antoni Baum
58d7398246
[Tune] Add HEBOSearch
Searcher ( #13863 )
...
* HEBO first pass
* Fix bad quotes
* Fixes
* Reproductibility
* Update python/ray/tune/suggest/hebo.py
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
* Add hebo_example.py to BUILD
* Nit
* Update to pypi package
* Alphabetical HEBO requirement
* Fix syntax error
* Fix wrong space in hebo example
* Move validate_warmstart to utils
* Space assertion in HEBO
* Comment
* Apply suggestions from code review
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
* Formatting
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-02-17 22:53:10 +01:00
architkulkarni
28cf5f91e3
[docs] change MLFlow to MLflow in docs ( #13739 )
2021-01-27 16:53:15 -08:00
Amog Kamsetty
20016c983f
[Tune] MLflow Credentials ( #13533 )
2021-01-19 11:55:13 -08:00
Lavanya Shukla
350917958c
[docs] fix wandb url ( #13094 )
2020-12-28 17:19:17 -08:00
Antoni Baum
a4f2dd2138
[Tune]Add integer loguniform support ( #12994 )
...
* Add integer quantization and loguniform support
* Fix hyperopt qloguniform not being np.log'd first
* Add tests, __init__
* Try to fix tests, better exceptions
* Tweak docstrings
* Type checks in SearchSpaceTest
* Update docs
* Lint, tests
* Update doc/source/tune/api_docs/search_space.rst
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2020-12-23 09:27:16 -08:00
Amog Kamsetty
5d3c9c8861
[Tune] Mlflow Integration ( #12840 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-19 00:40:02 -08:00
Kai Fricke
3d72000826
[tune] Add points_to_evaluate
to BasicVariantGenerator ( #12916 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-17 19:16:03 -08:00
Kai Fricke
5f04ade6ef
[tune] add more stoppers and stopper documentation ( #12750 )
...
* Add new stoppers & docs
* Add tests for maximum iteration stopper and trial plateau stopper
* Update python/ray/tune/stopper.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Update doc/source/tune/api_docs/stoppers.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Update doc/source/tune/api_docs/stoppers.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Apply suggestions from code review
* Apply suggestions from code review
* Update python/ray/tune/stopper.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 01:47:19 -08:00
Kai Fricke
9f5986ee58
[tune] logger migration to ExperimentLogger classes ( #11984 )
2020-11-16 15:08:37 -08:00
Keqiu Hu
0c1bdaef59
[tune] TensorFlow Distributed Trainable ( #11876 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-10 14:59:08 -08:00
Richard Liaw
efa07d5403
Revert "Revert "[tune] PB2 ( #11466 )" ( #11795 )" ( #11812 )
2020-11-04 20:47:12 -08:00
Amog Kamsetty
7248d5f4ae
Revert "[tune] PB2 ( #11466 )" ( #11795 )
...
This reverts commit e7aafd7d24
.
2020-11-03 21:05:00 -08:00
Kai Fricke
f7b19c41e3
[tune] logger refactor part 1: move classes and utilities to own files ( #11746 )
...
* [tune] logger refactor part 1: move classes and utilities to own files
* Fix circular dependency
* Remove uneeded pretty print copy
* Apply suggestions from code review
2020-11-03 07:48:09 -08:00
Jack Parker-Holder
e7aafd7d24
[tune] PB2 ( #11466 )
...
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-27 01:03:21 -07:00
Richard Liaw
b02e61f672
[minor] fix up docs ( #11596 )
...
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-26 12:19:03 -07:00
Richard Liaw
56f858ed1a
[tune][docs/util] gputil check, docs ( #11260 )
...
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-10-10 00:54:31 -07:00
Sumanth Ratna
14d8826e43
Fix overriden typo ( #11227 )
2020-10-07 19:11:07 -07:00
Kai Fricke
bdf647c4ec
[tune] docker syncer ( #11035 )
...
* Add DockerSyncer
* Add docs
* Update python/ray/tune/integration/docker.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Updated docs
* fix dir
* Added docker integration test
* added docker integration test to bazel build
* Use sdk.rsync API
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 11:59:23 -07:00
Kai Fricke
e7315b0856
[tune] Callbacks for tune runs ( #11001 )
2020-09-27 16:50:07 -07:00
Kai Fricke
508cfa3540
[tune] Support yield
and return
statements ( #10857 )
...
* Support `yield` and `return` statements in Tune trainable functions
* Support anonymous metric with ``tune.report(value)``
* Raise on invalid return/yield value
* Fix end to end reporter test
2020-09-17 20:18:35 -07:00
Kai Fricke
7eaf063f29
[tune] wrapper function to pass arbitrary objects through the object store to trainables ( #10679 )
2020-09-10 17:39:44 -07:00
Richard Liaw
153813936b
[tune] auto infer metrics ( #10663 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-09-09 09:53:47 -07:00
Kai Fricke
d7c7aba99c
[tune] Tune experiment analysis improvements ( #10645 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-08 21:00:52 -07:00
Kai Fricke
088f8ebb69
[tune] Add algorithms for search space conversion ( #10621 )
2020-09-07 13:44:16 -07:00
Richard Liaw
551c597312
[tune] API revamp fix ( #10518 )
2020-09-05 15:34:53 -07:00
Sumanth Ratna
54215ff287
[tune] implement shim instantiation ( #10456 )
...
* Create ray.tune.suggest.create.create_scheduler
* Update __init__.py
* Resolve conflict in __init__.py
* Create ray.tune.schedulers.create.create_scheduler
* Update __init__.py
* Move create_scheduler to tune.schedulers.__init__
* Move create_searcher to tune.suggest.__init__
* Delete tune.suggest.create
* Delete tune.schedulers.create
* Update imports for shim functions in tune.__init__
* Remove shim from tune.suggest.__init__.__all__
* Remove shim from tune.schedulers.__init__.__all__
* Add ShimCreationTest
* Move ShimCreationTest to test_api
* Delete test_shim.py
* Add docstring for ray.tune.create_scheduler
* Add docstring to ray.tune.create_searcher
* Fix typo in ray.tune.create_scheduler docstring
* Fix lint errors in tune.schedulers.__init__
* Fix lint errors in tune.suggest.__init__
* Fix lint errors in tune.suggest.__init__
* Fix lint errors in tune.schedulers.__init__
* Fix imports in test_api
* Fix lint errors in test_api
* Fix kwargs in create_searcher
* Fix kwargs in create_scheduler
* Merge branch 'master' into shim-instantiation
* Update use-case in docs in tune.create_scheduler
* Update use-case in docs in tune.create_searcher
* Remove duplicate pytest run from test_api
* Add check to create_searcher
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Add check to create_scheduler
* lint
* Compare types of instances in test_api
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Add tune.create_searcher to docs
* Fix doc build
* Fix tests
* Add tune.create_scheduler to docs
* Fix tests
* Fix lint errors
* Update Ax search for master
* Fix metric kwarg for Ax in test_api
* Fix doc build
* Fix HyperOptSearch import in test_api
* Fix HyperOptSearch import in create_searcher
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-05 09:36:42 -07:00