xwjiang2010
9af8f11191
Revert "[docs] Clean up doc structure (first part) ( #21667 )" ( #21763 )
...
This reverts commit 38e46c9fb3
.
2022-01-20 15:30:56 -08:00
Max Pumperla
38e46c9fb3
[docs] Clean up doc structure (first part) ( #21667 )
2022-01-20 16:19:04 +01:00
Antoni Baum
3f9ded55f7
[tune] Merge Analysis
into ExperimentAnalysis
( #20197 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-11-16 16:47:12 +00:00
Will Drevo
fa878e2d4d
Added example to user guide for cloud checkpointing ( #20045 )
...
Co-authored-by: will <will@anyscale.com>
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-11-15 15:43:06 +00:00
Kai Fricke
d88fdd6e38
[tune] refactor SyncConfig ( #20155 )
2021-11-12 09:36:15 +00:00
Edward Oakes
082a4af3e6
[serve] Remove lingering backend/endpoint wording in docs ( #20229 )
2021-11-10 16:49:29 -08:00
Kai Fricke
9c2b8c8501
[tune] Deprecate DurableTrainable ( #19880 )
2021-11-08 20:56:07 +00:00
Amog Kamsetty
b1f24768a1
[Tune] More fixes to PTL Tutorial ( #20065 )
...
* ptl-fix-2
* improve
* fix
2021-11-08 09:13:44 -08:00
Philipp Moritz
a64e32c53b
[docs] Fix broken links in documentation and add linkcheck to documentation ( #20030 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-11-04 13:19:43 -07:00
Amog Kamsetty
f67b526b7a
[Tune] Fix PTL tutorial docs ( #19999 )
2021-11-04 09:21:28 -07:00
Philipp Moritz
0a5942d8b0
[Documentation] Fix quotes for windows installations ( #19859 )
...
* [Documentation] Fix quotes for windows installations
* update
* formatting
2021-10-29 10:54:38 -07:00
matthewdeng
4674c78050
[Train] Rename Ray SGD v2 to Ray Train ( #19436 )
2021-10-18 22:27:46 -07:00
Antoni Baum
cc3199b814
[docs] Provide information about resource deadlocks, early stopping in Tune docs ( #18947 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-10-01 13:52:47 +01:00
Amog Kamsetty
3b77840c1b
PyTorch Lightning Updates ( #17876 )
2021-08-27 23:15:51 -07:00
Antoni Baum
6e780ebf07
[tune] ResourceChangingScheduler
dynamic resource allocation during tuning ( #16787 )
2021-07-14 10:45:13 +01:00
Amog Kamsetty
33d798f8fc
[Docs] Add e2e guide on using Pytorch Lightning with Ray ( #16484 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-19 10:04:58 -07:00
Amog Kamsetty
04863d158a
[Tune] MLflow with Ray Client ( #16029 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-06-01 09:50:44 -07:00
Edward Oakes
82410f20b2
[serve] Add warning + docstring for anonymous namespaces ( #15921 )
2021-05-20 22:27:15 -05:00
Edward Oakes
c9550a86dc
[serve] Update docs for v2 Deployments API ( #15582 )
2021-05-03 13:19:34 -05:00
Richard Liaw
f4b2dd94b2
[tune] Cache MNIST and restore MNIST tests ( #15260 )
2021-04-13 14:20:26 -07:00
Kai Fricke
84b3c3376b
[tune] document scalability best practices (k8s, scalability thresholds) ( #14566 )
...
Adds a new page and table to document current scalability thresholds in Ray Tune to the documentation.
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-25 09:54:14 +01:00
Amog Kamsetty
7ee2e4185b
[Tune] PTL Fractional GPUs ( #14781 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-18 17:07:51 -07:00
Kai Fricke
43e098402a
[tune] make tune.with_parameters()
work with the class API ( #14532 )
...
* [tune] make `tune.with_parameters()` work with the class API
* Update python/ray/tune/utils/trainable.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-09 09:36:17 +01:00
Kai Fricke
4014168928
[tune] Introduce durable()
wrapper to convert trainables into durable trainables ( #14306 )
...
* [tune] Introduce `durable()` wrapper to convert trainables into durable trainables
* Fix wrong check
* Improve docs, add FAQ for tackling overhead
* Fix bugs in `tune.with_parameters`
* Update doc/source/tune/api_docs/trainable.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Update doc/source/tune/_tutorials/_faq.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-26 13:59:28 +01:00
Kai Fricke
757866ec01
[tune] enable placement groups per default ( #13906 )
...
* Refactor placement group factory object to accept placement_group arguments instead of callables
* Convert resources to pgf
* Enable placement groups per default
* Fix tests WIP
* Fix stop/resume with placement groups
* Fix progress reporter test
* Fix trial executor tests
* Check resource for trial, not resource object
* Move ENV vars into class
* Fix tests
* Sphinx
* Wait for trial start in PBT
* Revert merge errors
* Support trial reuse with placement groups
* Better check for just staged trials
* Fix trial queuing
* Wait for pg after trial termination
* Clean up PGs before tune run
* No PG settings in pbt scheduler
* Fix buffering tests
* Skip test if ray reports erroneous available resources
* Disable PG for cluster resource counting test
* Debug output for tests
* Output in-use resources for placement groups
* Don't start new trial on trial start failure
* Add docs
* Cleanup PGs once futures returned
* Fix placement group shutdown
* Use updated_queue flag
* Apply suggestions from code review
* Apply suggestions from code review
* Update docs
* Reuse placement groups independently from actors
* Do not remove placement groups for paused trials
* Only continue enqueueing trials if it didn't fail the first time
* Rename parameter
* Fix pause trial
* Code review + try_recover
* Update python/ray/tune/utils/placement_groups.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Move placement group lifecycle management
* Move total used resources to pg manager
* Update FAQ example
* Requeue trial if start was unsuccessful
* Do not cleanup pgs at start of run
* Revert "Do not cleanup pgs at start of run"
This reverts commit 933d9c4c
* Delayed PG removal
* Fix trial requeue test
* Trigger pg cleanup on status update
* Fix tests
* Fix docs
* fix-test
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-23 18:46:02 +01:00
javi-redondo
b8b2d6410d
[docs] new Ray Cluster documentation ( #13839 )
...
Co-authored-by: Javier Redondo <javier@anyscale.com>
Co-authored-by: AmeerHajAli <ameerh@berkeley.edu>
2021-02-15 00:47:14 -08:00
architkulkarni
28cf5f91e3
[docs] change MLFlow to MLflow in docs ( #13739 )
2021-01-27 16:53:15 -08:00
Amog Kamsetty
0452a3a435
[Tune] Rename MLFlow to MLflow ( #13301 )
2021-01-11 17:36:55 -08:00
Kai Fricke
97211a6170
[Tune] Fix tune serve integration example ( #13233 )
2021-01-06 17:02:04 +01:00
Lavanya Shukla
350917958c
[docs] fix wandb url ( #13094 )
2020-12-28 17:19:17 -08:00
Amog Kamsetty
5d3c9c8861
[Tune] Mlflow Integration ( #12840 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-19 00:40:02 -08:00
Kai Fricke
9f5986ee58
[tune] logger migration to ExperimentLogger classes ( #11984 )
2020-11-16 15:08:37 -08:00
Richard Liaw
1b357533b1
[tune] Try to enable PTL, SKlearn tests ( #11542 )
2020-10-24 01:08:46 -07:00
Kai Fricke
2f74fe5b71
[tune/docs] Add PTL example to tune docs/examples ( #11474 )
2020-10-19 14:47:58 -07:00
Sumanth Ratna
92a58aabce
[tune][docs] Fix learning rate bounds in FAQ ( #11345 )
2020-10-12 09:44:53 -07:00
Richard Liaw
56f858ed1a
[tune][docs/util] gputil check, docs ( #11260 )
...
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-10-10 00:54:31 -07:00
Sumanth Ratna
98ebf8e2d8
[tune][docs] fix typo in Tune FAQ ( #11161 )
...
* Fix typo in tune FAQ (used to use)
* Update doc/source/tune/_tutorials/_faq.rst
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 11:20:41 -07:00
Kai Fricke
b8f344f695
[tune] add faq entry for reproducing experiments (setting seeds etc) ( #11106 )
2020-09-29 14:48:39 -07:00
Richard Liaw
a563344bc2
[docs] remove ref to google groups -> github discussions ( #11019 )
2020-09-24 18:09:51 -07:00
Kai Fricke
d9c4dea7cf
[tune] strict metric checking ( #10972 )
2020-09-24 10:00:48 -07:00
Kai Fricke
50d63b8077
[tune] update pt tutorial docs ( #10925 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 13:33:37 -07:00
Richard Liaw
b0ca70f628
[tune+core] tune lifecycle and starting ray guide ( #10813 )
2020-09-21 11:27:50 -07:00
Ian Rodney
5bc2ba38fd
[docker] Detect CPUs in container correctly ( #10507 )
...
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2020-09-13 23:40:48 -07:00
Kai Fricke
7eaf063f29
[tune] wrapper function to pass arbitrary objects through the object store to trainables ( #10679 )
2020-09-10 17:39:44 -07:00
Kai Fricke
756a9ea641
[tune] add mode/metric parameters to tune.run ( #10627 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-08 17:06:21 -07:00
Richard Liaw
551c597312
[tune] API revamp fix ( #10518 )
2020-09-05 15:34:53 -07:00
Edward Oakes
34bda32054
[tune/serve] Fix tune/serve integration script broken by serve API change ( #10586 )
2020-09-04 17:11:58 -07:00
Sumanth Ratna
89bf262130
[tune] Fix lr typo in FAQ ( #10548 )
2020-09-03 13:37:39 -07:00
krfricke
57c4183724
[tune] add xgboost callbacks to integration module ( #10502 )
2020-09-02 11:16:09 -07:00
Richard Liaw
3f98a8bfcb
[docs] Fix warnings for sphinx 1.8 ( #10476 )
...
* fix-build-for-sphinx18
* jnilit
2020-09-01 13:37:35 -07:00