hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 11:31:40 -05:00

Author	SHA1	Message	Date
xwjiang2010	9af8f11191	Revert "[docs] Clean up doc structure (first part) (#21667)" (#21763) This reverts commit `38e46c9fb3`.	2022-01-20 15:30:56 -08:00
Max Pumperla	38e46c9fb3	[docs] Clean up doc structure (first part) (#21667)	2022-01-20 16:19:04 +01:00
Antoni Baum	3f9ded55f7	[tune] Merge `Analysis` into `ExperimentAnalysis` (#20197) Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-11-16 16:47:12 +00:00
Will Drevo	fa878e2d4d	Added example to user guide for cloud checkpointing (#20045) Co-authored-by: will <will@anyscale.com> Co-authored-by: Antoni Baum <antoni.baum@protonmail.com> Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-11-15 15:43:06 +00:00
Kai Fricke	d88fdd6e38	[tune] refactor SyncConfig (#20155)	2021-11-12 09:36:15 +00:00
Edward Oakes	082a4af3e6	[serve] Remove lingering backend/endpoint wording in docs (#20229)	2021-11-10 16:49:29 -08:00
Kai Fricke	9c2b8c8501	[tune] Deprecate DurableTrainable (#19880)	2021-11-08 20:56:07 +00:00
Amog Kamsetty	b1f24768a1	[Tune] More fixes to PTL Tutorial (#20065) * ptl-fix-2 * improve * fix	2021-11-08 09:13:44 -08:00
Philipp Moritz	a64e32c53b	[docs] Fix broken links in documentation and add linkcheck to documentation (#20030) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-11-04 13:19:43 -07:00
Amog Kamsetty	f67b526b7a	[Tune] Fix PTL tutorial docs (#19999)	2021-11-04 09:21:28 -07:00
Philipp Moritz	0a5942d8b0	[Documentation] Fix quotes for windows installations (#19859) * [Documentation] Fix quotes for windows installations * update * formatting	2021-10-29 10:54:38 -07:00
matthewdeng	4674c78050	[Train] Rename Ray SGD v2 to Ray Train (#19436)	2021-10-18 22:27:46 -07:00
Antoni Baum	cc3199b814	[docs] Provide information about resource deadlocks, early stopping in Tune docs (#18947) Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>	2021-10-01 13:52:47 +01:00
Amog Kamsetty	3b77840c1b	PyTorch Lightning Updates (#17876)	2021-08-27 23:15:51 -07:00
Antoni Baum	6e780ebf07	[tune] `ResourceChangingScheduler` dynamic resource allocation during tuning (#16787)	2021-07-14 10:45:13 +01:00
Amog Kamsetty	33d798f8fc	[Docs] Add e2e guide on using Pytorch Lightning with Ray (#16484) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-06-19 10:04:58 -07:00
Amog Kamsetty	04863d158a	[Tune] MLflow with Ray Client (#16029) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-06-01 09:50:44 -07:00
Edward Oakes	82410f20b2	[serve] Add warning + docstring for anonymous namespaces (#15921)	2021-05-20 22:27:15 -05:00
Edward Oakes	c9550a86dc	[serve] Update docs for v2 Deployments API (#15582)	2021-05-03 13:19:34 -05:00
Richard Liaw	f4b2dd94b2	[tune] Cache MNIST and restore MNIST tests (#15260)	2021-04-13 14:20:26 -07:00
Kai Fricke	84b3c3376b	[tune] document scalability best practices (k8s, scalability thresholds) (#14566) Adds a new page and table to document current scalability thresholds in Ray Tune to the documentation. Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-03-25 09:54:14 +01:00
Amog Kamsetty	7ee2e4185b	[Tune] PTL Fractional GPUs (#14781) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-03-18 17:07:51 -07:00
Kai Fricke	43e098402a	[tune] make `tune.with_parameters()` work with the class API (#14532) * [tune] make `tune.with_parameters()` work with the class API * Update python/ray/tune/utils/trainable.py Co-authored-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-03-09 09:36:17 +01:00
Kai Fricke	4014168928	[tune] Introduce `durable()` wrapper to convert trainables into durable trainables (#14306) * [tune] Introduce `durable()` wrapper to convert trainables into durable trainables * Fix wrong check * Improve docs, add FAQ for tackling overhead * Fix bugs in `tune.with_parameters` * Update doc/source/tune/api_docs/trainable.rst Co-authored-by: Richard Liaw <rliaw@berkeley.edu> * Update doc/source/tune/_tutorials/_faq.rst Co-authored-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-02-26 13:59:28 +01:00
Kai Fricke	757866ec01	[tune] enable placement groups per default (#13906) * Refactor placement group factory object to accept placement_group arguments instead of callables * Convert resources to pgf * Enable placement groups per default * Fix tests WIP * Fix stop/resume with placement groups * Fix progress reporter test * Fix trial executor tests * Check resource for trial, not resource object * Move ENV vars into class * Fix tests * Sphinx * Wait for trial start in PBT * Revert merge errors * Support trial reuse with placement groups * Better check for just staged trials * Fix trial queuing * Wait for pg after trial termination * Clean up PGs before tune run * No PG settings in pbt scheduler * Fix buffering tests * Skip test if ray reports erroneous available resources * Disable PG for cluster resource counting test * Debug output for tests * Output in-use resources for placement groups * Don't start new trial on trial start failure * Add docs * Cleanup PGs once futures returned * Fix placement group shutdown * Use updated_queue flag * Apply suggestions from code review * Apply suggestions from code review * Update docs * Reuse placement groups independently from actors * Do not remove placement groups for paused trials * Only continue enqueueing trials if it didn't fail the first time * Rename parameter * Fix pause trial * Code review + try_recover * Update python/ray/tune/utils/placement_groups.py Co-authored-by: Richard Liaw <rliaw@berkeley.edu> * Move placement group lifecycle management * Move total used resources to pg manager * Update FAQ example * Requeue trial if start was unsuccessful * Do not cleanup pgs at start of run * Revert "Do not cleanup pgs at start of run" This reverts commit 933d9c4c * Delayed PG removal * Fix trial requeue test * Trigger pg cleanup on status update * Fix tests * Fix docs * fix-test Signed-off-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-02-23 18:46:02 +01:00
javi-redondo	b8b2d6410d	[docs] new Ray Cluster documentation (#13839) Co-authored-by: Javier Redondo <javier@anyscale.com> Co-authored-by: AmeerHajAli <ameerh@berkeley.edu>	2021-02-15 00:47:14 -08:00
architkulkarni	28cf5f91e3	[docs] change MLFlow to MLflow in docs (#13739)	2021-01-27 16:53:15 -08:00
Amog Kamsetty	0452a3a435	[Tune] Rename MLFlow to MLflow (#13301)	2021-01-11 17:36:55 -08:00
Kai Fricke	97211a6170	[Tune] Fix tune serve integration example (#13233)	2021-01-06 17:02:04 +01:00
Lavanya Shukla	350917958c	[docs] fix wandb url (#13094)	2020-12-28 17:19:17 -08:00
Amog Kamsetty	5d3c9c8861	[Tune] Mlflow Integration (#12840) Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-12-19 00:40:02 -08:00
Kai Fricke	9f5986ee58	[tune] logger migration to ExperimentLogger classes (#11984)	2020-11-16 15:08:37 -08:00
Richard Liaw	1b357533b1	[tune] Try to enable PTL, SKlearn tests (#11542)	2020-10-24 01:08:46 -07:00
Kai Fricke	2f74fe5b71	[tune/docs] Add PTL example to tune docs/examples (#11474)	2020-10-19 14:47:58 -07:00
Sumanth Ratna	92a58aabce	[tune][docs] Fix learning rate bounds in FAQ (#11345)	2020-10-12 09:44:53 -07:00
Richard Liaw	56f858ed1a	[tune][docs/util] gputil check, docs (#11260) Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>	2020-10-10 00:54:31 -07:00
Sumanth Ratna	98ebf8e2d8	[tune][docs] fix typo in Tune FAQ (#11161) * Fix typo in tune FAQ (used to use) * Update doc/source/tune/_tutorials/_faq.rst Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-10-01 11:20:41 -07:00
Kai Fricke	b8f344f695	[tune] add faq entry for reproducing experiments (setting seeds etc) (#11106)	2020-09-29 14:48:39 -07:00
Richard Liaw	a563344bc2	[docs] remove ref to google groups -> github discussions (#11019)	2020-09-24 18:09:51 -07:00
Kai Fricke	d9c4dea7cf	[tune] strict metric checking (#10972)	2020-09-24 10:00:48 -07:00
Kai Fricke	50d63b8077	[tune] update pt tutorial docs (#10925) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-09-21 13:33:37 -07:00
Richard Liaw	b0ca70f628	[tune+core] tune lifecycle and starting ray guide (#10813)	2020-09-21 11:27:50 -07:00
Ian Rodney	5bc2ba38fd	[docker] Detect CPUs in container correctly (#10507) Co-authored-by: simon-mo <simon.mo@hey.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu> Co-authored-by: Alex Wu <itswu.alex@gmail.com>	2020-09-13 23:40:48 -07:00
Kai Fricke	7eaf063f29	[tune] wrapper function to pass arbitrary objects through the object store to trainables (#10679)	2020-09-10 17:39:44 -07:00
Kai Fricke	756a9ea641	[tune] add mode/metric parameters to tune.run (#10627) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-09-08 17:06:21 -07:00
Richard Liaw	551c597312	[tune] API revamp fix (#10518)	2020-09-05 15:34:53 -07:00
Edward Oakes	34bda32054	[tune/serve] Fix tune/serve integration script broken by serve API change (#10586)	2020-09-04 17:11:58 -07:00
Sumanth Ratna	89bf262130	[tune] Fix lr typo in FAQ (#10548)	2020-09-03 13:37:39 -07:00
krfricke	57c4183724	[tune] add xgboost callbacks to integration module (#10502)	2020-09-02 11:16:09 -07:00
Richard Liaw	3f98a8bfcb	[docs] Fix warnings for sphinx 1.8 (#10476) * fix-build-for-sphinx18 * jnilit	2020-09-01 13:37:35 -07:00

82 commits