* Change the log syncing behavior
* fix up abstractions for syncer
* Finished checkpoint syncing
* Code
* Set of changes to get things running
* Fixes for log syncing
* Fix parts
* Lint and other fixes
* Fix some tests
* Remove extra parsing functionality
* some test fixes
* Fix up cloud syncing
* Another thing to do
* Fix up tests and local sync
Changes LogSync into a mixin, and adds tests for different
functionalities.
* Fix up tests, start on local migration
* fix distributed migrations
* comments
* formatting
* Better checkpoint directory handling
* fix tests
* fix tests
* fix click
* comments
* formatting comments
* formatting and comments
* sync function deprecations
* syncfunction
* Add documentation for Syncing and Uploading (see the sketch after this list)
* nit
* BaseSyncer as base for Mixin in edge case
* more docs
* clean up assertions
* validate
* nit
* Update test_cluster.py
* Better docs
* Update tune-usage.rst
* cleanup
* nit
* Instructions for running TensorBoard without sudo
When we run TensorBoard to visualize Ray output on multi-user clusters where we don't have sudo access, such as the RISE clusters, a few commands need to be run first so that TensorBoard can write to the tmp directory. This is a common enough use case that it is worth covering in the Tune documentation.
* Update tune-usage.rst
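The syncing and uploading documentation added here centers on pointing Tune at a cloud location. A minimal sketch of the user-facing side, assuming the `upload_dir` argument of `tune.run`; the trainable, experiment name, and S3 bucket below are placeholders:

```python
from ray import tune


def train_fn(config, reporter):
    # Placeholder trainable: report one metric so Tune has something to log and sync.
    reporter(mean_accuracy=config["lr"])


# Results are written locally under ~/ray_results and, when upload_dir is set,
# periodically synced to cloud storage (the bucket name is a placeholder).
tune.run(
    train_fn,
    name="sync_example",
    config={"lr": tune.grid_search([0.01, 0.1])},
    upload_dir="s3://my-tune-bucket/sync_example",
)
```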
## What do these changes do?
Add documentation for the `--output` flag for ls / lsx in the Tune CLI.
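For illustration, usage might look like `tune ls ~/ray_results/my_project --output trials.csv`; the path and file name are placeholders, and the exact value the flag accepts may differ.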
## Related issue number
Closes #4511
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
## What do these changes do?
Adds a filter flag (`--filter`) to the ls / lsx commands of the Tune CLI.
Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
Uses `tune.run` to execute experiments as the preferred API (see the sketch below).
@noahgolmant
This does not break backward compatibility, but will slowly internalize `Experiment`.
In a separate PR, Tune schedulers should only support 1 running experiment at a time.
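A minimal sketch of the preferred entry point, with a placeholder trainable and search space (not the exact interface changes of this PR):

```python
from ray import tune


def my_trainable(config, reporter):
    # Placeholder objective: report the squared parameter value.
    reporter(mean_loss=config["x"] ** 2)


# Preferred: pass the trainable and settings directly to tune.run ...
tune.run(
    my_trainable,
    name="quadratic",
    config={"x": tune.grid_search([-1, 0, 1])},
)

# ... instead of constructing an Experiment explicitly, which this change
# starts to internalize:
# tune.run(tune.Experiment("quadratic", my_trainable,
#                          config={"x": tune.grid_search([-1, 0, 1])}))
```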
This PR introduces cluster-level fault tolerance for Tune by checkpointing global state. This occurs with relatively high frequency and allows users to easily resume experiments when the cluster crashes.
Note that this PR may affect automated workflows due to auto-prompting, but this is resolvable.
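From the user side, recovering after a cluster failure might look like the following sketch; the `resume` value and directory layout are assumptions, since the exact prompting behavior is what the note above refers to:

```python
from ray import tune


def my_trainable(config, reporter):
    # Placeholder trainable.
    reporter(mean_accuracy=0.5)


# Rerunning the same named experiment with resume enabled restores trials from
# the periodically checkpointed experiment-level state.
tune.run(
    my_trainable,
    name="my_experiment",
    local_dir="~/ray_results",
    resume=True,
)
```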
* Added `checkpoint_at_end` option to fix #2740 (see the usage sketch after this list)
* Added ability to checkpoint at the end of trials if the option is set to True
* checkpoint_at_end option added; consistent with Experiment and TrialRunner
* checkpoint_at_end option mentioned in the tune usage guide
* Moved the redundant checkpoint criteria check out of the if-elif
* Added note that checkpoint_at_end is enabled only when checkpoint_freq is not 0
* Added test case for checkpoint_at_end
* Made checkpoint_at_end have an effect regardless of checkpoint_freq
* Removed comment from the test case
* Fixed the indentation
* Fixed pep8 E231
* Handled cases when trainable does not have _save implemented
* Constrained test case to a particular exp using the MockAgent
* Revert "Constrained test case to a particular exp using the MockAgent"
This reverts commit e965a9358ec7859b99a3aabb681286d6ba3c3906.
* Revert "Handled cases when trainable does not have _save implemented"
This reverts commit 0f5382f996ff0cbf3d054742db866c33494d173a.
* Simpler test case for checkpoint_at_end
* Prevented bools from losing their actual value
* Revert "Moved the redundant checkpoint criteria check out of the if-elif"
This reverts commit 783005122902240b0ee177e9e206e397356af9c5.
* Fix linting error.
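A usage sketch for the new flag, with a placeholder Trainable written against the old-style `_setup`/`_train`/`_save`/`_restore` API:

```python
import json
import os

from ray import tune


class MyTrainable(tune.Trainable):
    def _setup(self, config):
        self.step_count = 0

    def _train(self):
        self.step_count += 1
        return {"mean_accuracy": 0.1 * self.step_count}

    def _save(self, checkpoint_dir):
        # checkpoint_at_end only takes effect for trainables that implement _save.
        path = os.path.join(checkpoint_dir, "state.json")
        with open(path, "w") as f:
            json.dump({"step_count": self.step_count}, f)
        return path

    def _restore(self, checkpoint_path):
        with open(checkpoint_path) as f:
            self.step_count = json.load(f)["step_count"]


# Checkpoint every 3 iterations and also once when the trial finishes;
# checkpoint_at_end now applies regardless of checkpoint_freq.
tune.run(
    MyTrainable,
    stop={"training_iteration": 10},
    checkpoint_freq=3,
    checkpoint_at_end=True,
)
```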
This PR introduces the following changes:
* Ray Tune -> Tune
* [breaking] Creation of `schedulers/`, moving PBT, HyperBand into a submodule
* [breaking] Search Algorithms now must take in experiment configurations via `add_configurations` rather than through initialization
* Support `"run": (function | class | str)` with automatic registering of the trainable (see the sketch after this list)
* Documentation Changes
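A sketch of the three accepted forms of `"run"` (names below are placeholders; the string form assumes the trainable was registered first):

```python
from ray import tune
from ray.tune import register_trainable


def train_fn(config, reporter):
    # Placeholder function trainable; the trial finishes when the function returns.
    reporter(mean_accuracy=config["lr"])


class TrainCls(tune.Trainable):
    # Placeholder class trainable (old-style API).
    def _train(self):
        return {"mean_accuracy": 0.5}


# Functions and classes are now registered automatically by tune.run.
tune.run(train_fn, config={"lr": 0.01})
tune.run(TrainCls, stop={"training_iteration": 1})

# Strings still refer to an explicitly registered trainable name.
register_trainable("my_trainable_name", train_fn)
tune.run("my_trainable_name", config={"lr": 0.01})
```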