hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 05:16:49 -04:00

Author	SHA1	Message	Date
krfricke	06af62ba91	[tune] refactor tune search space (#10444 ) * Added basic functionality and tests * Feature parity with old tune search space config * Convert Optuna search spaces * Introduced quantized values * Updated Optuna resolving * Added HyperOpt search space conversion * Convert search spaces to AxSearch * Convert search spaces to BayesOpt * Re-factored samplers into domain classes * Re-added base classes * Re-factored into list comprehensions * Added `from_config` classmethod for config conversion * Applied suggestions from code review * Removed truncated normal distribution * Set search properties in tune.run * Added test for tune.run search properties * Move sampler initializers to base classes * Add tune API sampling test, fixed includes, fixed resampling bug * Add to API docs * Fix docs * Update metric and mode only when set. Set default metric and mode to experiment analysis object. * Fix experiment analysis tests * Raise error when delimiter is used in the config keys * Added randint/qrandint to API docs, added additional check in tune.run * Fix tests * Fix linting error * Applied suggestions from code review. Re-added tune.function for the time being * Fix sampling tests * Fix experiment analysis tests * Fix tests and linting error * Removed unnecessary default_config attribute from OptunaSearch * Revert to set AxSearch default metric * fix-min-max * fix * nits * Added function check, enhanced loguniform error message * fix-print * fix * fix * Raise if unresolved values are in config and search space is already set Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-09-03 09:06:13 -07:00
krfricke	57c4183724	[tune] add xgboost callbacks to integration module (#10502 )	2020-09-02 11:16:09 -07:00
krfricke	f3f698816d	[tune] Added PyTorch Lightning callbacks to integrations (#10220 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-08-31 15:30:48 -07:00
fyrestone	08adbb371f	Cross language exception (#10023 )	2020-08-26 10:46:05 +08:00
Amog Kamsetty	5af7d24f66	[Tune] Transformer blog example (#9789 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2020-08-04 22:05:01 -07:00
Hao Chen	260bc52254	Java doc: "Ray Core Walkthrough" page (#8595 )	2020-07-30 11:13:38 +08:00
Richard Liaw	139d21e068	[tune] Docs for tune-sklearn (#9129 ) Co-authored-by: krfricke <krfricke@users.noreply.github.com>	2020-07-06 15:35:10 -07:00
Richard Liaw	b506f87117	[tune] New Doc edits, add Concepts page (#8083 ) Co-Authored-By: Sven Mika <sven@anyscale.io>	2020-04-25 18:25:56 -07:00
Richard Liaw	4d8bf5635d	[hotfix] Lint formatting for new Tune optimizer ZOOpt (#8040 ) * formatting * removedill * lint	2020-04-16 09:24:30 -07:00
Richard Liaw	e68d601ec7	[docs] Add link master <-> latest via sphinx version warnings (#8010 )	2020-04-13 15:21:08 -07:00
Richard Liaw	a67edc4051	[tune] Improve user guides and API docs (#7716 ) * create guide gallery for Tune * mods * ok * fix * fix_up_gallery * ok * Apply suggestions from code review Co-Authored-By: Sven Mika <sven@anyscale.io> * Apply suggestions from code review Co-Authored-By: Sven Mika <sven@anyscale.io> Co-authored-by: Sven Mika <sven@anyscale.io>	2020-04-06 12:16:35 -07:00
Richard Liaw	18327254b6	[docs] Fix readthedocs rendering (#7810 )	2020-03-30 11:40:08 -07:00
Eric Liang	5cebee68d6	[rllib] Add scaling guide to documentation, improve bandit docs (#7780 ) * update * reword * update * ms * multi node sgd * reorder * improve bandit docs * contrib * update * ref * improve refs * fix build * add pillow dep * add pil * update pil * pillow * remove false	2020-03-27 22:05:43 -07:00
Richard Liaw	e10dc91821	Fix doc build (#7781 )	2020-03-27 17:39:38 -07:00
Richard Liaw	3fa2e4a346	[docs] Fix import breaking docs build (#7715 ) * psutil missing * ok	2020-03-23 13:21:39 -07:00
Richard Liaw	81d311031b	[tune] Update API Reference Page (#7671 ) * widerdocs * init * docs * fix * moveit * mix * better_docs * remove * Apply suggestions from code review Co-Authored-By: Sven Mika <sven@anyscale.io> Co-authored-by: Sven Mika <sven@anyscale.io>	2020-03-22 16:42:20 -07:00
Richard Liaw	b38ed4be71	[raysgd] Fix More Docs (#7565 )	2020-03-11 14:17:47 -07:00
SangBin Cho	50145e668d	Fix the ray.remote reference not being visible in the documentation. (#7311 )	2020-02-28 14:03:08 -08:00
fyrestone	0648bd28ef	[xlang] Cross language Python support (#6709 )	2020-02-08 13:01:28 +08:00
Simon Mo	6285851743	Add sphinx copy button (#6694 ) * Add sphinx copy button * Update requirements-doc.txt Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>	2020-01-04 19:31:49 -06:00
Richard Liaw	5719a05757	[sgd] Add support for multi-model multi-optimizer training (#6317 )	2019-12-15 15:19:45 -08:00
Richard Liaw	10f21fa313	[docs] Convert Examples to Gallery (#5414 )	2019-09-24 15:46:56 -07:00
Richard Liaw	d37c09aac0	[docs] Add a feedback form (#5610 ) * Create a feedback form * feedback * fix * fix * fix * wording	2019-09-02 01:28:28 -07:00
Philipp Moritz	c852213b83	[projects] Project examples and documentation (#5407 )	2019-08-20 20:49:15 -07:00
Kristian Hartikainen	13fb9fe3db	[rllib] Feature/soft actor critic v2 (#5328 ) * Add base for Soft Actor-Critic * Pick changes from old SAC branch * Update sac.py * First implementation of sac model * Remove unnecessary SAC imports * Prune unnecessary noise and exploration code * Implement SAC model and use that in SAC policy * runs but doesn't learn * clear state * fix batch size * Add missing alpha grads and vars * -200 by 2k timesteps * doc * lazy squash * one file * ignore tfp * revert done	2019-08-01 23:37:36 -07:00
Hao Chen	0131353d42	[gRPC] Migrate gcs data structures to protobuf (#5024 )	2019-06-25 14:31:19 -07:00
Yuhong Guo	5eff47b657	[C++] Add hash table to Redis-Module (#4911 )	2019-06-07 16:11:37 +08:00
Peter Schafhalter	c2ade075a3	[sgd] Distributed Training via PyTorch (#4797 ) Implements distributed SGD using distributed PyTorch.	2019-06-01 21:39:22 -07:00
Romil Bhardwaj	004440f526	Dynamic Custom Resources - create and delete resources (#3742 )	2019-05-11 20:06:04 +08:00
Tasha Chin	5693cd1344	[docs] Show source code (#3281 )	2019-04-03 21:30:20 -07:00
Tianming Xu	692bb336a1	Fix master branch compilation error and lint error (#4109 )	2019-02-21 11:54:30 -08:00
Hao Chen	de17443dc2	Propagate backend error to worker (#4039 )	2019-02-16 11:39:15 +08:00
Eric Liang	2dccf383dd	[rllib] Basic infrastructure for off-policy estimation (IS, WIS) (#3941 )	2019-02-13 16:25:05 -08:00
Hao Chen	f31a79f3f7	Implement actor checkpointing (#3839 ) * Implement Actor checkpointing * docs * fix * fix * fix * move restore-from-checkpoint to HandleActorStateTransition * Revert "move restore-from-checkpoint to HandleActorStateTransition" This reverts commit 9aa4447c1e3e321f42a1d895d72f17098b72de12. * resubmit waiting tasks when actor frontier restored * add doc about num_actor_checkpoints_to_keep=1 * add num_actor_checkpoints_to_keep to Cython * add checkpoint_expired api * check if actor class is abstract * change checkpoint_ids to long string * implement java * Refactor to delay actor creation publish until checkpoint is resumed * debug, lint * Erase from checkpoints to restore if task fails * fix lint * update comments * avoid duplicated actor notification log * fix unintended change * add actor_id to checkpoint_expired * small java updates * make checkpoint info per actor * lint * Remove logging * Remove old actor checkpointing Python code, move new checkpointing code to FunctionActionManager * Replace old actor checkpointing tests * Fix test and lint * address comments * consolidate kill_actor * Remove __ray_checkpoint__ * fix non-ascii char * Loosen test checks * fix java * fix sphinx-build	2019-02-13 19:39:02 +08:00
Si-Yuan	48139cf861	Migrate Python C extension to Cython (#3541 )	2019-01-24 09:17:14 -08:00
Robert Nishihara	0b1608a546	Factor out code for starting new processes and test plasma store in valgrind. (#3824 ) * Factor out starting Ray processes. * Detect flags through environment variables. * Return ProcessInfo from start_ray_process. * Print valgrind errors at exit. * Test valgrind in travis. * Some valgrind fixes. * Undo raylet monitor change. * Only test plasma store in valgrind.	2019-01-22 14:59:11 -08:00
Yuhong Guo	fb33fa9097	Enable function_descriptor in backend to replace the function_id (#3028 )	2018-12-18 18:53:59 -05:00
Robert Nishihara	82863b5251	[autoscaler] Update autoscaler to use heartbeat batches. (#3409 )	2018-11-27 23:46:27 -08:00
Eric Liang	53489d2f85	[sgd] Document and add simple MNIST example (#3236 )	2018-11-10 21:52:20 -08:00
Robert Nishihara	658c14282c	Remove legacy Ray code. (#3121 ) * Remove legacy Ray code. * Fix cmake and simplify monitor. * Fix linting * Updates * Fix * Implement some methods. * Remove more plasma manager references. * Fix * Linting * Fix * Fix * Make sure class IDs are strings. * Some path fixes * Fix * Path fixes and update arrow * Fixes. * linting * Fixes * Java fixes * Some java fixes * TaskLanguage -> Language * Minor * Fix python test and remove unused method signature. * Fix java tests * Fix jenkins tests * Remove commented out code.	2018-10-26 13:36:58 -07:00
Richard Liaw	62d0698097	[tune] Tune Facelift (#2472 ) This PR introduces the following changes: * Ray Tune -> Tune * [breaking] Creation of `schedulers/`, moving PBT, HyperBand into a submodule * [breaking] Search Algorithms now must take in experiment configurations via `add_configurations` rather than through initialization * Support `"run": (function | class | str)` with automatic registering of trainable * Documentation Changes	2018-08-19 11:00:55 -07:00
Jones Wong	007208d2bb	Support older version TF and Support RMSProp in Impala (#2590 ) * Support TF versions < 1.5 * Support the RMSProp optimizer in Impala Before TF 1.5, tf.reduce_sum() and tf.reduce_max() took an argument keep_dims, which was renamed to keepdims in later versions. The original Impala paper uses the RMSProp algorithm to optimize the model, so supporting it lets users reproduce those experiments. Without any tuning, that is, using the same hyperparameters as AdamOptimizer, it reaches "episode_reward_mean": 19.083333333333332 in Pong after consuming 3,610,350 samples.	2018-08-09 19:51:32 -07:00
Melih Elibol	8ae82180b4	[xray] Adds a driver table. (#2289 ) This PR adds a driver table for the new GCS, which enables cleanup functionality associated with monitoring driver death. Some testing in `monitor_test.py` is restored, but redis sharding for xray is needed to enable remaining tests.	2018-08-08 23:41:40 -07:00
Robert Nishihara	515da7721a	Change ray.worker.cleanup -> ray.shutdown and improve API documentation. (#2374 ) * Change ray.worker.cleanup -> ray.shutdown and improve API documentation. * Deprecate ray.worker.cleanup() gracefully. * Fix linting	2018-07-12 12:00:00 -07:00
Robert Nishihara	b90e551b41	[xray] Implement timeline and profiling API. (#2306 ) * Add profile table and store profiling information there. * Code for dumping timeline. * Improve color scheme. * Push timeline events on driver only for raylet. * Improvements to profiling and timeline visualization * Some linting * Small fix. * Linting * Propagate node IP address through profiling events. * Fix test. * object_id.hex() should return byte string in python 2. * Include gcs.fbs in node_manager.fbs. * Remove flatbuffer definition duplication. * Decode to unicode in Python 3 and bytes in Python 2. * Minor * Submit profile events in a batch. Revert some CMake changes. * Fix * Workaround test failure. * Fix linting * Linting * Don't return anything from chrome_tracing_dump when filename is provided. * Remove some redundancy from profile table. * Linting * Move TODOs out of docstring. * Minor	2018-07-04 23:23:48 -07:00
Robert Nishihara	ff2217251f	[xray] Add error table and push error messages to driver through node manager. (#2256 ) * Fix documentation indentation. * Add error table to GCS and push error messages through node manager. * Add type to error data. * Linting * Fix failure_test bug. * Linting. * Enable one more test. * Attempt to fix doc building. * Restructuring * Fixes * More fixes. * Move current_time_ms function into util.h.	2018-06-20 21:29:28 -07:00
Robert Nishihara	6172f94c04	Implement Python global state API for xray. (#2125 ) * Implement global state API for xray. * Fix object table. * Fixes for log structure. * Implement cluster_resources. * Add driver task to task table. * Remove python flatbuffers code * Get some global state API tests running. * Python linting. * Fix linting. * Fix mock modules for doc * Copy over flatbuffer bindings. * Fix for tests. * Linting * Fix monitor crash.	2018-05-29 16:25:54 -07:00
Robert Nishihara	3c76461b22	Remove smart_open install. (#1943 )	2018-04-23 23:18:09 -07:00
Robert Nishihara	fbfbb1c079	[xray] Integrate worker.py with raylet. (#1810 ) * Integrate worker with raylet. * Begin allowing worker to attach to cluster. * Fix linting and documentation. * Fix linting. * Comment tests back in. * Fix type of worker command. * Remove xray python files and tests. * Fix from rebase. * Add test. * Copy over raylet executable. * Small cleanup.	2018-04-03 02:38:56 -07:00
Robert Nishihara	96913be939	Treat actor creation like a regular task. (#1668 ) * Treat actor creation like a regular task. * Small cleanups. * Change semantics of actor resource handling. * Bug fix. * Minor linting * Bug fix * Fix jenkins test. * Fix actor tests * Some cleanups * Bug fix * Fix bug. * Remove cached actor tasks when a driver is removed. * Add more info to taskspec in global state API. * Fix cyclic import bug in tune. * Fix * Fix linting. * Fix linting. * Don't schedule any tasks (especially actor creation tasks) on local schedulers with 0 CPUs. * Bug fix. * Add test for 0 CPU case * Fix linting * Address comments. * Fix typos and add comment. * Add assertion and fix test.	2018-03-16 11:18:07 -07:00

61 commits