ray/doc/source at 015181ab9a9e80a98c26ff773a6faa552b54cdd0 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

History

Eric Liang 015181ab9a Add random access support for Datasets (experimental feature) (#22749 ) This PR adds experimental support for random access to datasets. A Dataset can be random access enabled by calling `ds.to_random_access_dataset(key, num_workers=N)`. This creates a RandomAccessDataset. RandomAccessDataset partitions the dataset across the cluster by the given sort key, providing efficient random access to records via binary search. A number of worker actors are created, each of which has zero-copy access to the underlying sorted data blocks of the Dataset. Performance-wise, you can expect each worker to provide ~3000 records / second via ``get_async()``, and ~10000 records / second via ``multiget()``. Since Ray actor calls go direct from worker->worker, throughput scales linearly with the number of workers.		2022-03-17 15:01:12 -07:00
..
_includes	[docs] fix includes for md files (#23180 )	2022-03-15 11:09:18 +00:00
_static	[docs] RLlib concepts consolidation, user guide, RL conf prep (#22496 )	2022-02-18 09:35:20 -08:00
_templates	[docs] templates and contribution guide (fixes #21753 ) (#23003 )	2022-03-10 15:28:07 +00:00
cluster	[Job submission] Improve job submission docs (#23115 )	2022-03-15 21:20:33 -05:00
data	Add random access support for Datasets (experimental feature) (#22749 )	2022-03-17 15:01:12 -07:00
images	[docs] Core docs refactor (#23216 )	2022-03-17 11:26:17 -07:00
ray-contribute	[docs] Core docs refactor (#23216 )	2022-03-17 11:26:17 -07:00
ray-core	[Doc] [runtime_env] Add limitation about single-file `py_modules` to doc (#23248 )	2022-03-17 16:23:46 -05:00
ray-more-libs	[docs] re/move old core examples (#22802 )	2022-03-10 12:17:00 -08:00
ray-observability	[runtime env] [Doc] add more details about runtime env logs (#22480 )	2022-03-02 14:27:28 -08:00
ray-overview	Update paper links to include exoshuffle and remove whitepaper (moved to docs) (#23099 )	2022-03-15 13:12:01 -07:00
ray-references	[Doc] [Jobs] add CLI and SDK reference to docs (#22680 )	2022-02-28 17:57:46 -06:00
raysgd	[GCS-Ray] update doc and error message for GCS-Ray (#22528 )	2022-02-22 17:56:30 -08:00
rllib	[docs] RLlib broken links (fixes #23160 ) (#23226 )	2022-03-16 12:38:18 +01:00
serve	[Serve] Remove legacy pipeline codebase (#23172 )	2022-03-17 13:27:16 -07:00
train	[Train] Add support for automatic mixed precision (#22227 )	2022-03-16 20:53:02 -07:00
tune	[docs] Core docs refactor (#23216 )	2022-03-17 11:26:17 -07:00
workflows	[Workflow] Improve workflow docs (#23114 )	2022-03-13 18:55:45 -07:00
_toc.yml	Move the third-party data integrations (non-Dataset stuff) out of the user guides which is for Dataset (#23162 )	2022-03-17 11:27:40 -07:00
conf.py	run code in browser (#22727 )	2022-03-02 10:27:00 +01:00
custom_directives.py	[Train] Add support for automatic mixed precision (#22227 )	2022-03-16 20:53:02 -07:00
index.md	[docs] fix includes for md files (#23180 )	2022-03-15 11:09:18 +00:00