mirror of
https://github.com/vale981/ray
synced 2025-03-06 10:31:39 -05:00

This is part of the Dataset GA doc fix effort to update/improve the documentation. This PR revamps the Getting Started page. What are the changes: - Focus on basic/core features that are bread-and-butter for users, leave the advanced features out - Focus on high level introduction, leave the detailed spec out (e.g. what are possible batch_types for map_batches() API) - Use more realistic (yet still simple) data example that's familiar to people (IRIS dataset in this case) - Use the same data example throughout to make it context-switch free - Use runnable code rather than faked - Reference to the code from doc, instead of inlining them in the doc Co-authored-by: Ubuntu <ubuntu@ip-172-31-32-136.us-west-2.compute.internal> Co-authored-by: Eric Liang <ekhliang@gmail.com>
29 lines
827 B
ReStructuredText
29 lines
827 B
ReStructuredText
.. _data_user_guide :
|
||
|
||
===========
|
||
User Guides
|
||
===========
|
||
|
||
If you’re new to Ray Datasets, we recommend starting with the :ref:`Ray Datasets Quick Start <datasets_getting_started>`.
|
||
This user guide will help you navigate the Ray Datasets project and show you how achieve several tasks, for instance
|
||
you will learn
|
||
|
||
- how to load data and preprocess it for machine learning applications,
|
||
- how to use Tensors with Ray Datasets,
|
||
- how to run Dataset Pipelines in common scenarios,
|
||
- and how to tune your Ray Datasets applications for performance.
|
||
|
||
.. toctree::
|
||
:maxdepth: 2
|
||
|
||
creating-datasets
|
||
saving-datasets
|
||
transforming-datasets
|
||
accessing-datasets
|
||
pipelining-compute
|
||
dataset-ml-preprocessing
|
||
dataset-tensor-support
|
||
advanced-pipelines
|
||
random-access
|
||
custom-data
|
||
performance-tips
|