ray/doc/source/data/user-guide.rst
Jian Xiao 9fe4dba4ad
Revamp the Getting Started page for Dataset (#24860)
This is part of the Dataset GA doc fix effort to update/improve the documentation.
This PR revamps the Getting Started page.

What are the changes:
- Focus on basic/core features that are bread-and-butter for users, leave the advanced features out
- Focus on high level introduction, leave the detailed spec out (e.g. what are possible batch_types for map_batches() API)
- Use more realistic (yet still simple) data example that's familiar to people (IRIS dataset in this case)
- Use the same data example throughout to make it context-switch free
- Use runnable code rather than faked
- Reference to the code from doc, instead of inlining them in the doc

Co-authored-by: Ubuntu <ubuntu@ip-172-31-32-136.us-west-2.compute.internal>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2022-05-18 13:46:23 -07:00

29 lines
827 B
ReStructuredText
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

.. _data_user_guide :
===========
User Guides
===========
If youre new to Ray Datasets, we recommend starting with the :ref:`Ray Datasets Quick Start <datasets_getting_started>`.
This user guide will help you navigate the Ray Datasets project and show you how achieve several tasks, for instance
you will learn
- how to load data and preprocess it for machine learning applications,
- how to use Tensors with Ray Datasets,
- how to run Dataset Pipelines in common scenarios,
- and how to tune your Ray Datasets applications for performance.
.. toctree::
:maxdepth: 2
creating-datasets
saving-datasets
transforming-datasets
accessing-datasets
pipelining-compute
dataset-ml-preprocessing
dataset-tensor-support
advanced-pipelines
random-access
custom-data
performance-tips