[docs] Add a talks section for ray.data (#20444)

Richard Liaw 2021-11-16 14:30:08 -08:00 committed by GitHub
parent 05d21497db
commit cf357f6bce


@@ -34,9 +34,7 @@ Ray-integrated DataFrame libraries can also be seamlessly used with Datasets, to
:width: 650px
:align: center
See the following for more Dataset ML use cases and benchmarks:
- [slides] `Talk given at PyData 2021 <https://docs.google.com/presentation/d/1zANPlmrxQkjPU62I-p92oFO3rJrmjVhs73hL4YbM4C4>`_
See :ref:`the Talks section <data-talks>` for more Dataset ML use cases and benchmarks.
General Parallel Compute
------------------------
@@ -192,7 +190,7 @@ Get started by creating Datasets from synthetic data using ``ray.data.range()``
.. code-block:: python
import ray
# Create a Dataset of Python objects.
ds = ray.data.range(10000)
# -> Dataset(num_blocks=200, num_rows=10000, schema=<class 'int'>)
@@ -387,6 +385,15 @@ Datasets can read and write in parallel to `custom datasources <package-ref.html
# Write to a custom datasource.
ds.write_datasource(YourCustomDatasource(), **write_args)
.. _data-talks:
Talks and Materials
-------------------
- [slides] `Talk given at PyData 2021 <https://docs.google.com/presentation/d/1zANPlmrxQkjPU62I-p92oFO3rJrmjVhs73hL4YbM4C4>`_
Contributing
------------