ray/doc/source/data
matthewdeng cc08c01ade
[ml] add more preprocessors (#23904)
Adding some more common preprocessors:
* MaxAbsScaler
* RobustScaler
* PowerTransformer
* Normalizer
* FeatureHasher
* Tokenizer
* HashingVectorizer
* CountVectorizer

API docs: https://ray--23904.org.readthedocs.build/en/23904/ray-air/getting-started.html

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2022-04-25 21:12:59 +01:00
..
doc_code [docs] sphinx gallery removal, migrate to ipynb (#22467) 2022-02-19 01:19:07 -08:00
examples [Doc] [jobs] Add links to Job Submission and improve doc (#23209) 2022-03-18 12:52:13 -05:00
images [minor] Fix incorrect link to ray core user guide (#23316) 2022-03-17 20:58:56 -07:00
modin Fix broken links in documentation and put linkcheck linter in place on CI (#23340) 2022-03-18 21:02:52 -07:00
accessing-datasets.rst Cleanup the DatasetPipeline references in Getting Started; rename Exchanging to Accessing (#23786) 2022-04-12 17:10:14 -07:00
advanced-pipelines.rst [minor] Fix incorrect link to ray core user guide (#23316) 2022-03-17 20:58:56 -07:00
big_data_ingestion.yaml Revert "[docs] Clean up doc structure (first part) (#21667)" (#21763) 2022-01-20 15:30:56 -08:00
creating-datasets.rst [Dataset GA doc] Decompose the monolith of Getting Started page (and get them under User Guide) (#23311) 2022-03-18 11:25:43 -07:00
custom-data.rst [Docs] Ray Data docs target state (#21931) 2022-01-27 13:14:36 -08:00
dask-on-ray.rst Update dask version for Ray 1.12.0 (#23197) 2022-03-15 19:22:19 -07:00
dataset-ml-preprocessing.rst [Dataset GA doc] Decompose the monolith of Getting Started page (and get them under User Guide) (#23311) 2022-03-18 11:25:43 -07:00
dataset-tensor-support.rst Make a pass fixing Dataset API issues (#22886) 2022-03-08 13:07:55 -08:00
dataset.rst Remove dataset pipeline from the Getting Started page (#23756) 2022-04-07 12:52:04 -07:00
getting-started.rst Cleanup the DatasetPipeline references in Getting Started; rename Exchanging to Accessing (#23786) 2022-04-12 17:10:14 -07:00
integrations.rst Move the third-party data integrations (non-Dataset stuff) out of the user guides which is for Dataset (#23162) 2022-03-17 11:27:40 -07:00
key-concepts.rst Cleanup the DatasetPipeline references in Getting Started; rename Exchanging to Accessing (#23786) 2022-04-12 17:10:14 -07:00
mars-on-ray.rst [Docs] Ray Data docs target state (#21931) 2022-01-27 13:14:36 -08:00
package-ref.rst [ml] add more preprocessors (#23904) 2022-04-25 21:12:59 +01:00
performance-tips.rst [Docs] Ray Data docs target state (#21931) 2022-01-27 13:14:36 -08:00
pipelining-compute.rst Remove dataset pipeline from the Getting Started page (#23756) 2022-04-07 12:52:04 -07:00
random-access.rst [data] Fix small doc issues (#23813) 2022-04-09 12:09:08 -07:00
raydp.rst [Docs] Ray Data docs target state (#21931) 2022-01-27 13:14:36 -08:00
saving-datasets.rst [Dataset GA doc] Decompose the monolith of Getting Started page (and get them under User Guide) (#23311) 2022-03-18 11:25:43 -07:00
transforming-datasets.rst [data] Fix small doc issues (#23813) 2022-04-09 12:09:08 -07:00
user-guide.rst Cleanup the DatasetPipeline references in Getting Started; rename Exchanging to Accessing (#23786) 2022-04-12 17:10:14 -07:00