hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

Fork 0

mirror of https://github.com/vale981/ray synced 2025-03-10 05:16:49 -04:00

Commit graph

Author	SHA1	Message	Date
Eric Liang	1f509ab331	[air] Add DatasetParallelTrainer.dataset_config for configuring dataset ingest (#25337 ) This adds a per-dataset config object to DataParallelTrainer. These configs define how the Dataset should be read into the DataParallelTrainer. It configures the preprocessing, splitting, and ingest strategy per-dataset. DataParallelTrainers declare default DatasetConfigs for each dataset passed in the ``datasets`` argument. Users have the opportunity to selectively override these configs by passing the ``dataset_config`` argument. Trainers can also define user customizable values (e.g., XGBoostTrainer doesn't support streaming ingest). This PR adds the minimal support for dataset configs. Future PRs will: - Add support for streaming ingest - Move this config from DataParallelTrainer to ml.Trainer	2022-06-03 16:32:53 -07:00
Kai Fricke	4b9a89ad90	[air] Move python/ray/ml to python/ray/air (#25449 ) The package "ml" should be renamed to "air". Main question: Keep a `ml.py` with `from ray.air import *` for some level of backwards compatibility? I'd go for no to force people to use the new structure.	2022-06-03 21:53:44 +01:00
Eric Liang	995309f9a3	[docs] Add AIR data ingest docs (part 1-- bulk loading only) (#24799 )	2022-05-19 14:25:47 -07:00

Author

SHA1

Message

Date

Eric Liang

1f509ab331

[air] Add DatasetParallelTrainer.dataset_config for configuring dataset ingest (#25337 )

This adds a per-dataset config object to DataParallelTrainer. These configs define how the Dataset should be read into the DataParallelTrainer. It configures the preprocessing, splitting, and ingest strategy per-dataset. DataParallelTrainers declare default DatasetConfigs for each dataset passed in the ``datasets`` argument. Users have the opportunity to selectively override these configs by passing the ``dataset_config`` argument. Trainers can also define user customizable values (e.g., XGBoostTrainer doesn't support streaming ingest).

This PR adds the minimal support for dataset configs. Future PRs will:
- Add support for streaming ingest
- Move this config from DataParallelTrainer to ml.Trainer

2022-06-03 16:32:53 -07:00

Kai Fricke

4b9a89ad90

[air] Move python/ray/ml to python/ray/air (#25449 )

The package "ml" should be renamed to "air".

Main question: Keep a `ml.py` with `from ray.air import *` for some level of backwards compatibility?
I'd go for no to force people to use the new structure.

2022-06-03 21:53:44 +01:00

Eric Liang

995309f9a3

[docs] Add AIR data ingest docs (part 1-- bulk loading only) (#24799 )

2022-05-19 14:25:47 -07:00

3 commits