Chen Shen
6d17fe5fc5
[cuj2] merge latest change to cuj2 (groupby based filtering) and add a debug mode. ( #20742 )
This PR does two things:
* merge the latest groupby-based filtering into CUJ2
* add a debug mode that runs only a dummy trainer, so that data processing performance can be measured on its own.
2021-11-29 19:10:17 -08:00
Chen Shen
107aef89a8
[CUJ2] add nightly tests for running 500GB ray train ( #20195 )
* add
* update cluster env
* fix build
Co-authored-by: Matthew Deng <matthew.j.deng@gmail.com>
2021-11-21 20:04:45 -08:00
Chen Shen
b38ebd368c
[Dataset][nightly-test] spend less money #19488
Reduce the number of epochs and ensure everything runs in the same datacenter.
2021-10-18 18:53:50 -07:00
Chen Shen
7c99aae033
[dataset][nightly-test] add pipelined ingestion/training nightly test
2021-09-23 20:39:03 -07:00
Chen Shen
89f988e9cc
add dataset shuffle data loader ( #17917 )
2021-08-20 11:26:01 -07:00
Alex Wu
d9cd3800c7
Dataset speed up read ( #17435 )
2021-08-01 18:03:46 -07:00
SangBin Cho
e1cd8580a0
[Test] Add various fixes to the nightly dashboard to improve signals ( #17351 )
* Add various fixes to the nightly dashboard to improve signals
* Fix issues
2021-07-27 12:37:11 -07:00
Alex Wu
93c16346bf
[Dataset] imagenet nightly test ( #17069 )
2021-07-16 14:15:49 -07:00