ray/python
Richard Liaw aad3c50e2d
[tune] Cluster Fault Tolerance (#3309)
This PR introduces cluster-level fault tolerance for Tune by checkpointing global state. This occurs with relatively high frequency and allows users to easily resume experiments when the cluster crashes.

Note that this PR may affect automated workflows due to auto-prompting, but this is resolvable.
2018-12-29 11:42:25 +08:00
..
benchmarks Deprecate num_workers argument to ray.init and ray start. (#3114) 2018-10-28 20:12:49 -07:00
ray [tune] Cluster Fault Tolerance (#3309) 2018-12-29 11:42:25 +08:00
asv.conf.json [asv] Pushing to s3 (#2246) 2018-06-20 10:43:44 -07:00
build-wheel-macos.sh Update arrow to reduce plasma IPCs. (#3497) 2018-12-14 23:49:37 -05:00
build-wheel-manylinux1.sh Update arrow to reduce plasma IPCs. (#3497) 2018-12-14 23:49:37 -05:00
README-benchmarks.rst [rllib][asv] Support ASV for RLlib (#2304) 2018-06-28 17:20:09 -07:00
README-building-wheels.md [DataFrame] Add Parquet Support in Build Process (#1531) 2018-02-16 07:18:42 -08:00
setup.py Ensure numpy is at least 1.10.4 in setup.py (#2462) 2018-12-24 11:01:25 -08:00