diff --git a/doc/source/tune/api_docs/schedulers.rst b/doc/source/tune/api_docs/schedulers.rst
index ffe33742e..07e440509 100644
--- a/doc/source/tune/api_docs/schedulers.rst
+++ b/doc/source/tune/api_docs/schedulers.rst
@@ -64,7 +64,7 @@ HyperBand Implementation Details
 
 Implementation details may deviate slightly from theory but are focused on increasing usability.
 Note: ``R``, ``s_max``, and ``eta`` are parameters of HyperBand given by the paper.
-See `this post `_ for context.
+See `this post `_ for context.
 
 1. Both ``s_max`` (representing the ``number of brackets - 1``) and ``eta``, representing the downsampling rate, are fixed.
     In many practical settings, ``R``, which represents some resource unit and often the number of training iterations,
@@ -82,7 +82,7 @@ HyperBand configurations that are not very intuitive:
     The implementation takes the same configuration as the example given in the paper
     and exposes ``max_t``, which is not a parameter in the paper.
 
-2. The example in the `post `_ to calculate ``n_0``
+2. The example in the `post `_ to calculate ``n_0``
     is actually a little different than the algorithm given in the paper.
     In this implementation, we implement ``n_0`` according to the paper (which is `n` in the below example):
 
diff --git a/python/ray/tune/schedulers/hyperband.py b/python/ray/tune/schedulers/hyperband.py
index 867826fc6..12e940102 100644
--- a/python/ray/tune/schedulers/hyperband.py
+++ b/python/ray/tune/schedulers/hyperband.py
@@ -57,7 +57,7 @@ class HyperBandScheduler(FIFOScheduler):
     Note that Tune's stopping criteria will be applied in conjunction with
     HyperBand's early stopping mechanisms.
 
-    See also: https://homes.cs.washington.edu/~jamieson/hyperband.html
+    See also: https://blog.ml.cmu.edu/2018/12/12/massively-parallel-hyperparameter-optimization/
 
     Args:
         time_attr: The training result attr to use for comparing time.
@@ -78,7 +78,7 @@ class HyperBandScheduler(FIFOScheduler):
             the difference is between bracket space-time allocation ratios.
         stop_last_trials: Whether to terminate the trials after
             reaching max_t. Defaults to True.
-    """
+    """ # noqa: E501
 
     _supports_buffered_results = False
 