ray/python/ray/tune/config_parser.py

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function


import argparse
import json

from ray.tune import TuneError
from ray.tune.result import DEFAULT_RESULTS_DIR
from ray.tune.trial import Resources


def json_to_resources(data):
    if type(data) is str:
        data = json.loads(data)
    for k in data:
        if k not in Resources._fields:
            raise TuneError(
                "Unknown resource type {}, must be one of {}".format(
                    k, Resources._fields))
    return Resources(
        data.get("cpu", 1), data.get("gpu", 0),
        data.get("driver_cpu_limit"), data.get("driver_gpu_limit"))


def resources_to_json(resources):
    return {
        "cpu": resources.cpu,
        "gpu": resources.gpu,
        "driver_cpu_limit": resources.driver_cpu_limit,
        "driver_gpu_limit": resources.driver_gpu_limit,
    }


def make_parser(**kwargs):
    """Returns a base argument parser for the ray.tune tool."""

    parser = argparse.ArgumentParser(**kwargs)

    # Note: keep this in sync with rllib/train.py
    parser.add_argument(
        "--run", default=None, type=str,
        help="The algorithm or model to train. This may refer to the name "
        "of a built-on algorithm (e.g. RLLib's DQN or PPO), or a "
        "user-defined trainable function or class registered in the "
        "tune registry.")
    parser.add_argument(
        "--stop", default="{}", type=json.loads,
        help="The stopping criteria, specified in JSON. The keys may be any "
        "field in TrainingResult, e.g. "
        "'{\"time_total_s\": 600, \"timesteps_total\": 100000}' to stop "
        "after 600 seconds or 100k timesteps, whichever is reached first.")
    parser.add_argument(
        "--config", default="{}", type=json.loads,
        help="Algorithm-specific configuration (e.g. env, hyperparams), "
        "specified in JSON.")
    parser.add_argument(
        "--resources", default='{"cpu": 1}', type=json_to_resources,
        help="Machine resources to allocate per trial, e.g. "
        "'{\"cpu\": 64, \"gpu\": 8}'. Note that GPUs will not be assigned "
        "unless you specify them here.")
    parser.add_argument(
        "--repeat", default=1, type=int,
        help="Number of times to repeat each trial.")
    parser.add_argument(
        "--local-dir", default=DEFAULT_RESULTS_DIR, type=str,
        help="Local dir to save training results to. Defaults to '{}'.".format(
            DEFAULT_RESULTS_DIR))
    parser.add_argument(
        "--upload-dir", default="", type=str,
        help="Optional URI to upload training results to.")
    parser.add_argument(
        "--checkpoint-freq", default=0, type=int,
        help="How many training iterations between checkpoints. "
        "A value of 0 (default) disables checkpointing.")
    parser.add_argument(
        "--scheduler", default="FIFO", type=str,
        help="FIFO (default), MedianStopping, or HyperBand.")
    parser.add_argument(
        "--scheduler-config", default="{}", type=json.loads,
        help="Config options to pass to the scheduler.")

    # Note: this currently only makes sense when running a single trial
    parser.add_argument("--restore", default=None, type=str,
                        help="If specified, restore from this checkpoint.")

    return parser
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00			`from __future__ import absolute_import`
			`from __future__ import division`
			`from __future__ import print_function`


			`import argparse`
			`import json`

[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`from ray.tune import TuneError`
[tune] Clean up result logging: move out of /tmp, add timestamp (#1297) 2017-12-15 14:19:08 -08:00			`from ray.tune.result import DEFAULT_RESULTS_DIR`
[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`from ray.tune.trial import Resources`
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00

[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`def json_to_resources(data):`
			`if type(data) is str:`
			`data = json.loads(data)`
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`for k in data:`
			`if k not in Resources._fields:`
			`raise TuneError(`
			`"Unknown resource type {}, must be one of {}".format(`
			`k, Resources._fields))`
[tune] Allow resources to not all be assigned to the driver (#1150) * dgpu * update * update * update * also support cmdline * limit * Update README.rst * documentation * typo * small coverage for driver_gpu_limit * lint * fix lint 2017-10-28 22:16:05 -07:00			`return Resources(`
changes (#1237) 2017-11-20 21:15:54 -08:00			`data.get("cpu", 1), data.get("gpu", 0),`
[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`data.get("driver_cpu_limit"), data.get("driver_gpu_limit"))`
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00

[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`def resources_to_json(resources):`
			`return {`
			`"cpu": resources.cpu,`
			`"gpu": resources.gpu,`
			`"driver_cpu_limit": resources.driver_cpu_limit,`
			`"driver_gpu_limit": resources.driver_gpu_limit,`
			`}`


			`def make_parser(**kwargs):`
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00			`"""Returns a base argument parser for the ray.tune tool."""`

[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`parser = argparse.ArgumentParser(**kwargs)`
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00
[tune] Add command line support for choosing early stopping schedulers (#1209) * command line support * add checkpoint freq * fix other flags * fix * docs * doc 2017-11-12 12:05:18 -08:00			`# Note: keep this in sync with rllib/train.py`
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`parser.add_argument(`
			`"--run", default=None, type=str,`
			`help="The algorithm or model to train. This may refer to the name "`
			`"of a built-on algorithm (e.g. RLLib's DQN or PPO), or a "`
			`"user-defined trainable function or class registered in the "`
			`"tune registry.")`
			`parser.add_argument(`
			`"--stop", default="{}", type=json.loads,`
			`help="The stopping criteria, specified in JSON. The keys may be any "`
			`"field in TrainingResult, e.g. "`
			`"'{\"time_total_s\": 600, \"timesteps_total\": 100000}' to stop "`
			`"after 600 seconds or 100k timesteps, whichever is reached first.")`
			`parser.add_argument(`
			`"--config", default="{}", type=json.loads,`
			`help="Algorithm-specific configuration (e.g. env, hyperparams), "`
			`"specified in JSON.")`
			`parser.add_argument(`
			`"--resources", default='{"cpu": 1}', type=json_to_resources,`
			`help="Machine resources to allocate per trial, e.g. "`
			`"'{\"cpu\": 64, \"gpu\": 8}'. Note that GPUs will not be assigned "`
			`"unless you specify them here.")`
			`parser.add_argument(`
			`"--repeat", default=1, type=int,`
			`help="Number of times to repeat each trial.")`
			`parser.add_argument(`
[tune] Clean up result logging: move out of /tmp, add timestamp (#1297) 2017-12-15 14:19:08 -08:00			`"--local-dir", default=DEFAULT_RESULTS_DIR, type=str,`
			`help="Local dir to save training results to. Defaults to '{}'.".format(`
			`DEFAULT_RESULTS_DIR))`
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`parser.add_argument(`
			`"--upload-dir", default="", type=str,`
			`help="Optional URI to upload training results to.")`
			`parser.add_argument(`
			`"--checkpoint-freq", default=0, type=int,`
			`help="How many training iterations between checkpoints. "`
			`"A value of 0 (default) disables checkpointing.")`
			`parser.add_argument(`
			`"--scheduler", default="FIFO", type=str,`
			`help="FIFO (default), MedianStopping, or HyperBand.")`
			`parser.add_argument(`
			`"--scheduler-config", default="{}", type=json.loads,`
			`help="Config options to pass to the scheduler.")`
[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00
[tune] Improve the tune Python API and variant generation (#1154) * new variant gen * wip * Sat Oct 21 18:21:34 PDT 2017 * update * comment * fix * update * update readme * fix * Update README.rst * Update README.rst * fix repeat * update * note on restore 2017-11-06 23:41:17 -08:00			`# Note: this currently only makes sense when running a single trial`
			`parser.add_argument("--restore", default=None, type=str,`
			`help="If specified, restore from this checkpoint.")`

[rllib] Initial work on integrating hyperparameter search tool (#1107) * clean up train * update * update train script * add tuned examples * add agent catalog * add tune lib * update * fix * testS * remove * train docs * comments * todo * fix resource parsing * fix cr test * add test * try to fix travis test 2017-10-13 16:18:16 -07:00			`return parser`