ray/examples/carla/train_ppo.py

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
from ray.tune import register_env, run_experiments
from env import CarlaEnv, ENV_CONFIG
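
# Override the default Carla environment settings for this training run.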
env_name = "carla_env"
env_config = ENV_CONFIG.copy()
env_config.update({
    "verbose": False,
    "x_res": 80,
    "y_res": 80,
    "use_depth_camera": True,
    "discrete_actions": False,
    "max_steps": 150,
})
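
# Register the environment with Tune under a fixed name so the experiment
# spec below can refer to it as "carla_env".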
register_env(env_name, lambda: CarlaEnv(env_config))
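
# Launch a single PPO trial on the registered Carla env via Tune,
# requesting 4 CPUs and 1 GPU and pinning the optimizer to /gpu:0.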
run_experiments({
    "carla": {
        "run": "PPO",
        "env": "carla_env",
        "resources": {"cpu": 4, "gpu": 1},
        "config": {
            "num_workers": 1,
            "timesteps_per_batch": 2000,
            "min_steps_per_task": 100,
            "lambda": 0.95,
            "clip_param": 0.2,
            "num_sgd_iter": 20,
            "sgd_stepsize": 0.0001,
            "sgd_batchsize": 32,
            "devices": ["/gpu:0"],
            "tf_session_args": {
                "gpu_options": {"allow_growth": True}
            }
        },
    },
}, redirect_output=True)