ray/rllib/__init__.py

import logging

# Note: do not introduce unnecessary library dependencies here, e.g. gym.
# This file is imported from the tune module in order to register RLlib agents.
from ray.rllib.env.base_env import BaseEnv
from ray.rllib.env.external_env import ExternalEnv
from ray.rllib.env.multi_agent_env import MultiAgentEnv
from ray.rllib.env.vector_env import VectorEnv
from ray.rllib.evaluation.rollout_worker import RolloutWorker
from ray.rllib.policy.policy import Policy
from ray.rllib.policy.sample_batch import SampleBatch
from ray.rllib.policy.tf_policy import TFPolicy
from ray.rllib.policy.torch_policy import TorchPolicy
from ray.tune.registry import register_trainable


def _setup_logger():
    logger = logging.getLogger("ray.rllib")
    handler = logging.StreamHandler()
    handler.setFormatter(
        logging.Formatter(
            "%(asctime)s\t%(levelname)s %(filename)s:%(lineno)s -- %(message)s"
        ))
    logger.addHandler(handler)
    logger.propagate = False


def _register_all():
    from ray.rllib.agents.trainer import Trainer, with_common_config
    from ray.rllib.agents.registry import ALGORITHMS, get_trainer_class
    from ray.rllib.contrib.registry import CONTRIBUTED_ALGORITHMS

    for key in list(ALGORITHMS.keys()) + list(CONTRIBUTED_ALGORITHMS.keys(
    )) + ["__fake", "__sigmoid_fake_data", "__parameter_tuning"]:
        register_trainable(key, get_trainer_class(key))

    def _see_contrib(name):
        """Returns dummy agent class warning algo is in contrib/."""

        class _SeeContrib(Trainer):
            _name = "SeeContrib"
            _default_config = with_common_config({})

            def setup(self, config):
                raise NameError(
                    "Please run `contrib/{}` instead.".format(name))

        return _SeeContrib

    # also register the aliases minus contrib/ to give a good error message
    for key in list(CONTRIBUTED_ALGORITHMS.keys()):
        assert key.startswith("contrib/")
        alias = key.split("/", 1)[1]
        register_trainable(alias, _see_contrib(alias))


_setup_logger()
_register_all()

__all__ = [
    "Policy",
    "TFPolicy",
    "TorchPolicy",
    "RolloutWorker",
    "SampleBatch",
    "BaseEnv",
    "MultiAgentEnv",
    "VectorEnv",
    "ExternalEnv",
]
[rllib] switch to python logger (#3098) * logg * set rllib logger * comment * info * rlib * comment * add format * fix lint * add file info * update * add ts * lint * better docs * fix value error * soft log level 2018-10-21 23:43:57 -07:00			`import logging`

[tune] Ray Tune API cleanup (#1454) Remove rllib dep: trainable is now a standalone abstract class that can be easily subclassed. Clean up hyperband: fix debug string and add an example. Remove YAML api / ScriptRunner: this was never really used. Move ray.init() out of run_experiments(): This provides greater flexibility and should be less confusing since there isn't an implicit init() done there. Note that this is a breaking API change for tune. 2018-01-24 16:55:17 -08:00			`# Note: do not introduce unnecessary library dependencies here, e.g. gym.`
			`# This file is imported from the tune module in order to register RLlib agents.`
[rllib] annotate public vs developer vs private APIs (#3808) 2019-01-23 21:27:26 -08:00			`from ray.rllib.env.base_env import BaseEnv`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`from ray.rllib.env.external_env import ExternalEnv`
[rllib] Document "v2" APIs (#2316) * re * wip * wip * a3c working * torch support * pg works * lint * rm v2 * consumer id * clean up pg * clean up more * fix python 2.7 * tf session management * docs * dqn wip * fix compile * dqn * apex runs * up * impotrs * ddpg * quotes * fix tests * fix last r * fix tests * lint * pass checkpoint restore * kwar * nits * policy graph * fix yapf * com * class * pyt * vectorization * update * test cpe * unit test * fix ddpg2 * changes * wip * args * faster test * common * fix * add alg option * batch mode and policy serving * multi serving test * todo * wip * serving test * doc async env * num envs * comments * thread * remove init hook * update * fix ppo * comments1 * fix * updates * add jenkins tests * fix * fix pytorch * fix * fixes * fix a3c policy * fix squeeze * fix trunc on apex * fix squeezing for real * update * remove horizon test for now * multiagent wip * update * fix race condition * fix ma * t * doc * st * wip * example * wip * working * cartpole * wip * batch wip * fix bug * make other_batches None default * working * debug * nit * warn * comments * fix ppo * fix obs filter * update * wip * tf * update * fix * cleanup * cleanup * spacing * model * fix * dqn * fix ddpg * doc * keep names * update * fix * com * docs * clarify model outputs * Update torch_policy_graph.py * fix obs filter * pass thru worker index * fix * rename * vlad torch comments * fix log action * debug name * fix lstm * remove unused ddpg net * remove conv net * revert lstm * wip * wip * cast * wip * works * fix a3c * works * lstm util test * doc * clean up * update * fix lstm check * move to end * fix sphinx * fix cmd * remove bad doc * envs * vec * doc prep * models * rl * alg * up * clarify * copy * async sa * fix * comments * fix a3c conf * tune lstm * fix reshape * fix * back to 16 * tuned a3c update * update * tuned * optional * merge * wip * fix up * move pg class * rename env * wip * update * tip * alg * readme * fix catalog * readme * doc * context * remove prep * comma * add env * link to paper * paper * update * rnn * update * wip * clean up ev creation * fix * fix * fix * fix lint * up * no comma * ma * Update run_multi_node_tests.sh * fix * sphinx is stupid * sphinx is stupid * clarify torch graph * no horizon * fix config * sb * Update test_optimizers.py 2018-07-01 00:05:08 -07:00			`from ray.rllib.env.multi_agent_env import MultiAgentEnv`
			`from ray.rllib.env.vector_env import VectorEnv`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`from ray.rllib.evaluation.rollout_worker import RolloutWorker`
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819) This implements some of the renames proposed in #4813 We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch. 2019-05-20 16:46:05 -07:00			`from ray.rllib.policy.policy import Policy`
			`from ray.rllib.policy.sample_batch import SampleBatch`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`from ray.rllib.policy.tf_policy import TFPolicy`
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 2020-08-20 17:05:57 +02:00			`from ray.rllib.policy.torch_policy import TorchPolicy`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`from ray.tune.registry import register_trainable`
[rllib] Refactor rllib to have a common sample collection pathway (#2149) 2018-06-09 00:21:35 -07:00
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00
[rllib] switch to python logger (#3098) * logg * set rllib logger * comment * info * rlib * comment * add format * fix lint * add file info * update * add ts * lint * better docs * fix value error * soft log level 2018-10-21 23:43:57 -07:00			`def _setup_logger():`
			`logger = logging.getLogger("ray.rllib")`
			`handler = logging.StreamHandler()`
			`handler.setFormatter(`
			`logging.Formatter(`
			`"%(asctime)s\t%(levelname)s %(filename)s:%(lineno)s -- %(message)s"`
			`))`
			`logger.addHandler(handler)`
			`logger.propagate = False`


[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`def _register_all():`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`from ray.rllib.agents.trainer import Trainer, with_common_config`
[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718) 2021-02-08 12:05:16 +01:00			`from ray.rllib.agents.registry import ALGORITHMS, get_trainer_class`
[rllib] [rfc] add contrib module and guideline for merging (#3565) This adds guidelines for merging code into `rllib/contrib` vs `rllib/agents`. Also, clean up the agent import code to make registration easier. 2018-12-21 03:44:34 +09:00			`from ray.rllib.contrib.registry import CONTRIBUTED_ALGORITHMS`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00
[rllib] [rfc] add contrib module and guideline for merging (#3565) This adds guidelines for merging code into `rllib/contrib` vs `rllib/agents`. Also, clean up the agent import code to make registration easier. 2018-12-21 03:44:34 +09:00			`for key in list(ALGORITHMS.keys()) + list(CONTRIBUTED_ALGORITHMS.keys(`
			`)) + ["__fake", "__sigmoid_fake_data", "__parameter_tuning"]:`
[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718) 2021-02-08 12:05:16 +01:00			`register_trainable(key, get_trainer_class(key))`
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`def _see_contrib(name):`
			`"""Returns dummy agent class warning algo is in contrib/."""`

			`class _SeeContrib(Trainer):`
			`_name = "SeeContrib"`
			`_default_config = with_common_config({})`

[tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00			`def setup(self, config):`
MADDPG implementation in RLlib (#5348) 2019-08-06 19:22:06 -04:00			`raise NameError(`
			"Please run `contrib/{}` instead.".format(name))

			`return _SeeContrib`

			`# also register the aliases minus contrib/ to give a good error message`
			`for key in list(CONTRIBUTED_ALGORITHMS.keys()):`
			`assert key.startswith("contrib/")`
			`alias = key.split("/", 1)[1]`
			`register_trainable(alias, _see_contrib(alias))`

[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00
[rllib] switch to python logger (#3098) * logg * set rllib logger * comment * info * rlib * comment * add format * fix lint * add file info * update * add ts * lint * better docs * fix value error * soft log level 2018-10-21 23:43:57 -07:00			`_setup_logger()`
[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00			`_register_all()`
[rllib] Refactor rllib to have a common sample collection pathway (#2149) 2018-06-09 00:21:35 -07:00
			`__all__ = [`
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819) This implements some of the renames proposed in #4813 We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch. 2019-05-20 16:46:05 -07:00			`"Policy",`
			`"TFPolicy",`
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 2020-08-20 17:05:57 +02:00			`"TorchPolicy",`
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00			`"RolloutWorker",`
[rllib] format with yapf (#2427) * initial yapf * manual fix yapf bugs 2018-07-19 15:30:36 -07:00			`"SampleBatch",`
[rllib] annotate public vs developer vs private APIs (#3808) 2019-01-23 21:27:26 -08:00			`"BaseEnv",`
[rllib] format with yapf (#2427) * initial yapf * manual fix yapf bugs 2018-07-19 15:30:36 -07:00			`"MultiAgentEnv",`
			`"VectorEnv",`
[rllib] Rename ServingEnv => ExternalEnv (#3302) 2018-11-12 16:31:27 -08:00			`"ExternalEnv",`
[rllib] Refactor rllib to have a common sample collection pathway (#2149) 2018-06-09 00:21:35 -07:00			`]`