# ray/rllib/tests/test_pettingzoo_env.py

import unittest
from copy import deepcopy

from numpy import float32
from pettingzoo.butterfly import pistonball_v6
from pettingzoo.mpe import simple_spread_v2
from supersuit import normalize_obs_v0, dtype_v0, color_reduction_v0

import ray
from ray.rllib.algorithms.registry import get_algorithm_class
from ray.rllib.env import PettingZooEnv
from ray.tune.registry import register_env


class TestPettingZooEnv(unittest.TestCase):
    def setUp(self) -> None:
        ray.init()

    def tearDown(self) -> None:
        ray.shutdown()

    def test_pettingzoo_pistonball_v6_policies_are_dict_env(self):
        def env_creator(config):
            env = pistonball_v6.env()
            env = dtype_v0(env, dtype=float32)
            env = color_reduction_v0(env, mode="R")
            env = normalize_obs_v0(env)
            return env

        config = deepcopy(get_algorithm_class("PPO").get_default_config())
        config["env_config"] = {"local_ratio": 0.5}

        # Register env.
        register_env("pistonball", lambda config: PettingZooEnv(env_creator(config)))

        env = PettingZooEnv(env_creator(config))
        observation_space = env.observation_space
        action_space = env.action_space
        del env

        config["multiagent"] = {
            # Set up a single, shared policy for all agents.
            "policies": {"av": (None, observation_space, action_space, {})},
            # Map all agents to that policy.
            "policy_mapping_fn": lambda agent_id, episode, **kwargs: "av",
        }
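
        # Alternative setup, sketched here as an assumption (not part of the
        # original test): train one policy per piston agent instead of the
        # single shared "av" policy. PettingZoo envs expose their agent ids
        # via `possible_agents`, so the policies dict could be keyed by them:
        #
        # agent_ids = env_creator(config).possible_agents
        # config["multiagent"] = {
        #     "policies": {
        #         aid: (None, observation_space, action_space, {})
        #         for aid in agent_ids
        #     },
        #     "policy_mapping_fn": lambda agent_id, episode, **kwargs: agent_id,
        # }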
config["log_level"] = "DEBUG"
config["num_workers"] = 1
# Fragment length, collected at once from each worker
# and for each agent!
config["rollout_fragment_length"] = 30
# Training batch size -> Fragments are concatenated up to this point.
config["train_batch_size"] = 200
# After n steps, force reset simulation
config["horizon"] = 200
# Default: False
config["no_done_at_end"] = False
algo = get_algorithm_class("PPO")(env="pistonball", config=config)
algo.train()
algo.stop()

    def test_pettingzoo_env(self):
        register_env("simple_spread", lambda _: PettingZooEnv(simple_spread_v2.env()))

        env = PettingZooEnv(simple_spread_v2.env())
        observation_space = env.observation_space
        action_space = env.action_space
        del env

        agent_class = get_algorithm_class("PPO")
        config = deepcopy(agent_class.get_default_config())

        config["multiagent"] = {
            # Set of policy IDs (by default, will use Trainer's
            # default policy class, the env's obs/act spaces and config={}).
            "policies": {"av": (None, observation_space, action_space, {})},
            # Mapping function that always returns "av" as policy ID to use
            # (for any agent).
            "policy_mapping_fn": lambda agent_id, episode, **kwargs: "av",
        }

        config["log_level"] = "DEBUG"
        config["num_workers"] = 0
        config["rollout_fragment_length"] = 30
        config["train_batch_size"] = 200
        config["horizon"] = 200  # After n steps, force reset simulation.
        config["no_done_at_end"] = False

        agent = agent_class(env="simple_spread", config=config)
        agent.train()
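
# Illustrative sketch, given as an assumption rather than as part of the
# original test: stepping the wrapped env by hand. PettingZooEnv is an RLlib
# MultiAgentEnv, so (in this RLlib version) reset()/step() are expected to
# return dicts keyed by agent id, and step() takes a {agent_id: action} dict
# for the agent(s) currently required to act.
#
# env = PettingZooEnv(simple_spread_v2.env())
# obs = env.reset()
# for _ in range(10):
#     actions = {agent_id: env.action_space.sample() for agent_id in obs}
#     obs, rewards, dones, infos = env.step(actions)
#     if dones["__all__"]:
#         break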


if __name__ == "__main__":
    import pytest
    import sys

    sys.exit(pytest.main(["-v", __file__]))