ray/rllib/algorithms/mbmpo/tests/test_mbmpo.py

import unittest

import ray
import ray.rllib.algorithms.mbmpo as mbmpo
from ray.rllib.utils.test_utils import (
    check_compute_single_action,
    check_train_results,
    framework_iterator,
)


class TestMBMPO(unittest.TestCase):
    @classmethod
    def setUpClass(cls):
        ray.init()

    @classmethod
    def tearDownClass(cls):
        ray.shutdown()

    def test_mbmpo_compilation(self):
        """Test whether MBMPO can be built with all frameworks."""
        config = (
            mbmpo.MBMPOConfig()
            .rollouts(num_rollout_workers=2, horizon=200)
            .training(dynamics_model={"ensemble_size": 2})
            .environment(env="ray.rllib.examples.env.mbmpo_env.CartPoleWrapper")
        )
        num_iterations = 1

        # Test for torch framework (tf not implemented yet).
        for _ in framework_iterator(config, frameworks="torch"):
            trainer = config.build()

            for i in range(num_iterations):
                results = trainer.train()
                check_train_results(results)
                print(results)

            check_compute_single_action(trainer, include_prev_action_reward=False)
            trainer.stop()


if __name__ == "__main__":
    import pytest
    import sys

    sys.exit(pytest.main(["-v", __file__]))
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00			`import unittest`

			`import ray`
[RLlib] Retry agents -> algorithms. with proper doc changes this time. (#24797) 2022-05-16 00:45:32 -07:00			`import ray.rllib.algorithms.mbmpo as mbmpo`
[CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. 2022-01-29 18:41:57 -08:00			`from ray.rllib.utils.test_utils import (`
			`check_compute_single_action,`
			`check_train_results,`
			`framework_iterator,`
			`)`
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00

			`class TestMBMPO(unittest.TestCase):`
			`@classmethod`
			`def setUpClass(cls):`
			`ray.init()`

			`@classmethod`
			`def tearDownClass(cls):`
			`ray.shutdown()`

			`def test_mbmpo_compilation(self):`
[RLlib] Move all remaining algos into `algorithms` directory. (#25366) 2022-06-04 07:35:24 +02:00			`"""Test whether MBMPO can be built with all frameworks."""`
[RLlib] MB-MPO TrainerConfig objects. (#25278) 2022-05-30 17:33:01 +02:00			`config = (`
			`mbmpo.MBMPOConfig()`
			`.rollouts(num_rollout_workers=2, horizon=200)`
			`.training(dynamics_model={"ensemble_size": 2})`
			`.environment(env="ray.rllib.examples.env.mbmpo_env.CartPoleWrapper")`
			`)`
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00			`num_iterations = 1`

			`# Test for torch framework (tf not implemented yet).`
			`for _ in framework_iterator(config, frameworks="torch"):`
[RLlib] MB-MPO TrainerConfig objects. (#25278) 2022-05-30 17:33:01 +02:00			`trainer = config.build()`
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 2021-09-30 16:39:05 +02:00
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00			`for i in range(num_iterations):`
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 2021-09-30 16:39:05 +02:00			`results = trainer.train()`
			`check_train_results(results)`
			`print(results)`

[CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. 2022-01-29 18:41:57 -08:00			`check_compute_single_action(trainer, include_prev_action_reward=False)`
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00			`trainer.stop()`


			`if __name__ == "__main__":`
			`import pytest`
			`import sys`
[CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. 2022-01-29 18:41:57 -08:00
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00			`sys.exit(pytest.main(["-v", __file__]))`