ray/rllib/utils/schedules/exponential_schedule.py

from typing import Optional
from ray.rllib.utils.annotations import override, PublicAPI
from ray.rllib.utils.framework import try_import_torch
from ray.rllib.utils.schedules.schedule import Schedule
from ray.rllib.utils.typing import TensorType
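
# NOTE: try_import_torch() returns (None, None) when PyTorch is not
# installed, so `torch` may be None here (hence the guard in `_value()`).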
torch, _ = try_import_torch()


@PublicAPI
class ExponentialSchedule(Schedule):
"""Exponential decay schedule from `initial_p` to `final_p`.
Reduces output over `schedule_timesteps`. After this many time steps
always returns `final_p`.
"""
    def __init__(
        self,
        schedule_timesteps: int,
        framework: Optional[str] = None,
        initial_p: float = 1.0,
        decay_rate: float = 0.1,
    ):
"""Initializes a ExponentialSchedule instance.
Args:
schedule_timesteps: Number of time steps for which to
linearly anneal initial_p to final_p.
framework: The framework descriptor string, e.g. "tf",
"torch", or None.
initial_p: Initial output value.
decay_rate: The percentage of the original value after
100% of the time has been reached (see formula above).
>0.0: The smaller the decay-rate, the stronger the decay.
1.0: No decay at all.
"""
        super().__init__(framework=framework)
        assert schedule_timesteps > 0
        self.schedule_timesteps = schedule_timesteps
        self.initial_p = initial_p
        self.decay_rate = decay_rate

    @override(Schedule)
    def _value(self, t: TensorType) -> TensorType:
        """Returns the result of: initial_p * decay_rate ** (`t` / t_max)."""
        if self.framework == "torch" and torch and isinstance(t, torch.Tensor):
            t = t.float()
        return self.initial_p * self.decay_rate ** (t / self.schedule_timesteps)
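
A minimal usage sketch (not part of the file above), assuming the `Schedule`
base class exposes the public `value()` method that dispatches to `_value()`;
the expected outputs follow directly from the formula in `_value()`:

from ray.rllib.utils.schedules.exponential_schedule import ExponentialSchedule

# Decay from 1.0 by a factor of 0.1 over 100 time steps; framework=None
# selects the plain-Python code path.
schedule = ExponentialSchedule(
    schedule_timesteps=100, framework=None, initial_p=1.0, decay_rate=0.1
)

print(schedule.value(0))    # 1.0     (= initial_p)
print(schedule.value(50))   # ~0.316  (= 1.0 * 0.1 ** 0.5)
print(schedule.value(100))  # 0.1     (= initial_p * decay_rate)
print(schedule.value(200))  # ~0.01   (decay continues past schedule_timesteps)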