ray/rllib/utils/exploration
2021-01-18 10:29:03 -08:00
tests [RLlib] Solve PyTorch/TF-eager A3C async race condition between calling model and its value function. (#13467) 2021-01-18 10:29:03 -08:00
__init__.py [RLlib] Curiosity (intrinsic motivation) Exploration module. (#9912) 2020-08-13 20:14:16 +02:00
curiosity.py [RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363) 2021-01-14 14:44:33 +01:00
epsilon_greedy.py Fix the linter failure. (#11755) 2020-11-02 18:02:15 +01:00
exploration.py [RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945) 2020-11-29 12:31:24 +01:00
gaussian_noise.py [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 2020-10-12 13:49:11 -07:00
ornstein_uhlenbeck_noise.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
parameter_noise.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
per_worker_epsilon_greedy.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
per_worker_gaussian_noise.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
per_worker_ornstein_uhlenbeck_noise.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
random.py [RLlib] Support Simplex action spaces for SAC (torch and tf). (#11909) 2020-11-11 18:45:28 +01:00
soft_q.py [RLlib] Exploration class type annotations. (#11251) 2020-10-07 21:59:14 +02:00
stochastic_sampling.py [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 2020-10-12 13:49:11 -07:00