.. |
exploration
|
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
2020-02-22 14:19:49 -08:00 |
schedules
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
tests
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
__init__.py
|
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
2020-02-22 14:19:49 -08:00 |
actors.py
|
[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139)
|
2020-02-13 22:30:44 -08:00 |
annotations.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
compression.py
|
Stop vendoring pyarrow (#7233)
|
2020-02-19 19:01:26 -08:00 |
debug.py
|
[Core/RLlib] Move log_once from rllib to ray.util. (#7273)
|
2020-02-27 10:40:44 -08:00 |
deprecation.py
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
error.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
experimental_dsl.py
|
[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)
|
2020-02-19 16:07:37 -08:00 |
explained_variance.py
|
[RLlib] Implement PPO torch version. (#6826)
|
2020-01-20 23:06:50 -08:00 |
filter.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
filter_manager.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
framework.py
|
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
2020-02-22 14:19:49 -08:00 |
from_config.py
|
[rllib] Fix torch GPU / yaml load warning (#7278)
|
2020-02-23 13:13:43 -08:00 |
memory.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
numpy.py
|
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
2020-02-22 14:19:49 -08:00 |
policy_client.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
policy_server.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
seed.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
sgd.py
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
test_utils.py
|
[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)
|
2020-02-22 11:02:31 -08:00 |
tf_ops.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
tf_run_builder.py
|
[Core/RLlib] Move log_once from rllib to ray.util. (#7273)
|
2020-02-27 10:40:44 -08:00 |
timer.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
torch_ops.py
|
[rllib] Fix torch GPU / yaml load warning (#7278)
|
2020-02-23 13:13:43 -08:00 |
tracking_dict.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
tuple_actions.py
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
window_stat.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |