.. |
exploration
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
schedules
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
spaces
|
[RLlib] utils/spaces ... (#8608)
|
2020-05-27 10:21:30 +02:00 |
tests
|
[RLlib] Make envs specifiable in configs by their class path. (#8750)
|
2020-06-03 08:14:29 +02:00 |
__init__.py
|
[RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356)
|
2020-05-08 16:31:31 +02:00 |
actors.py
|
[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139)
|
2020-02-13 22:30:44 -08:00 |
annotations.py
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
compression.py
|
Stop vendoring pyarrow (#7233)
|
2020-02-19 19:01:26 -08:00 |
debug.py
|
[Core/RLlib] Move log_once from rllib to ray.util. (#7273)
|
2020-02-27 10:40:44 -08:00 |
deprecation.py
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
error.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
explained_variance.py
|
[RLlib] Implement PPO torch version. (#6826)
|
2020-01-20 23:06:50 -08:00 |
filter.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
filter_manager.py
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
framework.py
|
[rllib] Add type annotations to Trainer class (#8642)
|
2020-06-03 12:47:35 -07:00 |
from_config.py
|
[RLlib] Make envs specifiable in configs by their class path. (#8750)
|
2020-06-03 08:14:29 +02:00 |
memory.py
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
numpy.py
|
[RLlib] DDPG PyTorch actor-model was missing sigmoid layer (#8188)
|
2020-04-26 23:08:13 +02:00 |
policy_client.py
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
policy_server.py
|
[rllib] Add high-performance external application connector (#7641)
|
2020-03-20 12:43:57 -07:00 |
seed.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
sgd.py
|
WIP. (#8456)
|
2020-05-15 21:43:27 +02:00 |
test_utils.py
|
[RLlib] Fix use_lstm flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734)
|
2020-06-05 15:40:30 +02:00 |
tf_ops.py
|
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
2020-04-29 12:12:59 +02:00 |
tf_run_builder.py
|
[RLlib] SAC add discrete action support. (#7320)
|
2020-03-06 10:37:12 -08:00 |
timer.py
|
[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299)
|
2020-02-28 16:44:17 -08:00 |
torch_ops.py
|
[RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480)
|
2020-06-05 21:07:02 +02:00 |
tracking_dict.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
tuple_actions.py
|
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
|
2020-04-28 14:59:16 +02:00 |
window_stat.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |