ray/rllib/utils at 79a4eac48c7e94b1e955ec3d1fcfcaad4cf6962c - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

History

Sven Mika baa053496a [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414 )		2020-05-26 11:10:27 +02:00
..
exploration	[RLlib] Issue 8319 DDPG (MA or num_envs_per_worker > 1) broken. (#8324 )	2020-05-08 08:26:32 +02:00
schedules	[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414 )	2020-05-26 11:10:27 +02:00
tests	[RLlib] Add testing framework_iterator. (#7852 )	2020-04-03 12:24:25 -07:00
__init__.py	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
actors.py	[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139 )	2020-02-13 22:30:44 -08:00
annotations.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
compression.py	Stop vendoring pyarrow (#7233 )	2020-02-19 19:01:26 -08:00
debug.py	[Core/RLlib] Move `log_once` from rllib to ray.util. (#7273 )	2020-02-27 10:40:44 -08:00
deprecation.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
error.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
explained_variance.py	[RLlib] Implement PPO torch version. (#6826 )	2020-01-20 23:06:50 -08:00
filter.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
filter_manager.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
framework.py	[RLlib] PyTorch version of ES (Evolution Strategies). (#8104 )	2020-04-20 21:47:28 +02:00
from_config.py	[rllib] Fix torch GPU / yaml load warning (#7278 )	2020-02-23 13:13:43 -08:00
memory.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
numpy.py	[RLlib] DDPG PyTorch actor-model was missing sigmoid layer (#8188 )	2020-04-26 23:08:13 +02:00
policy_client.py	[rllib] Add high-performance external application connector (#7641 )	2020-03-20 12:43:57 -07:00
policy_server.py	[rllib] Add high-performance external application connector (#7641 )	2020-03-20 12:43:57 -07:00
seed.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
sgd.py	WIP. (#8456 )	2020-05-15 21:43:27 +02:00
space_utils.py	[RLlib] `Policy.compute_single_action()` broken for nested actions (Issue 8411). (#8514 )	2020-05-20 22:29:08 +02:00
test_utils.py	[RLlib] Add light-weight `Trainer.compute_action()` tests for all Algos. (#8356 )	2020-05-08 16:31:31 +02:00
tf_ops.py	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
tf_run_builder.py	[RLlib] SAC add discrete action support. (#7320 )	2020-03-06 10:37:12 -08:00
timer.py	[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299 )	2020-02-28 16:44:17 -08:00
torch_ops.py	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
tracking_dict.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
tuple_actions.py	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
window_stat.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00