ray/rllib/utils at 43043ee4d5c672f7fbc22ac9559e8164731fd053 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 ) * WIP. * Fixes. * LINT. * WIP. * WIP. * Fixes. * Fixes. * Fixes. * Fixes. * WIP. * Fixes. * Test * Fix. * Fixes and LINT. * Fixes and LINT. * LINT.		2020-06-30 10:13:20 +02:00
..
exploration	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
schedules	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
spaces	Refactor #8792 to integrate latest master (#8956 )	2020-06-17 10:55:52 +02:00
tests	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
__init__.py	[RLlib] Add testing `Policy.compute_single_action()` for all agents. (#8903 )	2020-06-13 17:51:50 +02:00
actors.py	Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node_ip() (#8839 )	2020-06-08 21:29:46 -07:00
annotations.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
compression.py	Stop vendoring pyarrow (#7233 )	2020-02-19 19:01:26 -08:00
debug.py	[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893 )	2020-06-12 20:17:27 -07:00
deprecation.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
error.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
filter.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
filter_manager.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
framework.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
from_config.py	[RLlib] Make envs specifiable in configs by their class path. (#8750 )	2020-06-03 08:14:29 +02:00
memory.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
numpy.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
policy_client.py	[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590 )	2020-05-30 22:48:34 +02:00
policy_server.py	[rllib] Add high-performance external application connector (#7641 )	2020-03-20 12:43:57 -07:00
sgd.py	WIP. (#8456 )	2020-05-15 21:43:27 +02:00
test_utils.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
tf_ops.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
tf_run_builder.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
timer.py	[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299 )	2020-02-28 16:44:17 -08:00
torch_ops.py	This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970 )	2020-06-27 20:50:01 +02:00
tracking_dict.py	This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970 )	2020-06-27 20:50:01 +02:00
tuple_actions.py	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
types.py	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130 )	2020-06-25 19:01:32 +02:00
window_stat.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00