ray/rllib/policy at eb8eb2c71a7c4c3bb566a6e0afa84366e7bb29ab - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Sven Mika 5c6d5d4ab1 This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)	2020-06-27 20:50:01 +02:00
tests	[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)	2020-05-27 16:19:13 +02:00
__init__.py	[rllib] Add type annotations for evaluation/, env/ packages (#9003)	2020-06-19 13:09:05 -07:00
dynamic_tf_policy.py	[RLlib] Minor `rllib.utils` cleanup. (#8932)	2020-06-16 08:52:20 +02:00
eager_tf_policy.py	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)	2020-06-25 19:01:32 +02:00
policy.py	This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)	2020-06-27 20:50:01 +02:00
rnn_sequencing.py	[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)	2020-06-12 20:17:27 -07:00
sample_batch.py	[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)	2020-06-12 20:17:27 -07:00
tf_policy.py	[RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480)	2020-06-05 21:07:02 +02:00
tf_policy_template.py	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)	2020-06-25 19:01:32 +02:00
torch_policy.py	This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)	2020-06-27 20:50:01 +02:00
torch_policy_template.py	[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)	2020-06-25 19:01:32 +02:00