ray/rllib/models/tf
2020-06-27 20:50:01 +02:00
..
layers [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 2020-06-25 19:01:32 +02:00
__init__.py [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 2020-06-25 19:01:32 +02:00
attention_net.py [RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624) 2020-05-27 10:19:47 +02:00
fcnet.py [RLlib] Fix use_lstm flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734) 2020-06-05 15:40:30 +02:00
fcnet_v1.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
fcnet_v2.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
lstm_v1.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
misc.py [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
modelv1_compat.py [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
recurrent_net.py This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970) 2020-06-27 20:50:01 +02:00
recurrent_tf_model_v2.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
tf_action_dist.py [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 2020-06-25 19:01:32 +02:00
tf_modelv2.py [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
visionnet.py [RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785) 2020-06-20 00:05:19 +02:00
visionnet_v1.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
visionnet_v2.py [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00