ray/rllib/examples at 2fb53396ad1dc7a5b7bd0f6135b48c4f40c5adf6 - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Sven Mika 446cbdf2e0 [RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890)	2020-01-24 10:29:35 -08:00
* Add `RandomEnv` example to examples folder. Convert warning into Error message when using an LSTM in a non-shared-vf network (after the warning, the program would crash).
* LINT.
* Fix issue #6884. LSTM + non-shared vf NN + PPO crashes when using a Tuple action space.
* LINT
* Change warning message for Model: shared_vf=False, LSTM=True cases.
* Bug fix.
* Add examples/random_env.py test to Jenkins.
export	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
serving	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
autoregressive_action_dist.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
batch_norm_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
cartpole_lstm.py	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
centralized_critic.py	[RLlib] Implement PPO torch version. (#6826)	2020-01-20 23:06:50 -08:00
centralized_critic_2.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_env.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_fast_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_keras_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_keras_rnn_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_loss.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_metrics_and_callbacks.py	Allow EntropyCoeffSchedule to accept custom schedule (#6158)	2019-11-14 00:45:43 -08:00
custom_tf_policy.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_torch_policy.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
custom_train_fn.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
dmlab_watermaze.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
eager_execution.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
hierarchical_training.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
multiagent_cartpole.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
multiagent_custom_policy.py	[RLlib] Policy-classes cleanup and torch/tf unification. (#6770)	2020-01-17 22:26:28 -08:00
multiagent_two_trainers.py	[RLlib] Implement PPO torch version. (#6826)	2020-01-20 23:06:50 -08:00
parametric_action_cartpole.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
random_env.py	[RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890)	2020-01-24 10:29:35 -08:00
rock_paper_scissors_multiagent.py	[RLlib] Policy-classes cleanup and torch/tf unification. (#6770)	2020-01-17 22:26:28 -08:00
rollout_worker_custom_workflow.py	[RLlib] Policy-classes cleanup and torch/tf unification. (#6770)	2020-01-17 22:26:28 -08:00
saving_experiences.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
twostep_game.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00