ray/rllib/examples
Sven Mika 446cbdf2e0 [RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890)
* Add `RandomEnv` example to examples folder.
Convert warning into Error message when using an LSTM in a non-shared-vf network (after the warning, the program would crash).

* LINT.

* Fix issue #6884. LSTM + non-shared vf NN + PPO crashes when using a Tuple action space.

* LINT

* Change warning message for Model: shared_vf=False, LSTM=True cases.

* Bug fix.

* Add examples/random_env.py test to Jenkins.
2020-01-24 10:29:35 -08:00
..
export Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
serving Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
autoregressive_action_dist.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
batch_norm_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
cartpole_lstm.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
centralized_critic.py [RLlib] Implement PPO torch version. (#6826) 2020-01-20 23:06:50 -08:00
centralized_critic_2.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_env.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_fast_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_keras_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_keras_rnn_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_loss.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_metrics_and_callbacks.py Allow EntropyCoeffSchedule to accept custom schedule (#6158) 2019-11-14 00:45:43 -08:00
custom_tf_policy.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_torch_policy.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_train_fn.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
dmlab_watermaze.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
eager_execution.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
hierarchical_training.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
multiagent_cartpole.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
multiagent_custom_policy.py [RLlib] Policy-classes cleanup and torch/tf unification. (#6770) 2020-01-17 22:26:28 -08:00
multiagent_two_trainers.py [RLlib] Implement PPO torch version. (#6826) 2020-01-20 23:06:50 -08:00
parametric_action_cartpole.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
random_env.py [RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890) 2020-01-24 10:29:35 -08:00
rock_paper_scissors_multiagent.py [RLlib] Policy-classes cleanup and torch/tf unification. (#6770) 2020-01-17 22:26:28 -08:00
rollout_worker_custom_workflow.py [RLlib] Policy-classes cleanup and torch/tf unification. (#6770) 2020-01-17 22:26:28 -08:00
saving_experiences.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
twostep_game.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00