ray/rllib/examples
2020-04-20 12:58:01 -07:00
..
export Change /tmp to platform-specific temporary directory (#7529) 2020-03-16 18:10:14 -07:00
serving [rllib] Add high-performance external application connector (#7641) 2020-03-20 12:43:57 -07:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
autoregressive_action_dist.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
batch_norm_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
cartpole_lstm.py [RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) 2020-02-15 14:50:44 -08:00
centralized_critic.py [rllib] Fix truncate episodes mode in central critic example (#8073) 2020-04-20 12:58:01 -07:00
centralized_critic_2.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_env.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
custom_eval.py [RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) 2020-02-15 14:50:44 -08:00
custom_fast_model.py [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
custom_keras_cnn_plus_rnn_model.py [RLlib] Fix broken example: tf-eager with custom-RNN (#6732). (#7021) 2020-02-06 09:44:08 -08:00
custom_keras_model.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
custom_keras_rnn_model.py [RLlib] Working/learning example: PPO + torch + LSTM. (#7797) 2020-03-31 22:00:28 -07:00
custom_loss.py [RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) 2020-02-15 14:50:44 -08:00
custom_metrics_and_callbacks.py [RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972) 2020-04-16 16:06:42 -07:00
custom_metrics_and_callbacks_legacy.py [RLlib] Added DefaultCallbacks which replaces old callbacks dict interface (#6972) 2020-04-16 16:06:42 -07:00
custom_tf_policy.py [RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178) 2020-02-15 14:50:44 -08:00
custom_torch_policy.py [RLlib] Assert correct policy class being used in Worker. (#7769) 2020-03-30 14:03:29 -07:00
custom_torch_rnn_model.py [RLlib] Working/learning example: PPO + torch + LSTM. (#7797) 2020-03-31 22:00:28 -07:00
custom_train_fn.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
dmlab_watermaze.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
eager_execution.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
hierarchical_training.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
multi_agent_cartpole.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
multi_agent_custom_policy.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
multi_agent_two_trainers.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
parametric_action_cartpole.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
random_env.py [RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890) 2020-01-24 10:29:35 -08:00
rock_paper_scissors_multiagent.py [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 2020-04-01 00:43:21 -07:00
rollout_worker_custom_workflow.py [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) 2020-02-22 14:19:49 -08:00
saving_experiences.py Change /tmp to platform-specific temporary directory (#7529) 2020-03-16 18:10:14 -07:00
twostep_game.py [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00