ray/rllib/examples
Kai Fricke dc0b445323
[rllib/release/2.0.0] Fix rllib connect test (#27162)
Why are these changes needed?
Follow-up from #27155 - this will let the connect test pass
2022-07-28 14:23:23 -07:00
bandit [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
connectors [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
documentation [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
env [RLlib] Beef up worker failure test. (#26953) 2022-07-27 00:10:45 -07:00
export [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
inference_and_serving [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
models Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
multi_agent_and_self_play Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153) 2022-02-08 16:43:00 +01:00
policy [RLlib] more connector polishes and fixes. (#26645) 2022-07-19 08:50:28 -07:00
serving [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
simulators/sumo Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00
tune [rllib/release/2.0.0] Fix rllib connect test (#27162) 2022-07-28 14:23:23 -07:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
action_masking.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
attention_net.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
attention_net_supervised.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
autoregressive_action_dist.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
bare_metal_policy_with_custom_view_reqs.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
batch_norm_model.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
cartpole_lstm.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
centralized_critic.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
centralized_critic_2.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
checkpoint_by_custom_criteria.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
coin_game_env.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
complex_struct_space.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
compute_adapted_gae_on_postprocess_trajectory.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
curriculum_learning.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_env.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_eval.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_experiment.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
custom_fast_model.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_input_api.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_keras_model.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_logger.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_loss.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
custom_metrics_and_callbacks.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_metrics_and_callbacks_legacy.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_model_api.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
custom_model_loss_and_metrics.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_observation_filters.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_rnn_model.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_tf_policy.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_torch_policy.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_train_fn.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
custom_vector_env.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
deterministic_training.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
dmlab_watermaze.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
eager_execution.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
env_rendering_and_recording.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
fractional_gpus.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
hierarchical_training.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
iterated_prisoners_dilemma_env.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
lstm_auto_wrapping.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
mobilenet_v2_with_lstm.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_cartpole.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_custom_policy.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_different_spaces_for_agents.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_independent_learning.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_parameter_sharing.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
multi_agent_two_trainers.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
nested_action_spaces.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
offline_rl.py [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
parallel_evaluation_and_training.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
parametric_actions_cartpole.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
parametric_actions_cartpole_embeddings_learnt_by_model.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
partial_gpus.py [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128) 2022-03-15 17:34:21 +01:00
preprocessing_disabled.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
random_parametric_agent.py [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
re3_exploration.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
recommender_system_with_recsim_and_slateq.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
remote_base_env_with_custom_api.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
remote_envs_with_inference_done_on_main_node.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
replay_buffer_api.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
restore_1_of_n_agents_from_checkpoint.py [RLlib] Trainer to Algorithm renaming. (#25539) 2022-06-11 15:10:39 +02:00
rnnsac_stateless_cartpole.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
rock_paper_scissors_multiagent.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
rollout_worker_custom_workflow.py [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
saving_experiences.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
sb2rllib_rllib_example.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
sb2rllib_sb_example.py [RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897) 2021-05-19 16:36:59 +02:00
self_play_league_based_with_open_spiel.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
self_play_with_open_spiel.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
sumo_env_local.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
trajectory_view_api.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
two_step_game.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
two_trainer_workflow.py [RLlib] Get rid of all these deprecation warnings. (#27085) 2022-07-27 10:48:54 -07:00
unity3d_env_local.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00
vizdoom_with_attention_net.py [air] update rllib example to use Tuner API. (#26987) 2022-07-27 12:12:59 +01:00