| Name | Last commit message | Last commit date |
| --- | --- | --- |
| bandit | [RLlib] fix bandit pre-merge tests (#27554) | 2022-08-07 17:48:29 -07:00 |
| connectors | [docs][rllib] Documentation for connectors. (#27528) | 2022-08-19 14:35:07 -07:00 |
| documentation | [RLlib] Get rid of all these deprecation warnings. (#27085) | 2022-07-27 10:48:54 -07:00 |
| env | [RLlib] Eval workers use async req manager. (#27390) | 2022-08-16 12:05:55 +02:00 |
| export | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| inference_and_serving | [RLlib] Add Decision Transformer (DT) (#27890) | 2022-08-17 13:49:13 -07:00 |
| models | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00 |
| multi_agent_and_self_play | Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153) | 2022-02-08 16:43:00 +01:00 |
| policy | [RLlib] Fix test_ope flakiness (#27676) | 2022-08-09 16:12:30 -07:00 |
| serving | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| simulators/sumo | Clean up docstyle in python modules and add LINT rule (#25272) | 2022-06-01 11:27:54 -07:00 |
| tune | [rllib/release] Fix rllib connect test with Tuner() API (#27155) | 2022-07-28 11:08:02 +01:00 |
| __init__.py | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00 |
| action_masking.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| attention_net.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| attention_net_supervised.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00 |
| autoregressive_action_dist.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| bare_metal_policy_with_custom_view_reqs.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| batch_norm_model.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| cartpole_lstm.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| centralized_critic.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| centralized_critic_2.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| checkpoint_by_custom_criteria.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| coin_game_env.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| complex_struct_space.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| compute_adapted_gae_on_postprocess_trajectory.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| curriculum_learning.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_env.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_eval.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_experiment.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| custom_fast_model.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_input_api.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| custom_keras_model.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| custom_logger.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_loss.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00 |
| custom_metrics_and_callbacks.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_metrics_and_callbacks_legacy.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_model_api.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00 |
| custom_model_loss_and_metrics.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_observation_filters.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_rnn_model.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_tf_policy.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_torch_policy.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_train_fn.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| custom_vector_env.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| deterministic_training.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| dmlab_watermaze.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00 |
| eager_execution.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| env_rendering_and_recording.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| fractional_gpus.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| hierarchical_training.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| iterated_prisoners_dilemma_env.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| lstm_auto_wrapping.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| mobilenet_v2_with_lstm.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| multi_agent_cartpole.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| multi_agent_custom_policy.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| multi_agent_different_spaces_for_agents.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| multi_agent_independent_learning.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| multi_agent_parameter_sharing.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| multi_agent_two_trainers.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| nested_action_spaces.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| offline_rl.py | [RLlib] Remove unneeded args from offline learning examples. (#26666) | 2022-08-17 17:59:27 +02:00 |
| parallel_evaluation_and_training.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| parametric_actions_cartpole.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| parametric_actions_cartpole_embeddings_learnt_by_model.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| partial_gpus.py | [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128) | 2022-03-15 17:34:21 +01:00 |
| preprocessing_disabled.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| random_parametric_agent.py | [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) | 2022-06-20 15:54:00 +02:00 |
| re3_exploration.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| recommender_system_with_recsim_and_slateq.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| remote_base_env_with_custom_api.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| remote_envs_with_inference_done_on_main_node.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| replay_buffer_api.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| restore_1_of_n_agents_from_checkpoint.py | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
| rnnsac_stateless_cartpole.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| rock_paper_scissors_multiagent.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| rollout_worker_custom_workflow.py | [RLlib] Get rid of all these deprecation warnings. (#27085) | 2022-07-27 10:48:54 -07:00 |
| saving_experiences.py | [CI] Format Python code with Black (#21975) | 2022-01-29 18:41:57 -08:00 |
| sb2rllib_rllib_example.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| sb2rllib_sb_example.py | [RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897) | 2021-05-19 16:36:59 +02:00 |
| self_play_league_based_with_open_spiel.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| self_play_with_open_spiel.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| sumo_env_local.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| trajectory_view_api.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| two_step_game.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| two_trainer_workflow.py | [RLlib] Move learning_starts logic from buffers into training_step() . (#26032) | 2022-08-11 13:07:30 +02:00 |
| unity3d_env_local.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
| vizdoom_with_attention_net.py | [air] update rllib example to use Tuner API. (#26987) | 2022-07-27 12:12:59 +01:00 |
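Most of the example scripts above were last touched by the migration to the Ray AIR `Tuner` API (#26987), which replaced the older `tune.run()` entry point. The sketch below shows the general shape of that pattern, assuming Ray 2.x; the algorithm name, `param_space` values, and stopping criterion are illustrative placeholders, not taken from any particular example file.

```python
from ray import air, tune

# Minimal sketch of the Tuner-API pattern the migrated examples follow.
# "PPO", the config values, and the stop criterion below are assumptions
# chosen for illustration only.
tuner = tune.Tuner(
    "PPO",  # RLlib algorithm to train
    param_space={
        "env": "CartPole-v1",
        "num_workers": 2,
        "framework": "torch",
    },
    run_config=air.RunConfig(
        stop={"episode_reward_mean": 150.0},  # end trials once this mean return is reached
    ),
)
results = tuner.fit()  # runs the trial(s) and returns a ResultGrid
```

Under this API, the per-algorithm configuration goes into `param_space` and experiment-level settings (stopping, checkpointing, logging) into `run_config`, which is the split the updated example scripts generally adopt.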