ray/rllib/examples at 82530641638cb8010eccc196c6b1e56160065830 - hiro/ray

Sven Mika 649580d735 [RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
..
env	[rllib] Improve test learning check, fix flaky two step qmix (#16843 )	2021-07-06 19:39:12 +01:00
export	[RLlib] ONNX export for tensorflow (1.x) and torch (#16805 )	2021-07-13 12:38:11 -04:00
models	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
policy	Revert "RockPaperScissors Pettingzoo" (#16886 )	2021-07-06 09:43:47 -07:00
serving	[RLlib] External env enhancements + more examples. (#16583 )	2021-06-23 09:09:01 +02:00
simulators/sumo	[RLlib] Integration with SUMO Simulator (#11710 )	2020-11-03 09:45:03 +01:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
attention_net.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
attention_net_supervised.py	[RLlib] Support easy `use_attention=True` flag for using the GTrXL model. (#11698 )	2021-01-01 14:06:23 -05:00
autoregressive_action_dist.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
batch_norm_model.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
cartpole_lstm.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
centralized_critic.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
centralized_critic_2.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
checkpoint_by_custom_criteria.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
coin_game_env.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
complex_struct_space.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
curriculum_learning.py	[RLlib] Add simple curriculum learning API and example script. (#15740 )	2021-05-16 17:35:10 +02:00
custom_env.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_eval.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_experiment.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
custom_fast_model.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_input_api.py	[rllib] Enhancements to Input API for customizing offline datasets (#16957 )	2021-07-10 15:05:25 -07:00
custom_keras_model.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_logger.py	[rllib] Read "logger_config" first before "prefix". (#15871 )	2021-05-18 10:50:46 -07:00
custom_loss.py	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
custom_metrics_and_callbacks.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_metrics_and_callbacks_legacy.py	[RLlib] Fix all example scripts to run on GPUs. (#11105 )	2020-10-02 23:07:44 +02:00
custom_model_api.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_model_loss_and_metrics.py	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
custom_observation_filters.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
custom_rnn_model.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
custom_tf_policy.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
custom_torch_policy.py	[RLlib] JAXPolicy prep. PR #1 . (#13077 )	2020-12-26 20:14:18 -05:00
custom_train_fn.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
deterministic_training.py	[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682 )	2021-05-18 11:00:24 +02:00
dmlab_watermaze.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
eager_execution.py	[RLlib] Update tf1.x vs tf2.x documentation and eager example script. (#17030 )	2021-07-13 20:02:17 -04:00
env_rendering_and_recording.py	[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428 )	2021-06-19 08:57:53 +02:00
fractional_gpus.py	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
hierarchical_training.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
iterated_prisoners_dilemma_env.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
lstm_auto_wrapping.py	[RLlib] Preparatory PR for: Documentation on Model Building. (#13260 )	2021-01-08 10:56:09 +01:00
mobilenet_v2_with_lstm.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
multi_agent_cartpole.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
multi_agent_custom_policy.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
multi_agent_independent_learning.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
multi_agent_parameter_sharing.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
multi_agent_two_trainers.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
nested_action_spaces.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
offline_rl.py	[RLlib] CQL TensorFlow support (#15841 )	2021-05-18 11:10:46 +02:00
parallel_evaluation_and_training.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
parametric_actions_cartpole.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
partial_gpus.py	[RLlib] Fix ModelV2 custom metrics for torch. (#16734 )	2021-07-01 13:01:40 +02:00
pettingzoo_env.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
random_parametric_agent.py	[RLLib] Random Parametric Trainer (#11366 )	2020-11-04 11:12:51 +01:00
recsim_with_slateq.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
restore_1_of_n_agents_from_checkpoint.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
rock_paper_scissors_multiagent.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
rollout_worker_custom_workflow.py	[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064 )	2020-12-27 09:46:03 -05:00
saving_experiences.py	[Core] First pass at privatizing non-public Python APIs. (#14607 )	2021-03-10 22:47:28 -08:00
sb2rllib_rllib_example.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
sb2rllib_sb_example.py	[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897 )	2021-05-19 16:36:59 +02:00
self_play_with_open_spiel_connect_4.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
serve_and_rllib.py	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
sumo_env_local.py	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
trajectory_view_api.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
two_step_game.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
two_trainer_workflow.py	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
unity3d_env_local.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
vizdoom_with_attention_net.py	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00