.. |
env
|
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664)
|
2021-07-28 15:25:09 -04:00 |
export
|
[RLlib] ONNX export for tensorflow (1.x) and torch (#16805)
|
2021-07-13 12:38:11 -04:00 |
models
|
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664)
|
2021-07-28 15:25:09 -04:00 |
policy
|
[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896)
|
2021-07-22 10:55:07 -04:00 |
serving
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
simulators/sumo
|
[RLlib] Integration with SUMO Simulator (#11710)
|
2020-11-03 09:45:03 +01:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
attention_net.py
|
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038)
|
2021-07-26 22:25:48 -04:00 |
attention_net_supervised.py
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
autoregressive_action_dist.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
batch_norm_model.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
cartpole_lstm.py
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
centralized_critic.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
centralized_critic_2.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
checkpoint_by_custom_criteria.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
coin_game_env.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
complex_struct_space.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
curriculum_learning.py
|
[RLlib] Add simple curriculum learning API and example script. (#15740)
|
2021-05-16 17:35:10 +02:00 |
custom_env.py
|
[RLlib] Discussion 2294: Custom vector env example and fix. (#16083)
|
2021-07-28 10:40:04 -04:00 |
custom_eval.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_experiment.py
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
custom_fast_model.py
|
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
2021-07-20 14:58:13 -04:00 |
custom_input_api.py
|
[rllib] Enhancements to Input API for customizing offline datasets (#16957)
|
2021-07-10 15:05:25 -07:00 |
custom_keras_model.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_logger.py
|
[rllib] Read "logger_config" first before "prefix". (#15871)
|
2021-05-18 10:50:46 -07:00 |
custom_loss.py
|
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
2021-07-01 13:01:40 +02:00 |
custom_metrics_and_callbacks.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_metrics_and_callbacks_legacy.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_model_api.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_model_loss_and_metrics.py
|
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
2021-07-01 13:01:40 +02:00 |
custom_observation_filters.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_rnn_model.py
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
custom_tf_policy.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
custom_torch_policy.py
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
custom_train_fn.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
custom_vector_env.py
|
[RLlib] Discussion 2294: Custom vector env example and fix. (#16083)
|
2021-07-28 10:40:04 -04:00 |
deterministic_training.py
|
[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682)
|
2021-05-18 11:00:24 +02:00 |
dmlab_watermaze.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
eager_execution.py
|
[RLlib] Update tf1.x vs tf2.x documentation and eager example script. (#17030)
|
2021-07-13 20:02:17 -04:00 |
env_rendering_and_recording.py
|
[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428)
|
2021-06-19 08:57:53 +02:00 |
fractional_gpus.py
|
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
2021-07-01 13:01:40 +02:00 |
hierarchical_training.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
iterated_prisoners_dilemma_env.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
lstm_auto_wrapping.py
|
[RLlib] Preparatory PR for: Documentation on Model Building. (#13260)
|
2021-01-08 10:56:09 +01:00 |
mobilenet_v2_with_lstm.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
multi_agent_cartpole.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
multi_agent_custom_policy.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
multi_agent_independent_learning.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
multi_agent_parameter_sharing.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
multi_agent_two_trainers.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
nested_action_spaces.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
offline_rl.py
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
parallel_evaluation_and_training.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
parametric_actions_cartpole.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
parametric_actions_cartpole_embeddings_learnt_by_model.py
|
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664)
|
2021-07-28 15:25:09 -04:00 |
partial_gpus.py
|
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
2021-07-01 13:01:40 +02:00 |
pettingzoo_env.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
random_parametric_agent.py
|
[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896)
|
2021-07-22 10:55:07 -04:00 |
recsim_with_slateq.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
remote_vector_env_with_custom_api.py
|
[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118)
|
2021-07-25 16:55:51 -04:00 |
restore_1_of_n_agents_from_checkpoint.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
rnnsac_stateless_cartpole.py
|
[RLlib] Add RNN-SAC agent (#16577)
|
2021-07-25 10:04:52 -04:00 |
rock_paper_scissors_multiagent.py
|
[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896)
|
2021-07-22 10:55:07 -04:00 |
rollout_worker_custom_workflow.py
|
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064)
|
2020-12-27 09:46:03 -05:00 |
saving_experiences.py
|
[Core] First pass at privatizing non-public Python APIs. (#14607)
|
2021-03-10 22:47:28 -08:00 |
sb2rllib_rllib_example.py
|
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038)
|
2021-07-26 22:25:48 -04:00 |
sb2rllib_sb_example.py
|
[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897)
|
2021-05-19 16:36:59 +02:00 |
self_play_league_based_with_open_spiel.py
|
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077)
|
2021-07-22 10:59:13 -04:00 |
self_play_with_open_spiel.py
|
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077)
|
2021-07-22 10:59:13 -04:00 |
serve_and_rllib.py
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
sumo_env_local.py
|
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
2021-06-21 13:46:01 +02:00 |
trajectory_view_api.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
two_step_game.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
two_trainer_workflow.py
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
unity3d_env_local.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |
vizdoom_with_attention_net.py
|
[RLlib] Examples scripts add argparse help and replace --torch with --framework . (#15832)
|
2021-05-18 13:18:12 +02:00 |