.. |
env
|
[RLlib] MAML: Add cartpole mass test for PyTorch. (#13679)
|
2021-01-25 12:32:41 +01:00 |
export
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
models
|
[RLlib] Trajectory view API example script (enhancements and tf2 support). (#13786)
|
2021-02-02 18:42:18 +01:00 |
policy
|
[RLlib] Trajectory view API docs. (#12718)
|
2020-12-30 17:32:21 -08:00 |
serving
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
simulators/sumo
|
[RLlib] Integration with SUMO Simulator (#11710)
|
2020-11-03 09:45:03 +01:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
attention_net.py
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
attention_net_supervised.py
|
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
|
2021-01-01 14:06:23 -05:00 |
autoregressive_action_dist.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
batch_norm_model.py
|
[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363)
|
2021-01-14 14:44:33 +01:00 |
cartpole_lstm.py
|
[RLlib] Deprecate vf_share_layers in top-level PPO/MAML/MB-MPO configs. (#13397)
|
2021-01-19 09:51:35 +01:00 |
centralized_critic.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
centralized_critic_2.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
complex_struct_space.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_env.py
|
[RLlib] Deprecate vf_share_layers in top-level PPO/MAML/MB-MPO configs. (#13397)
|
2021-01-19 09:51:35 +01:00 |
custom_eval.py
|
[RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063)
|
2020-11-19 19:01:14 +01:00 |
custom_fast_model.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_keras_model.py
|
[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363)
|
2021-01-14 14:44:33 +01:00 |
custom_loss.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_metrics_and_callbacks.py
|
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)
|
2020-12-21 18:38:34 -08:00 |
custom_metrics_and_callbacks_legacy.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_model_api.py
|
[RLlib] Add more detailed Documentation on Model building API (#13261)
|
2021-01-09 12:38:29 +01:00 |
custom_observation_filters.py
|
[rllib] Rrk/12079 custom filters (#12095)
|
2020-11-19 13:20:20 -08:00 |
custom_rnn_model.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
custom_tf_policy.py
|
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
2020-10-06 20:28:16 +02:00 |
custom_torch_policy.py
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
custom_train_fn.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
dmlab_watermaze.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
eager_execution.py
|
[RLlib] Fix RNN learning for tf-eager/tf2.x. (#11720)
|
2020-11-02 11:18:41 +01:00 |
hierarchical_training.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
lstm_auto_wrapping.py
|
[RLlib] Preparatory PR for: Documentation on Model Building. (#13260)
|
2021-01-08 10:56:09 +01:00 |
mobilenet_v2_with_lstm.py
|
[RLlib] Deprecate vf_share_layers in top-level PPO/MAML/MB-MPO configs. (#13397)
|
2021-01-19 09:51:35 +01:00 |
multi_agent_cartpole.py
|
[RLlib] Issue 12233 shared tf layers example not really shared (only works for tf1.x, not tf2.x). (#12399)
|
2020-11-25 11:27:19 -08:00 |
multi_agent_custom_policy.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
multi_agent_independent_learning.py
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
multi_agent_parameter_sharing.py
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
multi_agent_two_trainers.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
nested_action_spaces.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
parametric_actions_cartpole.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
pettingzoo_env.py
|
Updated pettingzoo env to acomidate api changes and fixes (#11873)
|
2020-11-09 16:09:49 -08:00 |
random_parametric_agent.py
|
[RLLib] Random Parametric Trainer (#11366)
|
2020-11-04 11:12:51 +01:00 |
rock_paper_scissors_multiagent.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
rollout_worker_custom_workflow.py
|
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064)
|
2020-12-27 09:46:03 -05:00 |
saving_experiences.py
|
[rllib] Add execution module to package ref (#10941)
|
2020-09-21 23:03:06 -07:00 |
slateq.py
|
[RLlib] Implement the SlateQ algorithm (#11450)
|
2020-11-03 09:52:04 +01:00 |
sumo_env_local.py
|
[RLlib] Integration with SUMO Simulator (#11710)
|
2020-11-03 09:45:03 +01:00 |
trajectory_view_api.py
|
[RLlib] Trajectory view API example script (enhancements and tf2 support). (#13786)
|
2021-02-02 18:42:18 +01:00 |
two_step_game.py
|
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
2020-10-02 23:07:44 +02:00 |
two_trainer_workflow.py
|
[RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420)
|
2020-12-08 16:41:45 -08:00 |
unity3d_env_local.py
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |