.. |
env
|
[RLlib] Dreamer (#10172)
|
2020-08-26 13:24:05 +02:00 |
export
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
models
|
[rllib] Try fixing torch GPU and masking errors (#10168)
|
2020-08-25 18:34:19 -07:00 |
policy
|
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
2020-08-21 12:35:16 +02:00 |
serving
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
attention_net.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
attention_net_supervised.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
autoregressive_action_dist.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
batch_norm_model.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
cartpole_lstm.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
centralized_critic.py
|
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
|
2020-09-02 14:03:01 +02:00 |
centralized_critic_2.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
complex_struct_space.py
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
custom_env.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
custom_eval.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
custom_fast_model.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
custom_keras_model.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
custom_loss.py
|
[rllib] Remove extra model config kwargs passed incorrectly for Torch models (#10055)
|
2020-08-17 11:12:20 -07:00 |
custom_metrics_and_callbacks.py
|
[RLlib] Issue 10469: Callbacks should receive env idx ... (#10477)
|
2020-09-03 17:27:05 +02:00 |
custom_metrics_and_callbacks_legacy.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
custom_rnn_model.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
custom_tf_policy.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
custom_torch_policy.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
custom_train_fn.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
dmlab_watermaze.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
eager_execution.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
hierarchical_training.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
mobilenet_v2_with_lstm.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
multi_agent_cartpole.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
multi_agent_custom_policy.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
multi_agent_independent_learning.py
|
Multi-agent Algorithm Documentation Updates (#9722)
|
2020-09-03 22:37:46 -07:00 |
multi_agent_parameter_sharing.py
|
Multi-agent Algorithm Documentation Updates (#9722)
|
2020-09-03 22:37:46 -07:00 |
multi_agent_two_trainers.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
nested_action_spaces.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
parametric_actions_cartpole.py
|
[RLlib] Issue #9366 (DQN w/o dueling produces invalid actions). (#9386)
|
2020-07-10 12:43:03 +02:00 |
pettingzoo_env.py
|
Fixed pettingzoo env example (#9973)
|
2020-08-25 13:22:25 +02:00 |
rock_paper_scissors_multiagent.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
rollout_worker_custom_workflow.py
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
saving_experiences.py
|
Change /tmp to platform-specific temporary directory (#7529)
|
2020-03-16 18:10:14 -07:00 |
two_step_game.py
|
[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527)
|
2020-07-17 12:14:34 +02:00 |
two_trainer_workflow.py
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
unity3d_env_local.py
|
[RLlib] Unity3d soccer benchmarks (#8834)
|
2020-06-11 14:29:57 +02:00 |