.. |
tests
|
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)
|
2021-03-09 17:26:20 +01:00 |
utils
|
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)
|
2021-03-09 17:26:20 +01:00 |
__init__.py
|
[RLlib] rllib/examples folder restructuring (#8250)
|
2020-05-01 22:59:34 +02:00 |
ant_rand_goal.py
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
cartpole_mass.py
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
coin_game_non_vectorized_env.py
|
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)
|
2021-03-09 17:26:20 +01:00 |
coin_game_vectorized_env.py
|
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)
|
2021-03-09 17:26:20 +01:00 |
correlated_actions_env.py
|
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (autoregressive_action_dist.py) (#17705)
|
2021-08-16 22:08:13 +02:00 |
curriculum_capable_env.py
|
[RLlib] Add simple curriculum learning API and example script. (#15740)
|
2021-05-16 17:35:10 +02:00 |
d4rl_env.py
|
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
|
2021-05-04 19:06:19 +02:00 |
debug_counter_env.py
|
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)
|
2020-12-21 18:38:34 -08:00 |
dm_control_suite.py
|
[RLlib] Env directory cleanup and tests. (#13082)
|
2021-01-19 10:09:39 +01:00 |
env_using_remote_actor.py
|
[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682)
|
2021-05-18 11:00:24 +02:00 |
env_with_subprocess.py
|
[RLlib] Implement DQN PyTorch distributional head. (#9589)
|
2020-07-25 09:29:24 +02:00 |
fast_image_env.py
|
[RLlib] rllib/examples folder restructuring (#8250)
|
2020-05-01 22:59:34 +02:00 |
gpu_requiring_env.py
|
[RLlib] Partial GPU examples (for learner and workers). (#15334)
|
2021-04-20 08:46:05 +02:00 |
halfcheetah_rand_direc.py
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
look_and_push.py
|
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
2020-05-18 17:26:40 +02:00 |
matrix_sequential_social_dilemma.py
|
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)
|
2021-03-09 17:26:20 +01:00 |
mbmpo_env.py
|
[RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037)
|
2021-02-11 18:58:46 +01:00 |
mock_env.py
|
[RLlib] Discussion 2294: Custom vector env example and fix. (#16083)
|
2021-07-28 10:40:04 -04:00 |
multi_agent.py
|
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530)
|
2021-08-03 18:30:02 -04:00 |
nested_space_repeat_after_me_env.py
|
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
2021-08-18 17:21:01 +02:00 |
parametric_actions_cartpole.py
|
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664)
|
2021-07-28 15:25:09 -04:00 |
pendulum_mass.py
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
random_env.py
|
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
2020-10-12 13:49:11 -07:00 |
repeat_after_me_env.py
|
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
2021-08-18 17:21:01 +02:00 |
repeat_initial_obs_env.py
|
[RLlib] rllib/examples folder restructuring (#8250)
|
2020-05-01 22:59:34 +02:00 |
simple_corridor.py
|
[RLlib] Partial GPU examples (for learner and workers). (#15334)
|
2021-04-20 08:46:05 +02:00 |
simple_rpg.py
|
[rllib] Support for complex / variable-length observation spaces (#8393)
|
2020-06-06 12:22:19 +02:00 |
stateless_cartpole.py
|
[RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860)
|
2020-06-16 09:01:20 +02:00 |
transformed_action_space_env.py
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
two_step_game.py
|
[rllib] Improve test learning check, fix flaky two step qmix (#16843)
|
2021-07-06 19:39:12 +01:00 |
windy_maze_env.py
|
[CI] Upgrade flake8 to 3.9.1 (#15527)
|
2021-05-03 14:23:28 -07:00 |