ray/rllib/agents/ppo
Sven Mika e2edca45d4
[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)
* Take out stats to analyze memory leak in torch PPO.

* WIP

* LINT.

* Fix determine_tests_to_run.py.

* minor change to re-test after determine_tests_to_run.py.

* LINT.

* update comments.

* WIP

* FIX.

* Fix sequence_mask being dependent on torch being installed.

* Fix strange ray-core tf-error in test_memory_scheduling test case.

2020-02-22 11:02:31 -08:00
tests [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) 2020-02-22 11:02:31 -08:00
__init__.py [rllib] Add Decentralized DDPPO trainer and documentation (#7088) 2020-02-10 15:28:27 -08:00
appo.py [rllib] implemented compute_advantages without gae (#6941) 2020-01-31 22:25:45 -08:00
appo_policy.py [rllib] implemented compute_advantages without gae (#6941) 2020-01-31 22:25:45 -08:00
ddppo.py [rllib] Add Decentralized DDPPO trainer and documentation (#7088) 2020-02-10 15:28:27 -08:00
ppo.py [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) 2020-02-22 11:02:31 -08:00
ppo_tf_policy.py [RLlib] Implement PPO torch version. (#6826) 2020-01-20 23:06:50 -08:00
ppo_torch_policy.py [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) 2020-02-22 11:02:31 -08:00
utils.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00