ray/rllib/agents/ppo at e1fc8368d43c211e11470fbf56c66775e6a708cc - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-07 02:51:39 -05:00

Sven Mika e2edca45d4 [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)		2020-02-22 11:02:31 -08:00
* Take out stats to analyze memory leak in torch PPO.
* WIP
* LINT.
* Fix determine_tests_to_run.py.
* minor change to re-test after determine_tests_to_run.py.
* update comments.
* FIX.
* Fix sequence_mask being dependent on torch being installed.
* Fix strange ray-core tf-error in test_memory_scheduling test case.
tests	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)	2020-02-22 11:02:31 -08:00
__init__.py	[rllib] Add Decentralized DDPPO trainer and documentation (#7088)	2020-02-10 15:28:27 -08:00
appo.py	[rllib] implemented compute_advantages without gae (#6941)	2020-01-31 22:25:45 -08:00
appo_policy.py	[rllib] implemented compute_advantages without gae (#6941)	2020-01-31 22:25:45 -08:00
ddppo.py	[rllib] Add Decentralized DDPPO trainer and documentation (#7088)	2020-02-10 15:28:27 -08:00
ppo.py	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)	2020-02-22 11:02:31 -08:00
ppo_tf_policy.py	[RLlib] Implement PPO torch version. (#6826)	2020-01-20 23:06:50 -08:00
ppo_torch_policy.py	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)	2020-02-22 11:02:31 -08:00
utils.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00