ray/test/jenkins_tests
Philipp Moritz 791bee343f [rllib] Implement GAE for PPO (#849)
* make information available for GAE

* buggy version of GAE estimator

* fix

* add more logging and reweight losses

* fix logging

* fix loss

* adapt advantage calculation

* update gae

* standardize returns

* don't normalize td lambda ret

* fix

* don't standardize advantages

* do standardization earlier

* different standardization

* initializer

* drop into the debugger

* fix tensorflow broadcasting bug

* vf clipping

* don't standardize tdlambdaret

* different standardization

* use huber loss for value function

* refactor -- first half

* it runs

* fix

* update

* documentation

* linting and tests

* fix linting

* naming

* fix

* linting

* fix

* remove prefix madness

* fixes

* fix

* add value function example

* fix linting

* remove newline
2017-08-23 20:35:47 -07:00
multi_node_tests Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
multi_node_docker_test.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
run_multi_node_tests.sh [rllib] Implement GAE for PPO (#849) 2017-08-23 20:35:47 -07:00