ray/test
Philipp Moritz 791bee343f [rllib] Implement GAE for PPO (#849)
* make information available for GAE

* buggy version of GAE estimator

* fix

* add more logging and reweight losses

* fix logging

* fix loss

* adapt advantage calculation

* update gae

* standardize returns

* don't normalize td lambda ret

* fix

* don't standardize advantages

* do standardization earlier

* different standardization

* initializer

* drop into the debugger

* fix tensorflow broadcasting bug

* vf clipping

* don't standardize tdlambdaret

* different standardization

* use huber loss for value function

* refactor -- first half

* it runs

* fix

* update

* documentation

* linting and tests

* fix linting

* naming

* fix

* linting

* fix

* remove prefix madness

* fixes

* fix

* add value function example

* fix linting

* remove newline
2017-08-23 20:35:47 -07:00
jenkins_tests [rllib] Implement GAE for PPO (#849) 2017-08-23 20:35:47 -07:00
travis-ci Migrate repositories to ray-project. (#438) 2016-09-17 00:52:05 -07:00
actor_test.py Prototype actor checkpointing. (#814) 2017-08-07 17:52:39 -07:00
array_test.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
component_failures_test.py Pull Plasma from Apache Arrow and remove Plasma store from Ray. (#692) 2017-07-31 21:04:15 -07:00
failure_test.py Simplify put test and move it to failure tests. (#788) 2017-07-31 17:57:48 -07:00
microbenchmarks.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
multi_node_test.py Start processes using the same version of Python that was used to start Ray. (#760) 2017-07-21 00:05:10 +00:00
recursion_test.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
runtest.py Adding basic support for a user-interpretable resource label (#761) 2017-08-08 02:53:59 -07:00
stress_tests.py Simplify put test and move it to failure tests. (#788) 2017-07-31 17:57:48 -07:00
tensorflow_test.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00