mirror of
https://github.com/vale981/ray
synced 2025-03-06 10:31:39 -05:00
![]() * make information available for GAE * buggy version of GAE estimator * fix * add more logging and reweight losses * fix logging * fix loss * adapt advantage calculation * update gae * standardize returns * don't normalize td lambda ret * fix * don't standardize advantages * do standardization earlier * different standardization * initializer * drop into the debugger * fix tensorflow broadcasting bug * vf clipping * don't standardize tdlambdaret * different standardization * use huber loss for value function * refactor -- first half * it runs * fix * update * documentation * linting and tests * fix linting * naming * fix * linting * fix * remove prefix madness * fixes * fix * add value function example * fix linting * remove newline |
||
---|---|---|
.. | ||
jenkins_tests | ||
travis-ci | ||
actor_test.py | ||
array_test.py | ||
component_failures_test.py | ||
failure_test.py | ||
microbenchmarks.py | ||
multi_node_test.py | ||
recursion_test.py | ||
runtest.py | ||
stress_tests.py | ||
tensorflow_test.py |