ray/test
Alok Singh fd234e3171 [rllib] Fix A3C PyTorch implementation (#2036)
* Use F.softmax instead of a pointless network layer

Stateless functions should not be network layers.

* Use correct pytorch functions

* Rename argument name to out_size

Matches in_size and makes more sense.

* Fix shapes of tensors

Advantages and rewards both should be scalars, and therefore a list of them
should be 1D.

* Fmt

* replace deprecated function

* rm unnecessary Variable wrapper

* rm all use of torch Variables

Torch does this for us now.

* Ensure that values are flat list

* Fix shape error in conv nets

* fmt

* Fix shape errors

Reshaping the action before stepping in the env fixes a few errors.

* Add TODO

* Use correct filter size

Works when `self.config['model']['channel_major'] = True`.

* Add missing channel major

* Revert reshape of action

This should be handled by the agent or at least in a cleaner way that doesn't
break existing envs.

* Squeeze action

* Squeeze actions along first dimension

This should deal with some cases such as cartpole where actions are scalars
while leaving alone cases where actions are arrays (some robotics tasks).

* try adding pytorch tests

* typo

* fixup docker messages

* Fix A3C for some envs

Pendulum doesn't work since it's an edge case (expects singleton arrays, which
`.squeeze()` collapses to scalars).

* fmt

* nit flake

* small lint
2018-05-30 10:48:11 -07:00
..
jenkins_tests [rllib] Fix A3C PyTorch implementation (#2036) 2018-05-30 10:48:11 -07:00
travis-ci Migrate repositories to ray-project. (#438) 2016-09-17 00:52:05 -07:00
actor_test.py Fix support for actor classmethods (#2146) 2018-05-28 17:43:23 -07:00
array_test.py Fix python linting (#2076) 2018-05-16 15:04:31 -07:00
autoscaler_test.py [autoscaler] Remove faulty assert that breaks during downscaling, pull configs from env (#2006) 2018-05-15 12:47:11 -07:00
component_failures_test.py Lint Python files with Yapf (#1872) 2018-04-11 10:11:35 -07:00
credis_test.py Integrate credis with Ray & route task table entries into credis. (#1841) 2018-05-24 23:35:25 -07:00
cython_test.py Add basic functionality for Cython functions and actors (#1193) 2017-11-09 17:49:06 -08:00
failure_test.py Improve error message printing and suppression. (#2104) 2018-05-20 12:13:14 -07:00
microbenchmarks.py Move calls to ray.worker.cleanup into tearDown part of tests for isolation. (#1433) 2018-01-22 22:54:56 -08:00
monitor_test.py Start chain replicated GCS with Ray (#1538) 2018-03-07 10:18:58 -08:00
multi_node_test.py Integrate credis with Ray & route task table entries into credis. (#1841) 2018-05-24 23:35:25 -07:00
recursion_test.py Switch Python indentation from 2 spaces to 4 spaces. (#726) 2017-07-13 21:53:57 +00:00
runtest.py Implement Python global state API for xray. (#2125) 2018-05-29 16:25:54 -07:00
stress_tests.py [xray] Add Travis build for testing xray on Linux. (#2047) 2018-05-13 21:22:01 -07:00
tensorflow_test.py Use flake8-comprehensions (#1976) 2018-05-20 16:15:06 -07:00
xray_test.py Lint Python files with Yapf (#1872) 2018-04-11 10:11:35 -07:00