ray/python
Robert Nishihara 971becc905 [rllib] Use NoFilter instead of MeanStdFilter for PPO. (#1082)
* Make NoFilter the default observation filter for PPO.

* Make reward filter NoFilter for PPO.
2017-10-04 21:31:17 -07:00
ray [rllib] Use NoFilter instead of MeanStdFilter for PPO. (#1082) 2017-10-04 21:31:17 -07:00
build-wheel-macos.sh Clone catapult and generate html files during installation. (#956) 2017-09-10 13:41:16 -07:00
build-wheel-manylinux1.sh Clone catapult and generate html files during installation. (#956) 2017-09-10 13:41:16 -07:00
README-building-wheels.md Add script for building MacOS wheels. (#601) 2017-06-01 00:30:46 +00:00
setup.py Bump version number to 0.2.1. (#1026) 2017-10-01 12:33:13 -07:00