ray/python
Jones Wong 319c1340cb [rllib] Develop MARWIL (#3635)
* add marvil policy graph
* fix typo
* add offline optimizer and enable running marwil
* fix loss function
* add maintaining the moving average of advantage norm
* use sync replay optimizer for unifying
* remove offline optimizer and use sync replay optimizer
* format by yapf
* add imitation learning objective
* fix according to eric's review
* format by yapf
* revise
* add test data
* marwil
2019-01-16 19:00:43 -08:00
benchmarks Change timeout from milliseconds to seconds in ray.wait. (#3706) 2019-01-08 21:32:08 -08:00
ray [rllib] Develop MARWIL (#3635) 2019-01-16 19:00:43 -08:00
asv.conf.json [asv] Pushing to s3 (#2246) 2018-06-20 10:43:44 -07:00
build-wheel-macos.sh Fix pyarrow version (#3760) 2019-01-13 14:28:23 -08:00
build-wheel-manylinux1.sh Update arrow to reduce plasma IPCs. (#3497) 2018-12-14 23:49:37 -05:00
README-benchmarks.rst [rllib][asv] Support ASV for RLlib (#2304) 2018-06-28 17:20:09 -07:00
README-building-wheels.md [DataFrame] Add Parquet Support in Build Process (#1531) 2018-02-16 07:18:42 -08:00
setup.py Use environment markers to only install faulthandler in Python < 3.3. (#3764) 2019-01-14 15:55:59 +08:00