ray/python at 319c1340cb00bca4653e4557f200658908f1cbba - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

History

Jones Wong 319c1340cb [rllib] Develop MARWIL (#3635 ) * add marvil policy graph * fix typo * add offline optimizer and enable running marwil * fix loss function * add maintaining the moving average of advantage norm * use sync replay optimizer for unifying * remove offline optimizer and use sync replay optimizer * format by yapf * add imitation learning objective * fix according to eric's review * format by yapf * revise * add test data * marwil		2019-01-16 19:00:43 -08:00
..
benchmarks	Change timeout from milliseconds to seconds in ray.wait. (#3706 )	2019-01-08 21:32:08 -08:00
ray	[rllib] Develop MARWIL (#3635 )	2019-01-16 19:00:43 -08:00
asv.conf.json	[asv] Pushing to s3 (#2246 )	2018-06-20 10:43:44 -07:00
build-wheel-macos.sh	Fix pyarrow version (#3760 )	2019-01-13 14:28:23 -08:00
build-wheel-manylinux1.sh	Update arrow to reduce plasma IPCs. (#3497 )	2018-12-14 23:49:37 -05:00
README-benchmarks.rst	[rllib][asv] Support ASV for RLlib (#2304 )	2018-06-28 17:20:09 -07:00
README-building-wheels.md	[DataFrame] Add Parquet Support in Build Process (#1531 )	2018-02-16 07:18:42 -08:00
setup.py	Use environment markers to only install faulthandler in Python < 3.3. (#3764 )	2019-01-14 15:55:59 +08:00