hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-07 02:51:39 -05:00

Author	SHA1	Message	Date
Richard Liaw	cdc9227f1b	[tune] ASHA xgboost and lightgbm examples (#5500 )	2019-08-22 10:37:59 -07:00
Robert Nishihara	851c5b2dae	Add a script for benchmarking performance for Ray developers. (#5472 )	2019-08-19 23:41:23 -07:00
Richard Liaw	d7b309223b	[tune] MLFlow Logger (#5438 )	2019-08-14 15:58:18 -07:00
Lisa Dunlap	b7d0733362	[tune] Implement BOHB (#5382 )	2019-08-13 12:32:07 -07:00
Eric Liang	a1d2e17623	[rllib] Autoregressive action distributions (#5304 )	2019-08-10 14:05:12 -07:00
jichan3751	de95117e96	[sgd] Tune interface for Pytorch MultiNode SGD (#5350 )	2019-08-10 13:51:44 -07:00
Simon Mo	18f1e904de	Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409 )	2019-08-09 11:37:19 -07:00
Eric Liang	592f313210	[rllib] Centralized critic / PPO example on TwoStepGame (#5392 )	2019-08-08 14:03:28 -07:00
Wonseok Jeon	281829e712	MADDPG implementation in RLlib (#5348 )	2019-08-06 16:22:06 -07:00
Eric Liang	5d7afe8092	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
Richard Liaw	1eaa57c98f	[tune] Distributed example + walkthrough (#5157 )	2019-08-02 09:17:20 -07:00
Eric Liang	3bdd114282	[rllib] Better example rnn envs (#5300 )	2019-07-28 14:07:18 -07:00
Eric Liang	a62c5f40f6	[rllib] Document ModelV2 and clean up the models/ directory (#5277 )	2019-07-27 02:08:16 -07:00
Richard Liaw	7e715520e5	[sgd] Example for Training (#5292 )	2019-07-27 01:10:25 -07:00
Eric Liang	f9043cc49a	[rllib] Remove experimental eager support	2019-07-21 12:27:17 -07:00
Jones Wong	0af07bd493	Enable seeding actors for reproducible experiments (#5197 ) * enable graph-level worker-specific seed * lint checked * revised according to eric's suggestions * revised accordingly and added a test case * formated * Update test_reproducibility.py * Update trainer.py * Update rollout_worker.py * Update run_rllib_tests.sh * Update worker_set.py	2019-07-17 23:31:34 -07:00
Richard Liaw	b6509f46b0	Update wheels to 0.8.0dev2 (#5186 )	2019-07-12 17:27:03 -07:00
Richard Liaw	0b540ab492	[tune] Test example checkpointing (#4728 )	2019-07-10 01:58:26 -07:00
Eric Liang	34d054ff19	[rllib] ModelV2 API (#4926 )	2019-07-03 15:59:47 -07:00
Richard Liaw	b1827d5fbe	[tune] Update MNIST Example (#4991 )	2019-06-25 22:50:15 -07:00
Richard Liaw	bd8aceb896	[ci] Change Jenkins to py3 (#5022 ) * conda3 * integration * add nevergrad, remotedata * pytest 0.3.1 * otherdockers * setup * tune	2019-06-24 21:50:37 -07:00
Eric Liang	9e328fbe6f	[rllib] Add docs on how to use TF eager execution (#4927 )	2019-06-07 16:42:37 -07:00
Robert Nishihara	c3f8fc1c44	Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941 )	2019-06-06 17:22:45 -07:00
Eric Liang	7501ee51db	[rllib] Rename PolicyEvaluator => RolloutWorker (#4820 )	2019-06-03 06:49:24 +08:00
Peter Schafhalter	c2ade075a3	[sgd] Distributed Training via PyTorch (#4797 ) Implements distributed SGD using distributed PyTorch.	2019-06-01 21:39:22 -07:00
Eric Liang	1c073e92e4	[rllib] Fix documentation on custom policies (#4910 ) * wip * add docs * lint * todo sections * fix doc	2019-06-01 16:13:21 +08:00
Eric Liang	d7be5a5d36	[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO	2019-05-27 17:24:45 -07:00
Devin Petersohn	a7d01aba9b	Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847 )	2019-05-24 16:49:13 -07:00
Eric Liang	351753aae5	[rllib] Remove dependency on TensorFlow (#4764 ) * remove hard tf dep * add test * comment fix * fix test	2019-05-10 20:36:18 -07:00
Devin Petersohn	edb8465910	[ray-core] Initial addition of performance integration testing files (#4325 )	2019-05-08 13:40:54 -07:00
Eric Liang	ce66a552bf	Move large mem test to end (#4664 )	2019-04-19 11:43:22 -07:00
Eric Liang	3fd9dea721	[rllib] Fix tune.run(Agent class) (#4630 ) * update * Update __init__.py	2019-04-15 09:12:23 -07:00
cfan	bb207a205b	[rllib] Support torch device and distributions. (#4553 )	2019-04-12 11:39:14 -07:00
Eric Liang	4f46d3e9bf	[rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554 )	2019-04-09 00:36:49 -07:00
ctombumila37	7746d20d30	[rllib] ExternalMultiAgentEnv (#4200 )	2019-04-06 19:58:14 -07:00
Eric Liang	0d94f3eeef	[rllib] Improve datapath throughput of IMPALA / APPO (#4324 )	2019-03-31 12:25:52 -07:00
bjg2	77005d1814	[rllib] Make batch timeout for remote workers tunable (#4435 )	2019-03-29 13:19:42 -07:00
Eric Liang	2ffe67c5c3	[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478 )	2019-03-29 12:44:23 -07:00
Eric Liang	8ee240f40e	[rllib] Use 64-byte aligned memory when concatenating arrays (#4408 )	2019-03-25 23:56:51 -07:00
Eric Liang	57c1aeb427	[rllib] Use suppress_output instead of run_silent.sh script for tests (#4386 ) * fix * enable custom loss * Update run_rllib_tests.sh * enable tests * fix action prob * Update suppress_output * fix example * fix	2019-03-21 00:15:24 -07:00
Eric Liang	a45019d98c	[rllib] Add option to proceed even if some workers crashed (#4376 )	2019-03-16 13:34:09 -07:00
Eric Liang	d5f4698305	[tune] Avoid scheduler blocking, add reuse_actors optimization (#4218 )	2019-03-12 23:49:31 -07:00
Stefan Pantic	2202a81773	Fix multi discrete (#4338 ) * Revert "Revert "[wingman -> rllib] IMPALA MultiDiscrete changes (#3967)" (#4332)" This reverts commit `3c41cb9b60`. * Fix a bug with log rhos for vtrace * Reformat * lint	2019-03-12 20:32:11 -07:00
Eric Liang	3c41cb9b60	Revert "[wingman -> rllib] IMPALA MultiDiscrete changes (#3967 )" (#4332 ) This reverts commit `962b17f567`.	2019-03-11 22:51:26 -07:00
Eric Liang	c7f74dbdc7	[rllib] Add async remote workers (#4253 )	2019-03-08 15:39:48 -08:00
Robert Nishihara	fd2d8c2c06	Remove Jenkins backend tests and add new long running stress test. (#4288 )	2019-03-08 15:29:39 -08:00
Yuhong Guo	d5fb7b70a9	Update arrow version to fix plasma bugs (#4127 ) * Update arrow * Change to 2c511979b13b230e73a179dab1d55b03cd81ec02 which is rebased on Arrow 46f75d7 * Update to fix comment * disable tests which use python/ray/rllib/tests/data/cartpole_small * Fix get order of meta and data in MockObjectStore.java	2019-03-08 18:03:58 +08:00
Eric Liang	437459f40a	[build] Make travis logs not as long (#4213 ) * clean it up * Update .travis.yml * Update .travis.yml * update * fix example * suppress * timeout * print periodic progress * Update suppress_output * Update run_silent.sh * Update suppress_output * Update suppress_output * manually do timeout * sleep 300 * fix test * Update run_silent.sh * Update suppress_output * Update .travis.yml	2019-03-07 12:09:03 -08:00
Eric Liang	b0332551dd	[rllib] Fix APPO + continuous spaces, feed prev_rew/act to A3C properly (#4286 )	2019-03-06 21:36:26 -08:00
Eric Liang	30bf8e46c7	[rllib] Use nested scope in custom loss example	2019-03-04 18:29:22 -08:00

1 2 3

108 commits