hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Author	SHA1	Message	Date
Richard Liaw	94e2fcea2e	[sgd] fp16 (apex) and scheduler support + move examples page (#7061 ) * Init fp16 * fp16 and schedulers * scheduler linking and fp16 * to fp16 * loss scaling and documentation * more documentation * add tests, refactor config * moredocs * more docs * fix logo, add test mode, add fp16 flag * fix tests * fix scheduler * fix apex * improve safety * fix tests * fix tests * remove pin memory default * rm * fix * Update doc/examples/doc_code/raysgd_torch_signatures.py * fix * migrate changes from other PR * ok thanks * pass * signatures * lint' * Update python/ray/experimental/sgd/pytorch/utils.py * Apply suggestions from code review Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com> * should address most comments * comments * fix this ci * fix tests' * testmode Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>	2020-02-16 19:04:08 -08:00
Sven Mika	2e60f0d4d8	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178 ) * commit * comment	2020-02-15 14:50:44 -08:00
Sven Mika	5518a738b3	[RLlib] Fix erroneous use of LinearSchedule (in DDPG's exploration annealing). (#7125 ) * Fix erroneous use of LinearSchedule (in DDPG's exploration annealing). Erase schedules_obsoleted.py. * Trigger re-test. * Re-test.	2020-02-12 23:46:49 -08:00
Sven Mika	6e1c3ea824	[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974 )	2020-02-10 15:22:07 -08:00
Eric Liang	fbc545c03b	[rllib] Support parallel, parameterized evaluation (#6981 ) * eval api * update * sync eval filters * sync fix * docs * update * docs * update * link * nit * doc updates * format	2020-02-01 22:12:12 -08:00
Richard Liaw	037aa2b961	[sgd] Refactor PyTorch SGD Documentation. (#6910 ) * Refactor documentation and directory structurre * update loss * ,ore examples * fix comments * more code * svgs * formatting * more_docs * more writing * comments ready * move * whitespace * examples * fix * bold * pytorch * batch * fix * fix test * Apply suggestions from code review * quarantinegp * tests/ * fix missing	2020-01-29 08:51:01 -08:00
Sven Mika	446cbdf2e0	[RLlib] Fix issue (bug): LSTM + non-shared vf + PPO + tuple actions (#6890 ) * Add `RandomEnv` example to examples folder. Convert warning into Error message when using an LSTM in a non-shared-vf network (after the warning, the program would crash). * LINT. * Fix issue #6884. LSTM + non-shared vf NN + PPO crashes when using a Tuple action space. * LINT * Change warning message for Model: shared_vf=False, LSTM=True cases. * Bug fix. * Add examples/random_env.py test to Jenkins.	2020-01-24 10:29:35 -08:00
Sven Mika	ae9a3a2237	[RLlib] from_config util method for framework agnostic components; start moving RLlib tests into Bazel. (#6865 )	2020-01-22 17:02:58 -08:00
Sven Mika	c957ed58ed	[RLlib] Implement PPO torch version. (#6826 )	2020-01-20 23:06:50 -08:00
Sven	60d4d5e1aa	Remove future imports (#6724 ) * Remove all __future__ imports from RLlib. * Remove (object) again from tf_run_builder.py::TFRunBuilder. * Fix 2xLINT warnings. * Fix broken appo_policy import (must be appo_tf_policy) * Remove future imports from all other ray files (not just RLlib). * Remove future imports from all other ray files (not just RLlib). * Remove future import blocks that contain `unicode_literals` as well. Revert appo_tf_policy.py to appo_policy.py (belongs to another PR). * Add two empty lines before Schedule class. * Put back __future__ imports into determine_tests_to_run.py. Fails otherwise on a py2/print related error.	2020-01-09 00:15:48 -08:00
Sven	f1b56fa5ee	PG unify/cleanup tf vs torch and PG functionality test cases (tf + torch). (#6650 ) * Unifying the code for PGTrainer/Policy wrt tf vs torch. Adding loss function test cases for the PGAgent (confirm equivalence of tf and torch). * Fix LINT line-len errors. * Fix LINT errors. * Fix `tf_pg_policy` imports (formerly: `pg_policy`). * Rename tf_pg_... into pg_tf_... following <alg>_<framework>_... convention, where ...=policy/loss/agent/trainer. Retire `PGAgent` class (use PGTrainer instead). * - Move PG test into agents/pg/tests directory. - All test cases will be located near the classes that are tested and then built into the Bazel/Travis test suite. * Moved post_process_advantages into pg.py (from pg_tf_policy.py), b/c the function is not a tf-specific one. * Fix remaining import errors for agents/pg/... * Fix circular dependency in pg imports. * Add pg tests to Jenkins test suite.	2020-01-02 16:08:03 -08:00
Richard Liaw	5719a05757	[sgd] Add support for multi-model multi-optimizer training (#6317 )	2019-12-15 15:19:45 -08:00
Yuhao Yang	ad4da17899	[Tune] Add example and tutorial for DCGAN (#6400 )	2019-12-13 14:15:44 -08:00
Eric Liang	be5dd8eb5e	Enable direct calls by default (#6367 ) * wip * add * timeout fix * const ref * comments * fix * fix * Move actor state into actor handle * comments 2 * enable by default * temp reorder * some fixes * add debug code * tmp * fix * wip * remove dbg * fix compile * fix * fix check * remove non direct tests * Increment ref count before resolving value * rename * fix another bug * tmp * tmp * Fix object pinning * build change * lint * ActorManager * tmp * ActorManager * fix test component failures * Remove old code * Remove unused * fix * fix * fix resources * fix advanced * eric's diff * blacklist * blacklist * cleanup * annotate * disable tests for now * remove * fix * fix * clean up verbosity * fix test * fix concurrency test * Update .travis.yml * Update .travis.yml * Update .travis.yml * split up analysis suite * split up trial runner suite * fix detached direct actors * fix * split up advanced tesT * lint * fix core worker test hang * fix bad check fail which breaks test_cluster.py in tune * fix some minor diffs in test_cluster * less workers * make less stressful * split up test * retry flaky tests * remove old test flags * fixes * lint * Update worker_pool.cc * fix race * fix * fix bugs in node failure handling * fix race condition * fix bugs in node failure handling * fix race condition * nits * fix test * disable heartbeatS * disable heartbeatS * fix * fix * use worker id * fix max fail * debug exit * fix merge, and apply [PATCH] fix concurrency test * [patch] fix core worker test hang * remove NotifyActorCreation, and return worker on completion of actor creation task * remove actor diied callback * Update core_worker.cc * lint * use task manager * fix merge * fix deadlock * wip * merge conflits * fix * better sysexit handling * better sysexit handling * better sysexit handling * check id * better debug * task failed msg * task failed msg * retry failed tasks with delay * retry failed tasks with delay * clip deps * fix * fix core worker tests * fix task manager test * fix all tests * cleanup * set to 0 for direct tests * dont check worker id for ownership rpc * dont check worker id for ownership rpc * debug messages * add comment * remove debug statements * nit * check worker id * fix test * owner * fix tests	2019-12-13 13:58:04 -08:00
Victor Le	4e24c805ee	AlphaZero and Ranked reward implementation (#6385 )	2019-12-07 12:08:40 -08:00
Eric Liang	4c6739476b	[rllib] Raise an error if GPUs are enabled but not tf.test.is_gpu_available() (#6365 )	2019-12-05 10:13:54 -08:00
Eric Liang	e5863d7914	Force tune tests to run in direct call mode (#6301 ) * force tune direct mode * force tune * fix * Update run_multi_node_tests.sh	2019-11-27 19:58:33 -08:00
Eric Liang	64a3a7239e	Set RAY_FORCE_DIRECT=1 for run_rllib_tests, test_basic (#6171 )	2019-11-25 14:12:11 -08:00
daiyaanarfeen	8f6d73a93a	[sgd] Extend distributed pytorch functionality (#5675 ) * raysgd * apply fn * double quotes * removed duplicate TimerStat * removed duplicate find_free_port * imports in pytorch_trainer * init doc * ray.experimental * remove resize example * resnet example * cifar * Fix up after kwargs * data_dir and dataloader_workers args * formatting * loss * init * update code * lint * smoketest * better_configs * fix * fix * fix * train_loader * fixdocs * ok * ok * fix * fix_update * fix * fix * done * fix * fix * fix * small * lint * fix * fix * fix_test * fix * validate * fix * fi	2019-11-05 11:16:46 -08:00
Richard Liaw	e94bebb1de	[tune] Fix Jenkins tests (#6028 )	2019-11-01 16:42:04 -07:00
Richard Liaw	48ba484640	[tune] Test TF2.0, TF1.14, TF1.12 Tensorboard support (#5931 )	2019-10-18 13:50:42 -07:00
Richard Liaw	d52a4983af	Update TF documentation (#5918 )	2019-10-16 01:31:27 -07:00
Richard Liaw	9f23620412	[tune] tf2.0 mnist example (#5898 ) * tfmnistexample * tfmnist * add_to_ci * format * exampledownlaod * fix	2019-10-15 22:25:01 -07:00
Richard Liaw	1650f7b174	[tune] Remove TF MNIST example + add TrialRunner hook to execut… (#5868 ) * remove test * add trial runner * remvoerestore * Remove other mnist examples * tunetest * revert * v1 * Revert "v1" This reverts commit c8bddaf2db7a8270c43c02021cac0e75df15ed20. * Revert "revert" This reverts commit b58f56884a0c288d3a6f997d149ab4d496ddd7a3. * errors * format	2019-10-13 20:33:56 -07:00
Eric Liang	04e997fe0d	Fix TF2 / rllib test (#5846 )	2019-10-07 14:25:16 -07:00
Anthony Yu	b99cdf4e39	[tune] PBT + Memnn example (#5723 ) * Add example file * Move into train function * Somewhat working example of MemNN, still has some failed trials * Reorganize into a class * Small fixes * Iteration decrease and fix hyperparam_mutations * Add example file * Move into train function * Somewhat working example of MemNN, still has some failed trials * Reorganize into a class * Small fixes * Iteration decrease and fix hyperparam_mutations * Some style edits * Address PR changes without modifying learning rate * Add configs and hyperparameter mutations * Add tune test * Modify import locations * Some parameter changes for testing * Update memnn example * Add tensorboard support and address PR comment * Final changes * lint * generator	2019-10-05 09:22:37 -07:00
Edward Oakes	443feb75f0	Fix test (#5810 )	2019-09-30 19:39:53 -07:00
Richard Liaw	baf85c6665	[tune/sgd] Fix Jenkins (#5765 )	2019-09-27 09:59:08 -07:00
Richard Liaw	10f21fa313	[docs] Convert Examples to Gallery (#5414 )	2019-09-24 15:46:56 -07:00
Richard Liaw	e00071721a	[tune] tf2.0 testing and supporting callables (#5738 )	2019-09-21 17:01:14 -07:00
jichan3751	1711e202a3	[training] Tensorflow interface for MultiNode SGD (#5440 )	2019-09-03 15:35:42 -07:00
Richard Liaw	411f30c125	[docs] Second push of changes (#5391 )	2019-08-28 17:54:15 -07:00
Eric Liang	97ccd75952	[rllib] Enable object store memory limit by default (#5534 )	2019-08-26 01:37:28 -07:00
gehring	b520f6141e	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436 )	2019-08-23 14:21:11 +08:00
Richard Liaw	cdc9227f1b	[tune] ASHA xgboost and lightgbm examples (#5500 )	2019-08-22 10:37:59 -07:00
Robert Nishihara	851c5b2dae	Add a script for benchmarking performance for Ray developers. (#5472 )	2019-08-19 23:41:23 -07:00
Richard Liaw	d7b309223b	[tune] MLFlow Logger (#5438 )	2019-08-14 15:58:18 -07:00
Lisa Dunlap	b7d0733362	[tune] Implement BOHB (#5382 )	2019-08-13 12:32:07 -07:00
Eric Liang	a1d2e17623	[rllib] Autoregressive action distributions (#5304 )	2019-08-10 14:05:12 -07:00
jichan3751	de95117e96	[sgd] Tune interface for Pytorch MultiNode SGD (#5350 )	2019-08-10 13:51:44 -07:00
Simon Mo	18f1e904de	Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409 )	2019-08-09 11:37:19 -07:00
Eric Liang	592f313210	[rllib] Centralized critic / PPO example on TwoStepGame (#5392 )	2019-08-08 14:03:28 -07:00
Wonseok Jeon	281829e712	MADDPG implementation in RLlib (#5348 )	2019-08-06 16:22:06 -07:00
Eric Liang	5d7afe8092	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
Richard Liaw	1eaa57c98f	[tune] Distributed example + walkthrough (#5157 )	2019-08-02 09:17:20 -07:00
Eric Liang	3bdd114282	[rllib] Better example rnn envs (#5300 )	2019-07-28 14:07:18 -07:00
Eric Liang	a62c5f40f6	[rllib] Document ModelV2 and clean up the models/ directory (#5277 )	2019-07-27 02:08:16 -07:00
Richard Liaw	7e715520e5	[sgd] Example for Training (#5292 )	2019-07-27 01:10:25 -07:00
Eric Liang	f9043cc49a	[rllib] Remove experimental eager support	2019-07-21 12:27:17 -07:00
Jones Wong	0af07bd493	Enable seeding actors for reproducible experiments (#5197 ) * enable graph-level worker-specific seed * lint checked * revised according to eric's suggestions * revised accordingly and added a test case * formated * Update test_reproducibility.py * Update trainer.py * Update rollout_worker.py * Update run_rllib_tests.sh * Update worker_set.py	2019-07-17 23:31:34 -07:00

1 2

92 commits