hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	37208216ae	[rllib] Rename Agent to Trainer (#4556 )	2019-04-07 00:36:18 -07:00
Dušan Josipović	820c71b7d0	[tune/rllib] Add checkpoint eraser (#4490 )	2019-04-06 20:01:54 -07:00
ctombumila37	7746d20d30	[rllib] ExternalMultiAgentEnv (#4200 )	2019-04-06 19:58:14 -07:00
Andrew Tan	991b911e1d	[tune] Add `--columns` flag for CLI (#4564 )	2019-04-05 19:49:01 -07:00
Jérémy	300ec72d15	[tune] Add compatibility to nevergrad 0.2.0+ (#4529 ) This PR prepares for future version 0.2.0 of `nevergrad`, in which each suggestion is a `Candidate` instance having fields `args` and `kwargs` instead of being a `np.ndarray`. The proposed changes are compatible with all versions of `nevergrad` (manually tested with `nevergrad_example.py` on both `master` and current version `v0.1.6`). See `nevergrad`'s [CHANGELOG](https://github.com/facebookresearch/nevergrad/blob/master/CHANGELOG.md) for more information on the change.	2019-04-05 19:44:58 -07:00
Andrew Tan	bfd0af52bc	[tune] Add documentation to --output flag (#4518 ) Add documentation for the `--output` flag for ls / lsx in the Tune CLI. Closes #4511	2019-04-05 00:16:35 -07:00
Richard Liaw	50b2aa0740	[tune] Better handling of tune.function in global checkpoint (#4519 ) Enables result keys to be queried by CLI.	2019-04-04 21:08:47 -07:00
Federico Fontana	fb88f7efe6	Fixed bug in Dirichlet (#4440 ) (#4560 )	2019-04-04 14:33:09 -07:00
Yuhong Guo	c2349cf12d	Remove local/global_scheduler from code and doc. (#4549 )	2019-04-03 17:05:09 -07:00
Adi Zimmerman	51dae23d5c	[tune] Search Alg delay import + CLI timing test (#4230 )	2019-04-03 08:52:45 -07:00
Philipp Moritz	b0f6ddf6d1	Remove CMake files (#4493 )	2019-04-02 22:17:33 -07:00
Hao Chen	23404f7bcf	Fix some flaky tests (#4535 )	2019-04-02 17:57:11 -07:00
Simon Mo	db4cf24636	[serve] Double Serialization Optimization (#4532 )	2019-04-02 12:35:03 -07:00
Eric Liang	55a2d39409	[rllib] Add option for RNN state and value estimates to span episodes (#4429 ) * wip soft horizon * tests	2019-04-02 02:44:15 -07:00
Yuhong Guo	c2c548bdfd	Fix broken pipe callback (#4513 )	2019-04-02 17:42:18 +08:00
Jones Wong	fe7763e786	[rllib] replace the assertion in SyncReplayOptimizer by a warning (#4534 )	2019-04-02 01:43:22 -07:00
opherlieber	60b230b8ad	[rllib] Add support for LR schedule to DQN/APEX (#4473 )	2019-04-01 11:35:34 -07:00
Eric Liang	0d94f3eeef	[rllib] Improve datapath throughput of IMPALA / APPO (#4324 )	2019-03-31 12:25:52 -07:00
Toanngo	dffe19c59c	Update GCP gpu image (#4524 )	2019-03-31 01:01:22 -07:00
Eric Liang	b01ac41e6f	[rllib] Try to call close on envs on stop (#4521 )	2019-03-30 19:36:05 -07:00
Eric Liang	fce0062380	[rllib] Switch to tune.run() instead of run_experiments() (#4515 )	2019-03-30 14:07:50 -07:00
Risto Vuorio	798944fbfa	Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… (#4504 ) * Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4502) * Update dqn_policy_graph.py	2019-03-29 13:31:59 -07:00
Leon Sievers	f4b313eaad	[rllib] Moved clip_action into policy_graph; Clip actions in compute_single_action (#4459 ) * Moved clip_action into policy_graph; Clip actions in compute_single_action * Update policy_graph.py * Changed formatting * Updated codebase for convencience	2019-03-29 13:26:07 -07:00
gehring	5133b10700	Add support for tensorflow resource variables (#4438 ) * Adding support for resource variables Currently resource variable go undetected by the `TensorFlowVariables` since they do not use the same ops for reading values. This change should fix this until a more robust solution is implemented. * fix varhandle	2019-03-29 13:23:05 -07:00
bjg2	77005d1814	[rllib] Make batch timeout for remote workers tunable (#4435 )	2019-03-29 13:19:42 -07:00
Eric Liang	2ffe67c5c3	[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478 )	2019-03-29 12:44:23 -07:00
Eric Liang	09b2961750	[rllib] Ensure stats are consistently reported across all algos (#4445 )	2019-03-27 15:40:15 -07:00
Eric Liang	2871609296	[rllib] Report sampler performance metrics (#4427 )	2019-03-27 13:24:23 -07:00
Andrew Tan	12db684f72	[tune] add filter flag for Tune CLI (#4337 ) Adds filter flag (--filter) to ls / lsx commands for Tune CLI. Usage: `tune ls [path] --filter [column] [operator] [value]` e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`	2019-03-27 11:19:25 -07:00
Robert Nishihara	c6f12e5219	Update documentation from 0.7.0.dev1 to 0.7.0.dev2. (#4485 )	2019-03-26 17:32:53 -07:00
Robert Nishihara	c0e10ef12d	Bump version number from 0.6.5 to 0.7.0.dev2. (#4484 )	2019-03-26 16:44:32 -07:00
Robert Nishihara	8548f12eb2	Give better error when include_webui=1 and webui can't be started. (#4471 )	2019-03-26 14:54:32 -07:00
Eric Liang	cff08e19ff	[rllib] Print out intermediate data shapes on the first iteration (#4426 )	2019-03-26 00:27:59 -07:00
Eric Liang	8ee240f40e	[rllib] Use 64-byte aligned memory when concatenating arrays (#4408 )	2019-03-25 23:56:51 -07:00
Vlad Firoiu	c68eea6134	[rllib] More efficient tuple flattening. (#4416 ) * More efficient tuple flattening. * Preprocessor.write uses transform by default. * lint * to array * Update test_catalog.py * Update test_catalog.py	2019-03-25 16:00:33 -07:00
Richard Liaw	a275af337e	[tune] Make examples more verbose (#4469 ) Verbosity defaults to "1", so here we default verbosity for a couple examples. Fixes #4467	2019-03-25 15:13:17 -07:00
Eric Liang	5b8eb475ce	[rllib] Allow None to be specified in multi-agent envs (#4464 ) * wip * check * doc update * Update hierarchical_training.py	2019-03-25 11:38:17 -07:00
William Ma	11580fb7dc	Changes where actor resources are assigned (#4323 )	2019-03-24 15:49:36 -07:00
Eric Liang	01699ce4ea	[rllib] Fix race condition with multiple data loaders, fix stats	2019-03-23 20:17:01 -07:00
Robert Nishihara	01747b11a1	Bump version from 0.7.0.dev1 to 0.6.5. (#4461 )	2019-03-22 15:03:29 -07:00
Richard Liaw	32bf23d24f	[tune] Remove output of tests	2019-03-22 10:48:03 -07:00
Leon Sievers	b21c20c9a6	[rllib] Added missing action clipping for rollout example script (#4413 ) * Added action clipping for rollout example script * Used action_clipping from sampler * Fixed and improved naming	2019-03-22 00:51:27 -07:00
Eric Liang	4b8b703561	[rllib] Some API cleanups and documentation improvements (#4409 )	2019-03-21 21:34:22 -07:00
Ion	59079a799c	Signal actor failure (#4196 )	2019-03-21 15:17:42 -07:00
Kai Yang	c36d03874b	Redis returns OK when removing a non-existent set entry (#4434 )	2019-03-21 11:59:15 -07:00
Eric Liang	57c1aeb427	[rllib] Use suppress_output instead of run_silent.sh script for tests (#4386 ) * fix * enable custom loss * Update run_rllib_tests.sh * enable tests * fix action prob * Update suppress_output * fix example * fix	2019-03-21 00:15:24 -07:00
Hao Chen	d03999d01e	Cross-language invocation Part 1: Java calling Python functions and actors (#4166 )	2019-03-21 13:34:21 +08:00
Richard Liaw	828dc08ac8	[tune] Fix tests for Function API for better consistency (#4421 )	2019-03-20 22:31:38 -07:00
Robert Nishihara	9c158c6a87	Start dashboard on all nodes and other small fixes. (#4428 ) * Start reporter on all nodes. * More fixes	2019-03-20 13:04:06 -07:00
Stephanie Wang	4ac9c1ed6e	Fix bug in cluster mode where driver exits when there are tasks in the waiting queue (#4251 )	2019-03-20 10:18:27 -07:00

1294 commits