Commit graph

1294 commits

Author SHA1 Message Date
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer (#4556) 2019-04-07 00:36:18 -07:00
Dušan Josipović
820c71b7d0 [tune/rllib] Add checkpoint eraser (#4490) 2019-04-06 20:01:54 -07:00
ctombumila37
7746d20d30 [rllib] ExternalMultiAgentEnv (#4200) 2019-04-06 19:58:14 -07:00
Andrew Tan
991b911e1d [tune] Add --columns flag for CLI (#4564) 2019-04-05 19:49:01 -07:00
Jérémy
300ec72d15 [tune] Add compatibility to nevergrad 0.2.0+ (#4529)
## What do these changes do?

This PR prepares for future version  0.2.0 of `nevergrad`, in which each suggestion is a `Candidate` instance having fields `args` and `kwargs` instead of being a `np.ndarray`. The proposed changes are compatible with all versions of `nevergrad` (manually tested with `nevergrad_example.py` on both `master` and current version `v0.1.6`).

See `nevergrad`'s [CHANGELOG](https://github.com/facebookresearch/nevergrad/blob/master/CHANGELOG.md) for more information on the change.

## Related issue number

None

## Linter

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 19:44:58 -07:00
Andrew Tan
bfd0af52bc [tune] Add documentation to --output flag (#4518)
## What do these changes do?

Add documentation for the `--output` flag for ls / lsx in the Tune CLI.

## Related issue number

Closes #4511 

## Linter

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Richard Liaw
50b2aa0740
[tune] Better handling of tune.function in global checkpoint (#4519)
Enables result keys to be queried by CLI.
2019-04-04 21:08:47 -07:00
Federico Fontana
fb88f7efe6 Fixed bug in Dirichlet (#4440) (#4560) 2019-04-04 14:33:09 -07:00
Yuhong Guo
c2349cf12d Remove local/global_scheduler from code and doc. (#4549) 2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c [tune] Search Alg delay import + CLI timing test (#4230) 2019-04-03 08:52:45 -07:00
Philipp Moritz
b0f6ddf6d1 Remove CMake files (#4493) 2019-04-02 22:17:33 -07:00
Hao Chen
23404f7bcf Fix some flaky tests (#4535) 2019-04-02 17:57:11 -07:00
Simon Mo
db4cf24636 [serve] Double Serialization Optimization (#4532) 2019-04-02 12:35:03 -07:00
Eric Liang
55a2d39409
[rllib] Add option for RNN state and value estimates to span episodes (#4429)
* wip soft horizon

* tests
2019-04-02 02:44:15 -07:00
Yuhong Guo
c2c548bdfd Fix broken pipe callback (#4513) 2019-04-02 17:42:18 +08:00
Jones Wong
fe7763e786 [rllib] replace the assertion in SyncReplayOptimizer by a warning (#4534) 2019-04-02 01:43:22 -07:00
opherlieber
60b230b8ad [rllib] Add support for LR schedule to DQN/APEX (#4473) 2019-04-01 11:35:34 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO (#4324) 2019-03-31 12:25:52 -07:00
Toanngo
dffe19c59c Update GCP gpu image (#4524) 2019-03-31 01:01:22 -07:00
Eric Liang
b01ac41e6f
[rllib] Try to call close on envs on stop (#4521) 2019-03-30 19:36:05 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() (#4515) 2019-03-30 14:07:50 -07:00
Risto Vuorio
798944fbfa Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… (#4504)
* Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4502)

* Update dqn_policy_graph.py
2019-03-29 13:31:59 -07:00
Leon Sievers
f4b313eaad [rllib] Moved clip_action into policy_graph; Clip actions in compute_single_action (#4459)
* Moved clip_action into policy_graph; Clip actions in compute_single_action

* Update policy_graph.py

* Changed formatting

* Updated codebase for convencience
2019-03-29 13:26:07 -07:00
gehring
5133b10700 Add support for tensorflow resource variables (#4438)
* Adding support for resource variables

Currently resource variable go undetected by the `TensorFlowVariables` since they do not use the same ops for reading values. This change should fix this until a more robust solution is implemented.

* fix varhandle
2019-03-29 13:23:05 -07:00
bjg2
77005d1814 [rllib] Make batch timeout for remote workers tunable (#4435) 2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478) 2019-03-29 12:44:23 -07:00
Eric Liang
09b2961750
[rllib] Ensure stats are consistently reported across all algos (#4445) 2019-03-27 15:40:15 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics (#4427) 2019-03-27 13:24:23 -07:00
Andrew Tan
12db684f72 [tune] add filter flag for Tune CLI (#4337)
## What do these changes do?

Adds filter flag (--filter) to ls / lsx commands for Tune CLI.

Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
2019-03-27 11:19:25 -07:00
Robert Nishihara
c6f12e5219 Update documentation from 0.7.0.dev1 to 0.7.0.dev2. (#4485) 2019-03-26 17:32:53 -07:00
Robert Nishihara
c0e10ef12d Bump version number from 0.6.5 to 0.7.0.dev2. (#4484) 2019-03-26 16:44:32 -07:00
Robert Nishihara
8548f12eb2 Give better error when include_webui=1 and webui can't be started. (#4471) 2019-03-26 14:54:32 -07:00
Eric Liang
cff08e19ff
[rllib] Print out intermediate data shapes on the first iteration (#4426) 2019-03-26 00:27:59 -07:00
Eric Liang
8ee240f40e
[rllib] Use 64-byte aligned memory when concatenating arrays (#4408) 2019-03-25 23:56:51 -07:00
Vlad Firoiu
c68eea6134 [rllib] More efficient tuple flattening. (#4416)
* More efficient tuple flattening.

* Preprocessor.write uses transform by default.

* lint

* to array

* Update test_catalog.py

* Update test_catalog.py
2019-03-25 16:00:33 -07:00
Richard Liaw
a275af337e
[tune] Make examples more verbose (#4469)
## What do these changes do?
Verbosity defaults to "1", so here we default verbosity for a couple
examples.

## Related issue number

Fixes #4467
2019-03-25 15:13:17 -07:00
Eric Liang
5b8eb475ce
[rllib] Allow None to be specified in multi-agent envs (#4464)
* wip

* check

* doc update

* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
William Ma
11580fb7dc Changes where actor resources are assigned (#4323) 2019-03-24 15:49:36 -07:00
Eric Liang
01699ce4ea
[rllib] Fix race condition with multiple data loaders, fix stats 2019-03-23 20:17:01 -07:00
Robert Nishihara
01747b11a1 Bump version from 0.7.0.dev1 to 0.6.5. (#4461) 2019-03-22 15:03:29 -07:00
Richard Liaw
32bf23d24f [tune] Remove output of tests 2019-03-22 10:48:03 -07:00
Leon Sievers
b21c20c9a6 [rllib] Added missing action clipping for rollout example script (#4413)
* Added action clipping for rollout example script

* Used action_clipping from sampler

* Fixed and improved naming
2019-03-22 00:51:27 -07:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements (#4409) 2019-03-21 21:34:22 -07:00
Ion
59079a799c Signal actor failure (#4196) 2019-03-21 15:17:42 -07:00
Kai Yang
c36d03874b Redis returns OK when removing a non-existent set entry (#4434) 2019-03-21 11:59:15 -07:00
Eric Liang
57c1aeb427
[rllib] Use suppress_output instead of run_silent.sh script for tests (#4386)
* fix

* enable custom loss

* Update run_rllib_tests.sh

* enable tests

* fix action prob

* Update suppress_output

* fix example

* fix
2019-03-21 00:15:24 -07:00
Hao Chen
d03999d01e
Cross-language invocation Part 1: Java calling Python functions and actors (#4166) 2019-03-21 13:34:21 +08:00
Richard Liaw
828dc08ac8
[tune] Fix tests for Function API for better consistency (#4421) 2019-03-20 22:31:38 -07:00
Robert Nishihara
9c158c6a87 Start dashboard on all nodes and other small fixes. (#4428)
* Start reporter on all nodes.

* More fixes
2019-03-20 13:04:06 -07:00
Stephanie Wang
4ac9c1ed6e Fix bug in cluster mode where driver exits when there are tasks in the waiting queue (#4251) 2019-03-20 10:18:27 -07:00