2741 commits

Author SHA1 Message Date
Eric Liang
55a2d39409
[rllib] Add option for RNN state and value estimates to span episodes (#4429)
* wip soft horizon

* tests
2019-04-02 02:44:15 -07:00
Yuhong Guo
c2c548bdfd Fix broken pipe callback (#4513) 2019-04-02 17:42:18 +08:00
bibabolynn
20c7b2a6eb [Java] TestNG outputs more verbose error messages (#4507)
2019-04-02 17:41:20 +08:00
Jones Wong
fe7763e786 [rllib] replace the assertion in SyncReplayOptimizer by a warning (#4534) 2019-04-02 01:43:22 -07:00
opherlieber
60b230b8ad [rllib] Add support for LR schedule to DQN/APEX (#4473) 2019-04-01 11:35:34 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO (#4324) 2019-03-31 12:25:52 -07:00
Toanngo
dffe19c59c Update GCP gpu image (#4524) 2019-03-31 01:01:22 -07:00
Eric Liang
b01ac41e6f
[rllib] Try to call close on envs on stop (#4521) 2019-03-30 19:36:05 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() (#4515) 2019-03-30 14:07:50 -07:00
ppeagle
5efb21e1d0 Initial commit for Ray streaming (#4268) 2019-03-30 19:32:05 +08:00
Eric Liang
e5bcae52f5
Add lint advisory to PR template 2019-03-29 16:49:02 -07:00
Risto Vuorio
798944fbfa Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… (#4504)
* Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4502)

* Update dqn_policy_graph.py
2019-03-29 13:31:59 -07:00
Leon Sievers
f4b313eaad [rllib] Moved clip_action into policy_graph; Clip actions in compute_single_action (#4459)
* Moved clip_action into policy_graph; Clip actions in compute_single_action

* Update policy_graph.py

* Changed formatting

* Updated codebase for convenience
2019-03-29 13:26:07 -07:00
gehring
5133b10700 Add support for tensorflow resource variables (#4438)
* Adding support for resource variables

Currently, resource variables go undetected by `TensorFlowVariables` because they do not use the same ops for reading values. This change should fix the problem until a more robust solution is implemented.

* fix varhandle
2019-03-29 13:23:05 -07:00
bjg2
77005d1814 [rllib] Make batch timeout for remote workers tunable (#4435) 2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478) 2019-03-29 12:44:23 -07:00
bibabolynn
ab55a1f93a [Java] Clean up outdated dependencies (#4489) 2019-03-28 14:33:45 +08:00
Philipp Moritz
1bcb0b94cc Synchronize arrow version and put changes upstream (#4385) 2019-03-27 22:37:07 -07:00
Robert Nishihara
a22bf1e511 Minor aesthetic changes to python file. (#4492) 2019-03-28 10:59:29 +08:00
Eric Liang
09b2961750
[rllib] Ensure stats are consistently reported across all algos (#4445) 2019-03-27 15:40:15 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics (#4427) 2019-03-27 13:24:23 -07:00
Andrew Tan
12db684f72 [tune] add filter flag for Tune CLI (#4337)
## What do these changes do?

Adds filter flag (--filter) to ls / lsx commands for Tune CLI.

Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
2019-03-27 11:19:25 -07:00
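The `--filter [column] [operator] [value]` usage in the commit above can be illustrated with a minimal sketch of how such a filter might be evaluated against tabular rows. This is not Tune's implementation: the operator table, the `_coerce` and `filter_rows` helpers, and the sample rows are all hypothetical and exist only for illustration.

```python
import operator

# Hypothetical operator table; the operators the Tune CLI actually accepts may differ.
OPERATORS = {
    "==": operator.eq,
    "!=": operator.ne,
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
}


def _coerce(value):
    """Interpret a value as a number when possible, otherwise return it unchanged."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return value


def filter_rows(rows, column, op, value):
    """Keep the rows whose `column` satisfies `<op> value`."""
    compare = OPERATORS[op]
    target = _coerce(value)
    return [
        row for row in rows
        if column in row and compare(_coerce(row[column]), target)
    ]


# Made-up rows standing in for the listing that `tune lsx` prints.
experiments = [
    {"name": "exp_a", "total_trials": 1},
    {"name": "exp_b", "total_trials": 4},
]

# Mirrors `--filter total_trials == 1` from the usage line above.
matching = filter_rows(experiments, "total_trials", "==", "1")
```

Command-line values arrive as strings, so the sketch converts both sides to numbers where it can before comparing them.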
Robert Nishihara
c6f12e5219 Update documentation from 0.7.0.dev1 to 0.7.0.dev2. (#4485) 2019-03-26 17:32:53 -07:00
Robert Nishihara
c0e10ef12d Bump version number from 0.6.5 to 0.7.0.dev2. (#4484) 2019-03-26 16:44:32 -07:00
Robert Nishihara
8548f12eb2 Give better error when include_webui=1 and webui can't be started. (#4471) 2019-03-26 14:54:32 -07:00
bibabolynn
7a9d1546d4 [java] Fix getWorker and add create multi actors test (#4472) 2019-03-26 20:26:13 +08:00
Wang Qing
7d70cfba6e [Java] Fix loading custom classes from jars (#4475) 2019-03-26 20:15:08 +08:00
Eric Liang
cff08e19ff
[rllib] Print out intermediate data shapes on the first iteration (#4426) 2019-03-26 00:27:59 -07:00
Eric Liang
8ee240f40e
[rllib] Use 64-byte aligned memory when concatenating arrays (#4408) 2019-03-25 23:56:51 -07:00
Vlad Firoiu
c68eea6134 [rllib] More efficient tuple flattening. (#4416)
* More efficient tuple flattening.

* Preprocessor.write uses transform by default.

* lint

* to array

* Update test_catalog.py

* Update test_catalog.py
2019-03-25 16:00:33 -07:00
Richard Liaw
a275af337e
[tune] Make examples more verbose (#4469)
## What do these changes do?
Verbosity defaults to "1", so here we set a higher verbosity explicitly in a
couple of examples.

## Related issue number

Fixes #4467
2019-03-25 15:13:17 -07:00
Eric Liang
5b8eb475ce
[rllib] Allow None to be specified in multi-agent envs (#4464)
* wip

* check

* doc update

* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
William Ma
11580fb7dc Changes where actor resources are assigned (#4323) 2019-03-24 15:49:36 -07:00
Eric Liang
01699ce4ea
[rllib] Fix race condition with multiple data loaders, fix stats 2019-03-23 20:17:01 -07:00
Hao Chen
7a38f9be1c
Avoid redundant bazel build (#4458) 2019-03-23 10:44:11 +08:00
Robert Nishihara
01747b11a1 Bump version from 0.7.0.dev1 to 0.6.5. (#4461) 2019-03-22 15:03:29 -07:00
Richard Liaw
32bf23d24f [tune] Remove output of tests 2019-03-22 10:48:03 -07:00
Hao Chen
80cd9c9c1a
[travis] Add back '-v' option to pytest and install psutil (#4430) 2019-03-22 17:45:55 +08:00
Leon Sievers
b21c20c9a6 [rllib] Added missing action clipping for rollout example script (#4413)
* Added action clipping for rollout example script

* Used action_clipping from sampler

* Fixed and improved naming
2019-03-22 00:51:27 -07:00
Ruifang Chen
59d74d5e92 [Java] Build Java code with Bazel (#4284) 2019-03-22 14:30:05 +08:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements (#4409) 2019-03-21 21:34:22 -07:00
Ion
59079a799c Signal actor failure (#4196) 2019-03-21 15:17:42 -07:00
Kai Yang
c36d03874b Redis returns OK when removing a non-existent set entry (#4434) 2019-03-21 11:59:15 -07:00
Eric Liang
57c1aeb427
[rllib] Use suppress_output instead of run_silent.sh script for tests (#4386)
* fix

* enable custom loss

* Update run_rllib_tests.sh

* enable tests

* fix action prob

* Update suppress_output

* fix example

* fix
2019-03-21 00:15:24 -07:00
Hao Chen
d03999d01e
Cross-language invocation Part 1: Java calling Python functions and actors (#4166) 2019-03-21 13:34:21 +08:00
Richard Liaw
828dc08ac8
[tune] Fix tests for Function API for better consistency (#4421) 2019-03-20 22:31:38 -07:00
Philipp Moritz
80ef8c19aa Add initial news reader example (#4348) 2019-03-20 18:47:12 -07:00
Robert Nishihara
9c158c6a87 Start dashboard on all nodes and other small fixes. (#4428)
* Start reporter on all nodes.

* More fixes
2019-03-20 13:04:06 -07:00
Stephanie Wang
4ac9c1ed6e Fix bug in cluster mode where driver exits when there are tasks in the waiting queue (#4251) 2019-03-20 10:18:27 -07:00
Yuhong Guo
8ce7565530 Refactor pytest fixtures for ray core (#4390) 2019-03-20 11:48:32 +08:00