Eric Liang
37208216ae
[rllib] Rename Agent to Trainer ( #4556 )
2019-04-07 00:36:18 -07:00
Dušan Josipović
820c71b7d0
[tune/rllib] Add checkpoint eraser ( #4490 )
2019-04-06 20:01:54 -07:00
ctombumila37
7746d20d30
[rllib] ExternalMultiAgentEnv ( #4200 )
2019-04-06 19:58:14 -07:00
Andrew Tan
991b911e1d
[tune] Add --columns flag for CLI ( #4564 )
2019-04-05 19:49:01 -07:00
Jérémy
300ec72d15
[tune] Add compatibility to nevergrad 0.2.0+ ( #4529 )
...
## What do these changes do?
This PR prepares for the upcoming version 0.2.0 of `nevergrad`, in which each suggestion is a `Candidate` instance with `args` and `kwargs` fields instead of an `np.ndarray`. The proposed changes are compatible with both the current release and the upcoming one (manually tested with `nevergrad_example.py` on `nevergrad`'s `master` and on the current version `v0.1.6`).
See `nevergrad`'s [CHANGELOG](https://github.com/facebookresearch/nevergrad/blob/master/CHANGELOG.md) for more information on the change.
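A minimal sketch of how one code path could accept both suggestion styles. This is an illustration, not the code from this PR. `FakeCandidate` and `suggestion_to_config` are hypothetical names, and the sketch assumes that a new-style suggestion carries its point as the first element of `args`.

```python
from collections import namedtuple

# Hypothetical stand-in for the new-style suggestion object; only the two
# fields named in the description above (`args`, `kwargs`) are modelled.
FakeCandidate = namedtuple("FakeCandidate", ["args", "kwargs"])


def suggestion_to_config(parameter_names, suggestion):
    """Map a suggestion of either style onto named parameters."""
    if hasattr(suggestion, "args"):
        # New style. Assumption: the point is the first positional argument.
        point = suggestion.args[0]
    else:
        # Old style: the suggestion is itself the array-like point.
        point = suggestion
    return dict(zip(parameter_names, point))


names = ["lr", "momentum"]
old_style = [0.1, 0.9]  # a plain list stands in for an np.ndarray
new_style = FakeCandidate(args=([0.1, 0.9],), kwargs={})

old_config = suggestion_to_config(names, old_style)
new_config = suggestion_to_config(names, new_style)
```

Checking for the attribute rather than the installed version keeps the branch independent of how the library reports its version.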
## Related issue number
None
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 19:44:58 -07:00
Andrew Tan
bfd0af52bc
[tune] Add documentation to --output flag ( #4518 )
...
## What do these changes do?
Add documentation for the `--output` flag for ls / lsx in the Tune CLI.
## Related issue number
Closes #4511
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Richard Liaw
50b2aa0740
[tune] Better handling of tune.function in global checkpoint ( #4519 )
...
Enables result keys to be queried by CLI.
2019-04-04 21:08:47 -07:00
Federico Fontana
fb88f7efe6
Fixed bug in Dirichlet ( #4440 ) ( #4560 )
2019-04-04 14:33:09 -07:00
Yuhong Guo
c2349cf12d
Remove local/global_scheduler from code and doc. ( #4549 )
2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c
[tune] Search Alg delay import + CLI timing test ( #4230 )
2019-04-03 08:52:45 -07:00
Philipp Moritz
b0f6ddf6d1
Remove CMake files ( #4493 )
2019-04-02 22:17:33 -07:00
Hao Chen
23404f7bcf
Fix some flaky tests ( #4535 )
2019-04-02 17:57:11 -07:00
Simon Mo
db4cf24636
[serve] Double Serialization Optimization ( #4532 )
2019-04-02 12:35:03 -07:00
Eric Liang
55a2d39409
[rllib] Add option for RNN state and value estimates to span episodes ( #4429 )
...
* wip soft horizon
* tests
2019-04-02 02:44:15 -07:00
Yuhong Guo
c2c548bdfd
Fix broken pipe callback ( #4513 )
2019-04-02 17:42:18 +08:00
Jones Wong
fe7763e786
[rllib] replace the assertion in SyncReplayOptimizer by a warning ( #4534 )
2019-04-02 01:43:22 -07:00
opherlieber
60b230b8ad
[rllib] Add support for LR schedule to DQN/APEX ( #4473 )
2019-04-01 11:35:34 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO ( #4324 )
2019-03-31 12:25:52 -07:00
Toanngo
dffe19c59c
Update GCP gpu image ( #4524 )
2019-03-31 01:01:22 -07:00
Eric Liang
b01ac41e6f
[rllib] Try to call close on envs on stop ( #4521 )
2019-03-30 19:36:05 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() ( #4515 )
2019-03-30 14:07:50 -07:00
Risto Vuorio
798944fbfa
Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… ( #4504 )
...
* Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4502 )
* Update dqn_policy_graph.py
2019-03-29 13:31:59 -07:00
Leon Sievers
f4b313eaad
[rllib] Moved clip_action into policy_graph; Clip actions in compute_single_action ( #4459 )
...
* Moved clip_action into policy_graph; Clip actions in compute_single_action
* Update policy_graph.py
* Changed formatting
* Updated codebase for convenience
2019-03-29 13:26:07 -07:00
gehring
5133b10700
Add support for tensorflow resource variables ( #4438 )
...
* Adding support for resource variables
Currently, resource variables go undetected by `TensorFlowVariables` because they do not use the same ops for reading values. This change should fix that until a more robust solution is implemented.
* fix varhandle
2019-03-29 13:23:05 -07:00
bjg2
77005d1814
[rllib] Make batch timeout for remote workers tunable ( #4435 )
2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs ( #4478 )
2019-03-29 12:44:23 -07:00
Eric Liang
09b2961750
[rllib] Ensure stats are consistently reported across all algos ( #4445 )
2019-03-27 15:40:15 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics ( #4427 )
2019-03-27 13:24:23 -07:00
Andrew Tan
12db684f72
[tune] add filter flag for Tune CLI ( #4337 )
...
## What do these changes do?
Adds filter flag (--filter) to ls / lsx commands for Tune CLI.
Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
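A rough sketch of what a `[column] [operator] [value]` filter does to a table of rows. This is an illustration under assumptions, not the CLI's implementation. `OPERATORS` and `filter_rows` are hypothetical names, and only `==` is confirmed by the example above; the other operators are assumed.

```python
import operator

# Assumed operator table; only "==" appears in the usage example above.
OPERATORS = {
    "==": operator.eq,
    "!=": operator.ne,
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
}


def filter_rows(rows, column, op, value):
    """Keep the rows whose `column` satisfies `op` against `value`.

    `value` arrives as a string from the command line, so it is cast to
    the type of each cell before the comparison.
    """
    compare = OPERATORS[op]
    kept = []
    for row in rows:
        cell = row[column]
        typed_value = type(cell)(value)
        if compare(cell, typed_value):
            kept.append(row)
    return kept


rows = [
    {"name": "exp_a", "total_trials": 1},
    {"name": "exp_b", "total_trials": 3},
]
single_trial = filter_rows(rows, "total_trials", "==", "1")
```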
2019-03-27 11:19:25 -07:00
Robert Nishihara
c6f12e5219
Update documentation from 0.7.0.dev1 to 0.7.0.dev2. ( #4485 )
2019-03-26 17:32:53 -07:00
Robert Nishihara
c0e10ef12d
Bump version number from 0.6.5 to 0.7.0.dev2. ( #4484 )
2019-03-26 16:44:32 -07:00
Robert Nishihara
8548f12eb2
Give better error when include_webui=1 and webui can't be started. ( #4471 )
2019-03-26 14:54:32 -07:00
Eric Liang
cff08e19ff
[rllib] Print out intermediate data shapes on the first iteration ( #4426 )
2019-03-26 00:27:59 -07:00
Eric Liang
8ee240f40e
[rllib] Use 64-byte aligned memory when concatenating arrays ( #4408 )
2019-03-25 23:56:51 -07:00
Vlad Firoiu
c68eea6134
[rllib] More efficient tuple flattening. ( #4416 )
...
* More efficient tuple flattening.
* Preprocessor.write uses transform by default.
* lint
* to array
* Update test_catalog.py
* Update test_catalog.py
2019-03-25 16:00:33 -07:00
Richard Liaw
a275af337e
[tune] Make examples more verbose ( #4469 )
...
## What do these changes do?
Verbosity defaults to "1", so this change sets the verbosity explicitly for a couple of examples.
## Related issue number
Fixes #4467
2019-03-25 15:13:17 -07:00
Eric Liang
5b8eb475ce
[rllib] Allow None to be specified in multi-agent envs ( #4464 )
...
* wip
* check
* doc update
* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
William Ma
11580fb7dc
Changes where actor resources are assigned ( #4323 )
2019-03-24 15:49:36 -07:00
Eric Liang
01699ce4ea
[rllib] Fix race condition with multiple data loaders, fix stats
2019-03-23 20:17:01 -07:00
Robert Nishihara
01747b11a1
Bump version from 0.7.0.dev1 to 0.6.5. ( #4461 )
2019-03-22 15:03:29 -07:00
Richard Liaw
32bf23d24f
[tune] Remove output of tests
2019-03-22 10:48:03 -07:00
Leon Sievers
b21c20c9a6
[rllib] Added missing action clipping for rollout example script ( #4413 )
...
* Added action clipping for rollout example script
* Used action_clipping from sampler
* Fixed and improved naming
2019-03-22 00:51:27 -07:00
Eric Liang
4b8b703561
[rllib] Some API cleanups and documentation improvements ( #4409 )
2019-03-21 21:34:22 -07:00
Ion
59079a799c
Signal actor failure ( #4196 )
2019-03-21 15:17:42 -07:00
Kai Yang
c36d03874b
Redis returns OK when removing a non-existent set entry ( #4434 )
2019-03-21 11:59:15 -07:00
Eric Liang
57c1aeb427
[rllib] Use suppress_output instead of run_silent.sh script for tests ( #4386 )
...
* fix
* enable custom loss
* Update run_rllib_tests.sh
* enable tests
* fix action prob
* Update suppress_output
* fix example
* fix
2019-03-21 00:15:24 -07:00
Hao Chen
d03999d01e
Cross-language invocation Part 1: Java calling Python functions and actors ( #4166 )
2019-03-21 13:34:21 +08:00
Richard Liaw
828dc08ac8
[tune] Fix tests for Function API for better consistency ( #4421 )
2019-03-20 22:31:38 -07:00
Robert Nishihara
9c158c6a87
Start dashboard on all nodes and other small fixes. ( #4428 )
...
* Start reporter on all nodes.
* More fixes
2019-03-20 13:04:06 -07:00
Stephanie Wang
4ac9c1ed6e
Fix bug in cluster mode where driver exits when there are tasks in the waiting queue ( #4251 )
2019-03-20 10:18:27 -07:00