1
0
Fork 0
mirror of https://github.com/vale981/ray synced 2025-03-17 08:36:38 -04:00
Commit graph

3420 commits

Author SHA1 Message Date
ppeagle
6697407ec4 [Java Streaming] Fix StreamSource constructor and SourceFunction initialization () 2019-04-10 20:02:11 +08:00
Kristian Hartikainen
ed02bf11f7 [autoscaler] Lint code that we forgot to lint in ()
* Lint code that we forgot to lint in previous PR

* Revert setup command merge

* Lint

* Revert "Revert setup command merge"

This reverts commit 55e1cdb1f256ea51ef66a38730d8f7865f1f5ad1.

* Fix testReportsConfigFailures test

* Minor syntax tweaks

* Lint
2019-04-10 17:01:36 +08:00
Vlad Firoiu
74fd3d7e21 [rllib] Support prev_state/prev_action in rollout and fix multiagent ()
* Cleaner and more correct treatment of agent states in rollout.py

* support lstm_use_prev_action_reward in rollout.py

* Linter.

* appease flake8

* Use _DUMMY_AGENT_ID instead of 0.

* All agents have a policy_agent_mapping.
Reset the mapping cache at the start of each episode.

* Update rollout.py

* Fix rollout.py for single-agent envs.

* Use agent_id, not policy_id.
2019-04-10 00:01:25 -07:00
Eric Liang
f8e8743347
[tune] Improve PBT example () 2019-04-09 20:59:17 -07:00
Si-Yuan
dab99d26af
Improve code related to node ()
* Make full use of node

implement local node

fix bugs mentioned in comments

* Add more tests

* Use more specific exception handling

* fix, lint

* fix for py2.x
2019-04-09 17:27:54 +08:00
Wang Qing
c5bcec54f3 Add ignore items for java build () 2019-04-09 15:59:58 +08:00
Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF () 2019-04-09 00:36:49 -07:00
Duane
7f23e8431b fixed typo in kuber yaml () 2019-04-08 23:13:42 -07:00
Stefan Pantic
915486984a [autoscaler] Add support for separate docker containers on head and worker nodes ()
* Added support for running different docker containers on clusters

* Remove node specific container names

* Keep old options and expand with node specific configuration

* Optimized imports

* Changed docker fields for autoscaler

* Auto reformat

* Updated comments

* Updated condition

* Run linter

* Updated example

* Changed condition for docker images, updated examples

* Removed duplicate line

* Fixed setup_commands

* Update autoscaler.py

* fix_better_image
2019-04-07 16:51:32 -07:00
Jones Wong
da5a471485 [rllib] validate observation in NoPreprocessor () 2019-04-07 16:11:50 -07:00
Eric Liang
f9b8e77e3b
[rllib] Don't merge unrolls from same episode when calculating seq lens () 2019-04-07 12:11:30 -07:00
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer () 2019-04-07 00:36:18 -07:00
Dušan Josipović
820c71b7d0 [tune/rllib] Add checkpoint eraser () 2019-04-06 20:01:54 -07:00
ctombumila37
7746d20d30 [rllib] ExternalMultiAgentEnv () 2019-04-06 19:58:14 -07:00
Andrew Tan
991b911e1d [tune] Add --columns flag for CLI () 2019-04-05 19:49:01 -07:00
Jérémy
300ec72d15 [tune] Add compatibility to nevergrad 0.2.0+ ()
## What do these changes do?

This PR prepares for future version  0.2.0 of `nevergrad`, in which each suggestion is a `Candidate` instance having fields `args` and `kwargs` instead of being a `np.ndarray`. The proposed changes are compatible with all versions of `nevergrad` (manually tested with `nevergrad_example.py` on both `master` and current version `v0.1.6`).

See `nevergrad`'s [CHANGELOG](https://github.com/facebookresearch/nevergrad/blob/master/CHANGELOG.md) for more information on the change.

## Related issue number

None

## Linter

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 19:44:58 -07:00
Andrew Tan
bfd0af52bc [tune] Add documentation to --output flag ()
## What do these changes do?

Add documentation for the `--output` flag for ls / lsx in the Tune CLI.

## Related issue number

Closes  

## Linter

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Richard Liaw
50b2aa0740
[tune] Better handling of tune.function in global checkpoint ()
Enables result keys to be queried by CLI.
2019-04-04 21:08:47 -07:00
Federico Fontana
fb88f7efe6 Fixed bug in Dirichlet () () 2019-04-04 14:33:09 -07:00
Tasha Chin
5693cd1344 [docs] Show source code () 2019-04-03 21:30:20 -07:00
William Ma
4b25810994 Adds a push_id to every push in the object manager () 2019-04-03 17:12:06 -07:00
Yuhong Guo
c2349cf12d Remove local/global_scheduler from code and doc. () 2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c [tune] Search Alg delay import + CLI timing test () 2019-04-03 08:52:45 -07:00
cclauss
68ccc4d3cf Travis CI: Do not hard-code Trusty, it EOLs this month ()
* Travis CI: Do not hard-code Trusty, it EOLs this month

Do not hard-code __Trusty__ because it reaches its end-of-life this month.
https://wiki.ubuntu.com/Releases

* Update .travis.yml
2019-04-03 16:39:52 +08:00
Philipp Moritz
b0f6ddf6d1 Remove CMake files () 2019-04-02 22:17:33 -07:00
Wang Qing
7d776f35e1 Integrate metrics () 2019-04-02 21:01:02 -07:00
cclauss
8e19d3721f Travis CI: The 'sudo' tag is now deprecated ()
[Travis are now recommending removing the __sudo__ tag](https://blog.travis-ci.com/2018-11-19-required-linux-infrastructure-migration).

"_If you currently specify __sudo: false__ in your __.travis.yml__, we recommend removing that configuration_"
2019-04-03 10:55:19 +08:00
Hao Chen
23404f7bcf Fix some flaky tests () 2019-04-02 17:57:11 -07:00
Simon Mo
db4cf24636 [serve] Double Serialization Optimization () 2019-04-02 12:35:03 -07:00
Eric Liang
55a2d39409
[rllib] Add option for RNN state and value estimates to span episodes ()
* wip soft horizon

* tests
2019-04-02 02:44:15 -07:00
Yuhong Guo
c2c548bdfd Fix broken pipe callback () 2019-04-02 17:42:18 +08:00
bibabolynn
20c7b2a6eb [Java] TestNG outputs more verbose error messages ()
[Java] TestNG outputs more verbose error messages
2019-04-02 17:41:20 +08:00
Jones Wong
fe7763e786 [rllib] replace the assertion in SyncReplayOptimizer by a warning () 2019-04-02 01:43:22 -07:00
opherlieber
60b230b8ad [rllib] Add support for LR schedule to DQN/APEX () 2019-04-01 11:35:34 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO () 2019-03-31 12:25:52 -07:00
Toanngo
dffe19c59c Update GCP gpu image () 2019-03-31 01:01:22 -07:00
Eric Liang
b01ac41e6f
[rllib] Try to call close on envs on stop () 2019-03-30 19:36:05 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() () 2019-03-30 14:07:50 -07:00
ppeagle
5efb21e1d0 Initial commit for Ray streaming () 2019-03-30 19:32:05 +08:00
Eric Liang
e5bcae52f5
Add lint advisory to PR template 2019-03-29 16:49:02 -07:00
Risto Vuorio
798944fbfa Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… ()
* Fixes Inconsistent weight assignment operations in DQNPolicyGraph ()

* Update dqn_policy_graph.py
2019-03-29 13:31:59 -07:00
Leon Sievers
f4b313eaad [rllib] Moved clip_action into policy_graph; Clip actions in compute_single_action ()
* Moved clip_action into policy_graph; Clip actions in compute_single_action

* Update policy_graph.py

* Changed formatting

* Updated codebase for convencience
2019-03-29 13:26:07 -07:00
gehring
5133b10700 Add support for tensorflow resource variables ()
* Adding support for resource variables

Currently resource variable go undetected by the `TensorFlowVariables` since they do not use the same ops for reading values. This change should fix this until a more robust solution is implemented.

* fix varhandle
2019-03-29 13:23:05 -07:00
bjg2
77005d1814 [rllib] Make batch timeout for remote workers tunable () 2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs () 2019-03-29 12:44:23 -07:00
bibabolynn
ab55a1f93a [Java] Clean up outdated dependencies () 2019-03-28 14:33:45 +08:00
Philipp Moritz
1bcb0b94cc Synchronize arrow version and put changes upstream () 2019-03-27 22:37:07 -07:00
Robert Nishihara
a22bf1e511 Minor aesthetic changes to python file. () 2019-03-28 10:59:29 +08:00
Eric Liang
09b2961750
[rllib] Ensure stats are consistently reported across all algos () 2019-03-27 15:40:15 -07:00
Eric Liang
2871609296
[rllib] Report sampler performance metrics () 2019-03-27 13:24:23 -07:00