Devin Petersohn
56a78baf67
Bump version to 0.6.6 ( #4621 )
2019-04-13 10:37:17 -07:00
Hao Chen
d52b080081
[Java] Avoid unnecessary memory copy and add a benchmark ( #4611 )
2019-04-14 00:17:04 +08:00
Richard Liaw
0bfb0d2c29
[tune] Fix checkpointing for Gym Types
2019-04-12 21:03:56 -07:00
Eric Liang
6e7680bf21
[rllib] Clean up concepts documentation and policy optimizer creation ( #4592 )
2019-04-12 21:03:26 -07:00
Romil Bhardwaj
0f42f87ebc
Updating zero capacity resource semantics ( #4555 )
2019-04-12 16:53:57 -07:00
cfan
bb207a205b
[rllib] Support torch device and distributions. ( #4553 )
2019-04-12 11:39:14 -07:00
Wang Qing
5cfbfe5df6
[Java] Implement GcsClient ( #4601 )
2019-04-12 22:44:47 +08:00
Wang Qing
fe07a5b4b1
Add delete_creating_tasks option for internal.free() ( #4588 )
...
* add delete creating task objects.
* format code style
* Fix lint
* add tests and address comments.
* Refine test
* Refine java test
* Fix CI
* Refine
* Fix lint
* Fix CI
2019-04-12 13:38:31 +08:00
justinwyang
e88e706fcc
Enforce quoting style in Travis. ( #4589 )
2019-04-11 14:24:26 -07:00
ppeagle
6697407ec4
[Java Streaming] Fix StreamSource constructor and SourceFunction initialization ( #4597 )
2019-04-10 20:02:11 +08:00
Kristian Hartikainen
ed02bf11f7
[autoscaler] Lint code that we forgot to lint in #4537 ( #4584 )
...
* Lint code that we forgot to lint in previous PR
* Revert setup command merge
* Lint
* Revert "Revert setup command merge"
This reverts commit 55e1cdb1f256ea51ef66a38730d8f7865f1f5ad1.
* Fix testReportsConfigFailures test
* Minor syntax tweaks
* Lint
2019-04-10 17:01:36 +08:00
Vlad Firoiu
74fd3d7e21
[rllib] Support prev_state/prev_action in rollout and fix multiagent ( #4565 )
...
* Cleaner and more correct treatment of agent states in rollout.py
* support lstm_use_prev_action_reward in rollout.py
* Linter.
* appease flake8
* Use _DUMMY_AGENT_ID instead of 0.
* All agents have a policy_agent_mapping.
Reset the mapping cache at the start of each episode.
* Update rollout.py
* Fix rollout.py for single-agent envs.
* Use agent_id, not policy_id.
2019-04-10 00:01:25 -07:00
Eric Liang
f8e8743347
[tune] Improve PBT example ( #4575 )
2019-04-09 20:59:17 -07:00
Si-Yuan
dab99d26af
Improve code related to node ( #4383 )
...
* Make full use of node
implement local node
fix bugs mentioned in comments
* Add more tests
* Use more specific exception handling
* fix, lint
* fix for py2.x
2019-04-09 17:27:54 +08:00
Wang Qing
c5bcec54f3
Add ignore items for java build ( #4579 )
2019-04-09 15:59:58 +08:00
Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF ( #4554 )
2019-04-09 00:36:49 -07:00
Duane
7f23e8431b
fixed typo in kuber yaml ( #4582 )
2019-04-08 23:13:42 -07:00
Stefan Pantic
915486984a
[autoscaler] Add support for separate docker containers on head and worker nodes ( #4537 )
...
* Added support for running different docker containers on clusters
* Remove node specific container names
* Keep old options and expand with node specific configuration
* Optimized imports
* Changed docker fields for autoscaler
* Auto reformat
* Updated comments
* Updated condition
* Run linter
* Updated example
* Changed condition for docker images, updated examples
* Removed duplicate line
* Fixed setup_commands
* Update autoscaler.py
* fix_better_image
2019-04-07 16:51:32 -07:00
Jones Wong
da5a471485
[rllib] validate observation in NoPreprocessor ( #4546 )
2019-04-07 16:11:50 -07:00
Eric Liang
f9b8e77e3b
[rllib] Don't merge unrolls from same episode when calculating seq lens ( #4557 )
2019-04-07 12:11:30 -07:00
Eric Liang
37208216ae
[rllib] Rename Agent to Trainer ( #4556 )
2019-04-07 00:36:18 -07:00
Dušan Josipović
820c71b7d0
[tune/rllib] Add checkpoint eraser ( #4490 )
2019-04-06 20:01:54 -07:00
ctombumila37
7746d20d30
[rllib] ExternalMultiAgentEnv ( #4200 )
2019-04-06 19:58:14 -07:00
Andrew Tan
991b911e1d
[tune] Add --columns flag for CLI ( #4564 )
2019-04-05 19:49:01 -07:00
Jérémy
300ec72d15
[tune] Add compatibility to nevergrad 0.2.0+ ( #4529 )
...
## What do these changes do?
This PR prepares for future version 0.2.0 of `nevergrad`, in which each suggestion is a `Candidate` instance having fields `args` and `kwargs` instead of being a `np.ndarray`. The proposed changes are compatible with all versions of `nevergrad` (manually tested with `nevergrad_example.py` on both `master` and current version `v0.1.6`).
See `nevergrad`'s [CHANGELOG](https://github.com/facebookresearch/nevergrad/blob/master/CHANGELOG.md) for more information on the change.
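A minimal sketch of how both suggestion styles might be handled is shown here. It is not the code from this PR: `FakeCandidate`, `unpack_suggestion`, and `parameter_names` are hypothetical names introduced for illustration, and the stand-in class models only the `args` and `kwargs` fields mentioned above. The old-style suggestion is represented by a plain list rather than an `np.ndarray` to keep the example dependency-free.

```python
from collections import namedtuple

# Hypothetical stand-in for a nevergrad 0.2.0+ suggestion; only the two
# fields named in the description (args, kwargs) are modeled.
FakeCandidate = namedtuple("FakeCandidate", ["args", "kwargs"])


def unpack_suggestion(suggestion, parameter_names):
    """Build a config dict from either an old- or new-style suggestion.

    Old style (assumed): a flat sequence of values, one per parameter name.
    New style (assumed): an object with `args` and `kwargs` fields.
    """
    if hasattr(suggestion, "args") and hasattr(suggestion, "kwargs"):
        if suggestion.kwargs:
            # Named parameters: use them directly.
            return dict(suggestion.kwargs)
        # Positional only: assume the first positional argument holds the values.
        return dict(zip(parameter_names, suggestion.args[0]))
    # Old style: the suggestion itself is the sequence of values.
    return dict(zip(parameter_names, suggestion))


old_style = [0.1, 0.9]
new_style = FakeCandidate(args=([0.1, 0.9],), kwargs={})
print(unpack_suggestion(old_style, ["lr", "momentum"]))
print(unpack_suggestion(new_style, ["lr", "momentum"]))
```

Checking for the fields with `hasattr` rather than comparing version strings keeps the branch independent of how `nevergrad` reports its version, which is one plausible way to stay compatible with both releases.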
## Related issue number
None
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 19:44:58 -07:00
Andrew Tan
bfd0af52bc
[tune] Add documentation to --output flag ( #4518 )
...
## What do these changes do?
Add documentation for the `--output` flag for ls / lsx in the Tune CLI.
## Related issue number
Closes #4511
## Linter
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Richard Liaw
50b2aa0740
[tune] Better handling of tune.function in global checkpoint ( #4519 )
...
Enables result keys to be queried by CLI.
2019-04-04 21:08:47 -07:00
Federico Fontana
fb88f7efe6
Fixed bug in Dirichlet ( #4440 ) ( #4560 )
2019-04-04 14:33:09 -07:00
Tasha Chin
5693cd1344
[docs] Show source code ( #3281 )
2019-04-03 21:30:20 -07:00
William Ma
4b25810994
Adds a push_id to every push in the object manager ( #4407 )
2019-04-03 17:12:06 -07:00
Yuhong Guo
c2349cf12d
Remove local/global_scheduler from code and doc. ( #4549 )
2019-04-03 17:05:09 -07:00
Adi Zimmerman
51dae23d5c
[tune] Search Alg delay import + CLI timing test ( #4230 )
2019-04-03 08:52:45 -07:00
cclauss
68ccc4d3cf
Travis CI: Do not hard-code Trusty, it EOLs this month ( #4545 )
...
* Travis CI: Do not hard-code Trusty, it EOLs this month
Do not hard-code __Trusty__ because it reaches its end-of-life this month.
https://wiki.ubuntu.com/Releases
* Update .travis.yml
2019-04-03 16:39:52 +08:00
Philipp Moritz
b0f6ddf6d1
Remove CMake files ( #4493 )
2019-04-02 22:17:33 -07:00
Wang Qing
7d776f35e1
Integrate metrics ( #4246 )
2019-04-02 21:01:02 -07:00
cclauss
8e19d3721f
Travis CI: The 'sudo' tag is now deprecated ( #4542 )
...
[Travis are now recommending removing the __sudo__ tag](https://blog.travis-ci.com/2018-11-19-required-linux-infrastructure-migration).
"_If you currently specify __sudo: false__ in your __.travis.yml__, we recommend removing that configuration_"
2019-04-03 10:55:19 +08:00
Hao Chen
23404f7bcf
Fix some flaky tests ( #4535 )
2019-04-02 17:57:11 -07:00
Simon Mo
db4cf24636
[serve] Double Serialization Optimization ( #4532 )
2019-04-02 12:35:03 -07:00
Eric Liang
55a2d39409
[rllib] Add option for RNN state and value estimates to span episodes ( #4429 )
...
* wip soft horizon
* tests
2019-04-02 02:44:15 -07:00
Yuhong Guo
c2c548bdfd
Fix broken pipe callback ( #4513 )
2019-04-02 17:42:18 +08:00
bibabolynn
20c7b2a6eb
[Java] TestNG outputs more verbose error messages ( #4507 )
2019-04-02 17:41:20 +08:00
Jones Wong
fe7763e786
[rllib] replace the assertion in SyncReplayOptimizer by a warning ( #4534 )
2019-04-02 01:43:22 -07:00
opherlieber
60b230b8ad
[rllib] Add support for LR schedule to DQN/APEX ( #4473 )
2019-04-01 11:35:34 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO ( #4324 )
2019-03-31 12:25:52 -07:00
Toanngo
dffe19c59c
Update GCP gpu image ( #4524 )
2019-03-31 01:01:22 -07:00
Eric Liang
b01ac41e6f
[rllib] Try to call close on envs on stop ( #4521 )
2019-03-30 19:36:05 -07:00
Eric Liang
fce0062380
[rllib] Switch to tune.run() instead of run_experiments() ( #4515 )
2019-03-30 14:07:50 -07:00
ppeagle
5efb21e1d0
Initial commit for Ray streaming ( #4268 )
2019-03-30 19:32:05 +08:00
Eric Liang
e5bcae52f5
Add lint advisory to PR template
2019-03-29 16:49:02 -07:00
Risto Vuorio
798944fbfa
Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4… ( #4504 )
...
* Fixes Inconsistent weight assignment operations in DQNPolicyGraph (#4502 )
* Update dqn_policy_graph.py
2019-03-29 13:31:59 -07:00