Eric Liang
ae54e0dc0a
[rllib] Copy plasma memory before adding data to replay buffer
2020-04-29 14:17:54 -07:00
Edward Oakes
17f0d50f1a
[serve] Temporarily disable test_master_crashes ( #8230 )
2020-04-29 14:36:09 -05:00
Xianyang Liu
fbf23eb6ff
[SGD] Fix IterableDataset errors ( #8208 )
2020-04-29 10:51:31 -07:00
Simon Mo
1b1fe0cc5b
Fix Serve long running test ( #8223 )
2020-04-29 09:32:39 -07:00
ijrsvt
c393b6d165
Remove logging ( #8211 )
2020-04-29 09:15:43 -07:00
Sven Mika
bf25aee392
[RLlib] Deprecate all Model(v1) usage. ( #8146 )
...
Deprecate all Model(v1) usage.
2020-04-29 12:12:59 +02:00
Sven Mika
eb91619175
Fix release 0.8.5 tests for PPO torch Breakout. ( #8226 )
2020-04-29 10:36:41 +02:00
chaokunyang
91f630f709
[Streaming] Streaming Cross-Lang API ( #7464 )
2020-04-29 13:42:08 +08:00
Simon Mo
101255f782
[Serve] RayServe TF, PyTorch, Sklearn Examples ( #8156 )
2020-04-28 22:24:55 -07:00
Simon Mo
af3d3e778e
[RayServe] Specify installation instruction in doc ( #8220 )
2020-04-28 14:38:10 -07:00
Richard Liaw
4d639354cd
[tune] Hotfix for test_ls ( #8215 )
2020-04-28 14:06:12 -07:00
Edward Oakes
7c0200c93b
[serve] Master actor fault tolerance ( #8116 )
2020-04-28 15:52:29 -05:00
Edward Oakes
ebdccde030
Fetch internal config from raylet ( #8195 )
2020-04-28 13:12:11 -05:00
Sven Mika
1775e89f26
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. ( #8143 )
...
Deprecate TupleActions and support arbitrarily nested action spaces.
Closes issue #8143 .
2020-04-28 14:59:16 +02:00
fangfengbin
deffc340ea
[GCS]Add in-memory gcs table storage ( #8184 )
2020-04-28 17:19:46 +08:00
aannadi
eb790bf3a3
[Dashboard] Set logdir in Tune Dashboard and TensorBoard Opt-in ( #8074 )
2020-04-27 20:17:52 -07:00
WuTao
32c2055c99
Streaming state ( #7348 )
2020-04-28 10:36:32 +08:00
Richard Liaw
be5235d982
[tune] Clarify Intro Tune Documentation ( #8201 )
2020-04-27 18:01:00 -07:00
ijrsvt
a77e5a8cbf
[Doc] Fix Docstring for Task Cancellation ( #8198 )
2020-04-27 17:06:08 -07:00
Neil Lugovoy
8cf598deab
[sgd] Fix GPU Reservations in LocalDistributedRunner ( #8157 )
2020-04-27 16:03:33 -07:00
Sven Mika
4e713152e9
[RLlib] Fix for issue https://github.com/ray-project/ray/issues/8191 ( #8200 )
...
Fix attribute error when missing exploration in Policy.
Issue #8191
2020-04-27 23:19:26 +02:00
Robert Nishihara
48250217ac
Fix API documentation formatting. ( #8197 )
2020-04-27 10:48:42 -07:00
Philipp Moritz
d7da25eee1
Use RAY_ADDRESS to connect to an existing Ray cluster if present ( #7977 )
2020-04-27 09:59:37 -07:00
Robert Zangnan Yu
a77b19e4f2
[docs] Comments on potential srun orders during Slurm Deployment ( #8183 )
2020-04-27 09:30:16 -07:00
Richard Liaw
87557a00fa
[tune] Refactor search algorithms ( #7037 )
...
* start refactoring of search algorithms
* format
* needs tests
* fix
* suggestions
* Fix PBT
* lint
* refactoring
* hyperopt_working
* dragonfly
* hyperopt
* change_half_of_algs
* save
* code-removed
* remove_lots_of_unneccessary
* changes
* formatting
* suggest
* reset
* rm
* tests
* search-change
* exception
* refactor-doc
* search
* py
* moredocs
* Update doc/source/tune-searchalg.rst
* concurrency
* max
* tune
* betterwarning
* bohb
* tests
* test-change
Co-authored-by: ujvl <misraujval@gmail.com>
2020-04-27 08:51:13 -07:00
Kai Yang
1d5bceddf0
fix java UT about multi-threading ( #8014 )
2020-04-27 15:11:22 +08:00
Sven Mika
7ec2223c84
[RLlib] DDPG PyTorch actor-model was missing sigmoid layer ( #8188 )
...
Fix DDPG PyTorch (missing sigmoid layer (to squash action outputs) after deterministic action outputs).
2020-04-26 23:08:13 +02:00
mehrdadn
b9de9dadd7
Fix Windows build ( #8186 )
...
Co-authored-by: Mehrdad <noreply@github.com>
2020-04-26 13:07:25 -07:00
chaokunyang
5cf49d5edd
Fix streaming ci ( #8159 )
2020-04-26 20:56:58 +08:00
fangfengbin
5bff707d20
[GCS]Add in-memory store client ( #8144 )
2020-04-26 19:09:26 +08:00
ZhuSenlin
9255fcd516
[GCS] Add node failure detector ( #8119 )
2020-04-26 19:08:27 +08:00
fangfengbin
c5d181e3d9
gcs adapts to worker table pub sub ( #8182 )
...
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-04-26 17:58:55 +08:00
Richard Liaw
5bc6e32c0a
[autoscaler] latest_dlami update ( #8178 )
2020-04-26 00:25:46 -07:00
fangfengbin
f17bea2de5
Fix get gcs server address block bug ( #8126 )
2020-04-26 10:01:06 +08:00
Tomasz Wrona
b508166419
Copy initial state of an RNN to a CPU before converting it to a NumPy array ( #8097 )
2020-04-25 18:49:09 -07:00
Richard Liaw
b506f87117
[tune] New Doc edits, add Concepts page ( #8083 )
...
Co-Authored-By: Sven Mika <sven@anyscale.io>
2020-04-25 18:25:56 -07:00
ijrsvt
69ff7e3e35
TaskCancellation ( #7669 )
...
* Smol comment
* WIP, not passing ray.init
* Fixed small problem
* wip
* Pseudo interrupt things
* Basic prototype operational
* correct proc title
* Mostly done
* Cleanup
* cleaner raylet error
* Cleaning up a few loose ends
* Fixing Race Conds
* Prelim testing
* Fixing comments and adding second_check for kill
* Working_new_impl
* demo_ready
* Fixing my english
* Fixing a few problems
* Small problems
* Cleaning up
* Response to changes
* Fixing error passing
* Merged to master
* fixing lock
* Cleaning up print statements
* Format
* Fixing Unit test build failure
* mock_worker fix
* java_fix
* Canel
* Switching to Cancel
* Responding to Review
* FixFormatting
* Lease cancellation
* FInal comments?
* Moving exist check to CoreWorker
* Fix Actor Transport Test
* Fixing task manager test
* chaning clock repr
* Fix build
* fix white space
* lint fix
* Updating to medium size
* Fixing Java test compilation issue
* lengthen bad timeouts
2020-04-25 16:04:52 -07:00
Richard Liaw
9dd3490c38
[tune] Safer try-catch for TensorboardX ( #8174 )
...
Co-Authored-By: Kristian Hartikainen <kristian.hartikainen@gmail.com>
2020-04-25 13:08:37 -07:00
Simon Mo
13c14eac07
[Asyncio] Remove async init legacy code ( #8177 )
...
* [Asyncio] Remove async init legacy code
* Fix places that call async_init
2020-04-25 09:32:38 -07:00
Edward Oakes
9dc625318f
[serve] Add basic test for specifying the method in a serve call ( #8172 )
2020-04-24 20:15:27 -05:00
Scott Graham
0dc01d8c1e
[autoscaler] Azure versioning ( #8168 )
2020-04-24 17:03:55 -07:00
fangfengbin
38dfe5db86
remove store client template ( #8160 )
...
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-04-24 21:19:12 +08:00
fangfengbin
713e375d50
[GCS]GCS adapts to job table pub sub ( #8145 )
2020-04-24 16:33:25 +08:00
Eric Liang
2298f6fb40
[rllib] Port DQN/Ape-X to training workflow api ( #8077 )
2020-04-23 12:39:19 -07:00
Sven Mika
499ad5fbe4
[RLlib] PyTorch version of APPO. ( #8120 )
...
- Translate all vtrace functionality to torch and added torch to the framework_iterator-loop in all existing vtrace test cases.
- Add learning test cases for APPO torch (both w/ and w/o v-trace).
- Add quick compilation tests for APPO (tf and torch, v-trace and no v-trace).
2020-04-23 09:11:12 +02:00
Sven Mika
e9ee5c4e5f
[RLlib] Nested action space PR (minimally invasive; torch only + test). ( #8101 )
...
- Add TorchMultiActionDistribution class.
- Add framework-agnostic test cases for TorchMultiActionDistribution.
2020-04-23 09:09:22 +02:00
Nick Matthews
a9d8d16b6b
Change memory monitor warning to a logging call ( #8137 )
2020-04-22 21:29:18 -07:00
yncxcw
51559c08b9
Fix mis-memory counting in memory monitor for contaienr environment ( #8113 )
...
Co-authored-by: weich <weich@nvidia.com>
2020-04-22 14:32:35 -07:00
Edward Oakes
0bb918f2b1
Disable eager execution to fix test_tensorflow ( #8133 )
2020-04-22 15:54:42 -05:00
Edward Oakes
f9f41e5a1a
[serve] Fix nonblocking serve.init() ( #8068 )
2020-04-22 11:51:27 -05:00