Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Richard Liaw
094ec7adbc
[tune] Allow nested values in trial runner ( #5346 )
2019-08-06 14:36:17 -07:00
Philipp Moritz
e8d9cfc1f1
Ray projects schema and validation ( #5329 )
2019-08-06 14:36:04 -07:00
Hao Chen
3ad2fe76e0
Cap concurrent requests ( #5341 )
2019-08-06 13:56:19 -07:00
Matthew A. Wright
e3c9f7e83a
Custom action distributions ( #5164 )
...
* custom action dist wip
* Test case for custom action dist
* ActionDistribution.get_parameter_shape_for_action_space pattern
* Edit exception message to also suggest using a custom action distribution
* Clean up ModelCatalog.get_action_dist
* Pass model config to ActionDistribution constructors
* Update custom action distribution test case
* Name fix
* Autoformatter
* parameter shape static methods for torch distributions
* Fix docstring
* Generalize fake array for graph initialization
* Fix action dist constructors
* Correct parameter shape static methods for multicategorical and gaussian
* Make suggested changes to custom action dist's
* Correct instances of not passing model config to action dist
* Autoformatter
* fix tuple distribution constructor
* bugfix
2019-08-06 11:13:16 -07:00
Richard Liaw
94bff244e4
[docs] Hotfix for removing unneeded files ( #5383 )
2019-08-06 08:46:59 -07:00
Joey Jiang
02c5d2be20
Add common preprocessing for each request in node manager. ( #5296 )
2019-08-06 20:48:58 +08:00
Eric Liang
0a3ff489fa
Send raylet error logs through the log monitor ( #5351 )
2019-08-05 23:35:09 -07:00
Richard Liaw
a08ea09760
[docs] rewrite ( #5175 )
2019-08-05 23:33:14 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Kai Yang
384cbfb211
Fix duplicated timeout logic in AbstractRayRuntime.get() ( #5338 )
2019-08-06 13:36:49 +08:00
Richard Liaw
32f275344b
[tune] Pandas as soft dep
2019-08-05 18:59:08 -07:00
Qing Wang
cc5c78b1da
Fix the issue of not initializing GLOG
2019-08-05 13:26:09 -07:00
Eric Liang
955154a19d
Reduce Ray / RLlib startup messages ( #5368 )
2019-08-05 13:23:54 -07:00
Michael Gump
67f9e225cf
[tune] Fix small bug in experiment_analysis ( #5365 )
...
* [tune] Fix small bug in experiment_analysis
_experiment_state was not set as an attribute of ExperimentAnalysis
* Update python/ray/tune/analysis/experiment_analysis.py
Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>
2019-08-05 13:19:30 -07:00
Zhijun Fu
134c6bd128
[direct call] In memory store ( #5303 )
2019-08-05 13:14:45 -07:00
Simon Mo
25b5bd1530
ray stop
sends SIGKILL
instead of SIGTERM
(#5354 )
2019-08-02 14:46:03 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough ( #5157 )
2019-08-02 09:17:20 -07:00
Kristian Hartikainen
13fb9fe3db
[rllib] Feature/soft actor critic v2 ( #5328 )
...
* Add base for Soft Actor-Critic
* Pick changes from old SAC branch
* Update sac.py
* First implementation of sac model
* Remove unnecessary SAC imports
* Prune unnecessary noise and exploration code
* Implement SAC model and use that in SAC policy
* runs but doesn't learn
* clear state
* fix batch size
* Add missing alpha grads and vars
* -200 by 2k timesteps
* doc
* lazy squash
* one file
* ignore tfp
* revert done
2019-08-01 23:37:36 -07:00
Xianyang Liu
3ae54a2b20
Fix log monitor read error ( #5221 )
2019-08-01 15:47:10 -07:00
Eric Liang
20450a4e82
[rllib] Add rock paper scissors multi-agent example ( #5336 )
2019-08-01 13:03:59 -07:00
jichan3751
bd6dfc994f
[sgd] Replaced class Resources in sgd with use_gpu
( #5252 )
2019-08-01 01:03:10 -07:00
Lukas Folle
0391050daf
Fixed link in tune that was not working ( #5331 )
2019-07-31 21:43:46 -07:00
Samir Al-Stouhi
51b8915c0a
Added CARLA Community Example ( #5333 )
2019-07-31 18:10:50 -07:00
Simon Mo
d762379904
[Asyncio] Allow Async_API to init when loop is running ( #5323 )
2019-07-31 15:29:42 -07:00
Kristian Hartikainen
1345802c39
[autoscaler] Change sys.exit(1) in update ssh_cmd ( #5266 )
2019-07-31 13:45:05 -07:00
Jaroslaw Rzepecki
b3c8091a35
Fix Tuple spaces in rollout.py ( #5332 )
...
Make sure that the initial action is also properly flattened.
2019-07-31 11:38:49 -07:00
Stephanie Wang
e218e615df
Lineage cache performance optimization to avoid duplicate GCS requests #5327
2019-07-31 10:43:29 -07:00
Robert Nishihara
63a6b0e710
Fix bug in passing large arguments to tasks. ( #5325 )
2019-07-30 22:28:35 -07:00
Hao Chen
991e71dde6
Submit task asynchronously from raylet client ( #5313 )
2019-07-30 12:58:57 -07:00
Zhijun Fu
eb307f93f8
Support direct actor call ( #5183 )
2019-07-30 17:47:17 +08:00
Simon Mo
196495a4de
Fix Redis Test ( #5302 )
2019-07-30 00:22:16 -07:00
micafan
b3bcf59148
Rename ClientTableData to GcsNodeInfo ( #5251 )
2019-07-30 11:22:47 +08:00
Simon Mo
3ba8680963
Bump version to 0.8.0.dev3 ( #5308 )
2019-07-29 18:28:38 -07:00
Michael Luo
1337c98f02
[rllib] Importance Sampling and KL Loss for APPO ( #5051 )
2019-07-29 15:02:32 -07:00
Simon Mo
3b00144e7d
Bump version to 0.7.3 ( #5301 )
2019-07-29 10:25:32 -07:00
Eric Liang
3bdd114282
[rllib] Better example rnn envs ( #5300 )
2019-07-28 14:07:18 -07:00
Qing Wang
1465a30ea9
Fix releasing CPUs incorrectly when actor creation task blocked. ( #5271 )
...
* Fix
* Remove useless log
* Address
* Fix typo
* sleep
2019-07-28 15:46:17 +08:00
Richard Liaw
5ea859dc73
[sgd] hotfix example failure ( #5297 )
...
* hotfix
* Update train_example.py
2019-07-27 18:13:22 -07:00
Eric Liang
6f2c5b2819
Revert "[autoscaler] Clean up error messages on setup failure ( #5210 )" ( #5299 )
...
This reverts commit 7fc15dbf7f
.
2019-07-27 16:53:47 -07:00
lanlin
341dbf6c45
[tune] support nested dictionaries for CSVLogger ( #5295 )
2019-07-27 14:44:34 -07:00
Richard Liaw
b4823d63c6
[autoscaler] Local YAML readability ( #5290 )
2019-07-27 12:51:50 -07:00
LorenzoCevolani
10cbcced7e
Correctly setting the input to Train ( #3853 )
...
In the ResNetTrainActor class, the data are now exactly build using the Train flag for the cifar_input script.
2019-07-27 11:08:35 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory ( #5277 )
2019-07-27 02:08:16 -07:00
Richard Liaw
9c00616cdc
Retry and exception for hang on memory store full ( #5143 )
2019-07-27 01:20:13 -07:00
Richard Liaw
5e15b36d6e
[tune] experiment_analysis split to Analysis ( #5115 )
2019-07-27 01:10:52 -07:00
Richard Liaw
7e715520e5
[sgd] Example for Training ( #5292 )
2019-07-27 01:10:25 -07:00
Daniel Edgecumbe
06fec63c87
[autoscaler] Add a 'request_cores' function for manual autoscaling ( #4754 )
2019-07-26 17:14:45 -07:00
lanlin
d9e81da3b8
[tune] configurable maximum length of trial identifier ( #5287 )
2019-07-26 17:09:54 -07:00
Hao Chen
6f737e6a50
Add CODEOWNERS file ( #5259 )
2019-07-26 12:40:07 +08:00