jichan3751
|
1711e202a3
|
[training] Tensorflow interface for MultiNode SGD (#5440)
|
2019-09-03 15:35:42 -07:00 |
|
Richard Liaw
|
411f30c125
|
[docs] Second push of changes (#5391)
|
2019-08-28 17:54:15 -07:00 |
|
Eric Liang
|
97ccd75952
|
[rllib] Enable object store memory limit by default (#5534)
|
2019-08-26 01:37:28 -07:00 |
|
gehring
|
b520f6141e
|
[rllib] Adds eager support with a generic TFEagerPolicy class (#5436)
|
2019-08-23 14:21:11 +08:00 |
|
Richard Liaw
|
cdc9227f1b
|
[tune] ASHA xgboost and lightgbm examples (#5500)
|
2019-08-22 10:37:59 -07:00 |
|
Robert Nishihara
|
851c5b2dae
|
Add a script for benchmarking performance for Ray developers. (#5472)
|
2019-08-19 23:41:23 -07:00 |
|
Richard Liaw
|
d7b309223b
|
[tune] MLFlow Logger (#5438)
|
2019-08-14 15:58:18 -07:00 |
|
Lisa Dunlap
|
b7d0733362
|
[tune] Implement BOHB (#5382)
|
2019-08-13 12:32:07 -07:00 |
|
Eric Liang
|
a1d2e17623
|
[rllib] Autoregressive action distributions (#5304)
|
2019-08-10 14:05:12 -07:00 |
|
jichan3751
|
de95117e96
|
[sgd] Tune interface for Pytorch MultiNode SGD (#5350)
|
2019-08-10 13:51:44 -07:00 |
|
Simon Mo
|
18f1e904de
|
Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409)
|
2019-08-09 11:37:19 -07:00 |
|
Eric Liang
|
592f313210
|
[rllib] Centralized critic / PPO example on TwoStepGame (#5392)
|
2019-08-08 14:03:28 -07:00 |
|
Wonseok Jeon
|
281829e712
|
MADDPG implementation in RLlib (#5348)
|
2019-08-06 16:22:06 -07:00 |
|
Eric Liang
|
5d7afe8092
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
|
Richard Liaw
|
1eaa57c98f
|
[tune] Distributed example + walkthrough (#5157)
|
2019-08-02 09:17:20 -07:00 |
|
Eric Liang
|
3bdd114282
|
[rllib] Better example rnn envs (#5300)
|
2019-07-28 14:07:18 -07:00 |
|
Eric Liang
|
a62c5f40f6
|
[rllib] Document ModelV2 and clean up the models/ directory (#5277)
|
2019-07-27 02:08:16 -07:00 |
|
Richard Liaw
|
7e715520e5
|
[sgd] Example for Training (#5292)
|
2019-07-27 01:10:25 -07:00 |
|
Eric Liang
|
f9043cc49a
|
[rllib] Remove experimental eager support
|
2019-07-21 12:27:17 -07:00 |
|
Jones Wong
|
0af07bd493
|
Enable seeding actors for reproducible experiments (#5197)
* enable graph-level worker-specific seed
* lint checked
* revised according to eric's suggestions
* revised accordingly and added a test case
* formated
* Update test_reproducibility.py
* Update trainer.py
* Update rollout_worker.py
* Update run_rllib_tests.sh
* Update worker_set.py
|
2019-07-17 23:31:34 -07:00 |
|
Richard Liaw
|
b6509f46b0
|
Update wheels to 0.8.0dev2 (#5186)
|
2019-07-12 17:27:03 -07:00 |
|
Richard Liaw
|
0b540ab492
|
[tune] Test example checkpointing (#4728)
|
2019-07-10 01:58:26 -07:00 |
|
Eric Liang
|
34d054ff19
|
[rllib] ModelV2 API (#4926)
|
2019-07-03 15:59:47 -07:00 |
|
Richard Liaw
|
b1827d5fbe
|
[tune] Update MNIST Example (#4991)
|
2019-06-25 22:50:15 -07:00 |
|
Richard Liaw
|
bd8aceb896
|
[ci] Change Jenkins to py3 (#5022)
* conda3
* integration
* add nevergrad, remotedata
* pytest 0.3.1
* otherdockers
* setup
* tune
|
2019-06-24 21:50:37 -07:00 |
|
Eric Liang
|
9e328fbe6f
|
[rllib] Add docs on how to use TF eager execution (#4927)
|
2019-06-07 16:42:37 -07:00 |
|
Robert Nishihara
|
c3f8fc1c44
|
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941)
|
2019-06-06 17:22:45 -07:00 |
|
Eric Liang
|
7501ee51db
|
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)
|
2019-06-03 06:49:24 +08:00 |
|
Peter Schafhalter
|
c2ade075a3
|
[sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
|
2019-06-01 21:39:22 -07:00 |
|
Eric Liang
|
1c073e92e4
|
[rllib] Fix documentation on custom policies (#4910)
* wip
* add docs
* lint
* todo sections
* fix doc
|
2019-06-01 16:13:21 +08:00 |
|
Eric Liang
|
d7be5a5d36
|
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO
|
2019-05-27 17:24:45 -07:00 |
|
Devin Petersohn
|
a7d01aba9b
|
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847)
|
2019-05-24 16:49:13 -07:00 |
|
Eric Liang
|
351753aae5
|
[rllib] Remove dependency on TensorFlow (#4764)
* remove hard tf dep
* add test
* comment fix
* fix test
|
2019-05-10 20:36:18 -07:00 |
|
Devin Petersohn
|
edb8465910
|
[ray-core] Initial addition of performance integration testing files (#4325)
|
2019-05-08 13:40:54 -07:00 |
|
Eric Liang
|
ce66a552bf
|
Move large mem test to end (#4664)
|
2019-04-19 11:43:22 -07:00 |
|
Eric Liang
|
3fd9dea721
|
[rllib] Fix tune.run(Agent class) (#4630)
* update
* Update __init__.py
|
2019-04-15 09:12:23 -07:00 |
|
cfan
|
bb207a205b
|
[rllib] Support torch device and distributions. (#4553)
|
2019-04-12 11:39:14 -07:00 |
|
Eric Liang
|
4f46d3e9bf
|
[rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554)
|
2019-04-09 00:36:49 -07:00 |
|
ctombumila37
|
7746d20d30
|
[rllib] ExternalMultiAgentEnv (#4200)
|
2019-04-06 19:58:14 -07:00 |
|
Eric Liang
|
0d94f3eeef
|
[rllib] Improve datapath throughput of IMPALA / APPO (#4324)
|
2019-03-31 12:25:52 -07:00 |
|
bjg2
|
77005d1814
|
[rllib] Make batch timeout for remote workers tunable (#4435)
|
2019-03-29 13:19:42 -07:00 |
|
Eric Liang
|
2ffe67c5c3
|
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs (#4478)
|
2019-03-29 12:44:23 -07:00 |
|
Eric Liang
|
8ee240f40e
|
[rllib] Use 64-byte aligned memory when concatenating arrays (#4408)
|
2019-03-25 23:56:51 -07:00 |
|
Eric Liang
|
57c1aeb427
|
[rllib] Use suppress_output instead of run_silent.sh script for tests (#4386)
* fix
* enable custom loss
* Update run_rllib_tests.sh
* enable tests
* fix action prob
* Update suppress_output
* fix example
* fix
|
2019-03-21 00:15:24 -07:00 |
|
Eric Liang
|
a45019d98c
|
[rllib] Add option to proceed even if some workers crashed (#4376)
|
2019-03-16 13:34:09 -07:00 |
|
Eric Liang
|
d5f4698305
|
[tune] Avoid scheduler blocking, add reuse_actors optimization (#4218)
|
2019-03-12 23:49:31 -07:00 |
|
Stefan Pantic
|
2202a81773
|
Fix multi discrete (#4338)
* Revert "Revert "[wingman -> rllib] IMPALA MultiDiscrete changes (#3967)" (#4332)"
This reverts commit 3c41cb9b60 .
* Fix a bug with log rhos for vtrace
* Reformat
* lint
|
2019-03-12 20:32:11 -07:00 |
|
Eric Liang
|
3c41cb9b60
|
Revert "[wingman -> rllib] IMPALA MultiDiscrete changes (#3967)" (#4332)
This reverts commit 962b17f567 .
|
2019-03-11 22:51:26 -07:00 |
|
Eric Liang
|
c7f74dbdc7
|
[rllib] Add async remote workers (#4253)
|
2019-03-08 15:39:48 -08:00 |
|
Robert Nishihara
|
fd2d8c2c06
|
Remove Jenkins backend tests and add new long running stress test. (#4288)
|
2019-03-08 15:29:39 -08:00 |
|