daiyaanarfeen
8f6d73a93a
[sgd] Extend distributed pytorch functionality ( #5675 )
...
* raysgd
* apply fn
* double quotes
* removed duplicate TimerStat
* removed duplicate find_free_port
* imports in pytorch_trainer
* init doc
* ray.experimental
* remove resize example
* resnet example
* cifar
* Fix up after kwargs
* data_dir and dataloader_workers args
* formatting
* loss
* init
* update code
* lint
* smoketest
* better_configs
* fix
* fix
* fix
* train_loader
* fixdocs
* ok
* ok
* fix
* fix_update
* fix
* fix
* done
* fix
* fix
* fix
* small
* lint
* fix
* fix
* fix_test
* fix
* validate
* fix
* fi
2019-11-05 11:16:46 -08:00
Richard Liaw
e94bebb1de
[tune] Fix Jenkins tests ( #6028 )
2019-11-01 16:42:04 -07:00
Richard Liaw
48ba484640
[tune] Test TF2.0, TF1.14, TF1.12 Tensorboard support ( #5931 )
2019-10-18 13:50:42 -07:00
Richard Liaw
d52a4983af
Update TF documentation ( #5918 )
2019-10-16 01:31:27 -07:00
Richard Liaw
9f23620412
[tune] tf2.0 mnist example ( #5898 )
...
* tfmnistexample
* tfmnist
* add_to_ci
* format
* exampledownlaod
* fix
2019-10-15 22:25:01 -07:00
Richard Liaw
1650f7b174
[tune] Remove TF MNIST example + add TrialRunner hook to execut… ( #5868 )
...
* remove test
* add trial runner
* remvoerestore
* Remove other mnist examples
* tunetest
* revert
* v1
* Revert "v1"
This reverts commit c8bddaf2db7a8270c43c02021cac0e75df15ed20.
* Revert "revert"
This reverts commit b58f56884a0c288d3a6f997d149ab4d496ddd7a3.
* errors
* format
2019-10-13 20:33:56 -07:00
Eric Liang
04e997fe0d
Fix TF2 / rllib test ( #5846 )
2019-10-07 14:25:16 -07:00
Anthony Yu
b99cdf4e39
[tune] PBT + Memnn example ( #5723 )
...
* Add example file
* Move into train function
* Somewhat working example of MemNN, still has some failed trials
* Reorganize into a class
* Small fixes
* Iteration decrease and fix hyperparam_mutations
* Add example file
* Move into train function
* Somewhat working example of MemNN, still has some failed trials
* Reorganize into a class
* Small fixes
* Iteration decrease and fix hyperparam_mutations
* Some style edits
* Address PR changes without modifying learning rate
* Add configs and hyperparameter mutations
* Add tune test
* Modify import locations
* Some parameter changes for testing
* Update memnn example
* Add tensorboard support and address PR comment
* Final changes
* lint
* generator
2019-10-05 09:22:37 -07:00
Edward Oakes
443feb75f0
Fix test ( #5810 )
2019-09-30 19:39:53 -07:00
Richard Liaw
baf85c6665
[tune/sgd] Fix Jenkins ( #5765 )
2019-09-27 09:59:08 -07:00
Richard Liaw
10f21fa313
[docs] Convert Examples to Gallery ( #5414 )
2019-09-24 15:46:56 -07:00
Richard Liaw
e00071721a
[tune] tf2.0 testing and supporting callables ( #5738 )
2019-09-21 17:01:14 -07:00
jichan3751
1711e202a3
[training] Tensorflow interface for MultiNode SGD ( #5440 )
2019-09-03 15:35:42 -07:00
Richard Liaw
411f30c125
[docs] Second push of changes ( #5391 )
2019-08-28 17:54:15 -07:00
Eric Liang
97ccd75952
[rllib] Enable object store memory limit by default ( #5534 )
2019-08-26 01:37:28 -07:00
gehring
b520f6141e
[rllib] Adds eager support with a generic TFEagerPolicy
class ( #5436 )
2019-08-23 14:21:11 +08:00
Richard Liaw
cdc9227f1b
[tune] ASHA xgboost and lightgbm examples ( #5500 )
2019-08-22 10:37:59 -07:00
Robert Nishihara
851c5b2dae
Add a script for benchmarking performance for Ray developers. ( #5472 )
2019-08-19 23:41:23 -07:00
Richard Liaw
d7b309223b
[tune] MLFlow Logger ( #5438 )
2019-08-14 15:58:18 -07:00
Lisa Dunlap
b7d0733362
[tune] Implement BOHB ( #5382 )
2019-08-13 12:32:07 -07:00
Eric Liang
a1d2e17623
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
jichan3751
de95117e96
[sgd] Tune interface for Pytorch MultiNode SGD ( #5350 )
2019-08-10 13:51:44 -07:00
Simon Mo
18f1e904de
Bump 0.8.0.dev2 -> 0.8.0.dev3 ( #5409 )
2019-08-09 11:37:19 -07:00
Eric Liang
592f313210
[rllib] Centralized critic / PPO example on TwoStepGame ( #5392 )
2019-08-08 14:03:28 -07:00
Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Richard Liaw
1eaa57c98f
[tune] Distributed example + walkthrough ( #5157 )
2019-08-02 09:17:20 -07:00
Eric Liang
3bdd114282
[rllib] Better example rnn envs ( #5300 )
2019-07-28 14:07:18 -07:00
Eric Liang
a62c5f40f6
[rllib] Document ModelV2 and clean up the models/ directory ( #5277 )
2019-07-27 02:08:16 -07:00
Richard Liaw
7e715520e5
[sgd] Example for Training ( #5292 )
2019-07-27 01:10:25 -07:00
Eric Liang
f9043cc49a
[rllib] Remove experimental eager support
2019-07-21 12:27:17 -07:00
Jones Wong
0af07bd493
Enable seeding actors for reproducible experiments ( #5197 )
...
* enable graph-level worker-specific seed
* lint checked
* revised according to eric's suggestions
* revised accordingly and added a test case
* formated
* Update test_reproducibility.py
* Update trainer.py
* Update rollout_worker.py
* Update run_rllib_tests.sh
* Update worker_set.py
2019-07-17 23:31:34 -07:00
Richard Liaw
b6509f46b0
Update wheels to 0.8.0dev2 ( #5186 )
2019-07-12 17:27:03 -07:00
Richard Liaw
0b540ab492
[tune] Test example checkpointing ( #4728 )
2019-07-10 01:58:26 -07:00
Eric Liang
34d054ff19
[rllib] ModelV2 API ( #4926 )
2019-07-03 15:59:47 -07:00
Richard Liaw
b1827d5fbe
[tune] Update MNIST Example ( #4991 )
2019-06-25 22:50:15 -07:00
Richard Liaw
bd8aceb896
[ci] Change Jenkins to py3 ( #5022 )
...
* conda3
* integration
* add nevergrad, remotedata
* pytest 0.3.1
* otherdockers
* setup
* tune
2019-06-24 21:50:37 -07:00
Eric Liang
9e328fbe6f
[rllib] Add docs on how to use TF eager execution ( #4927 )
2019-06-07 16:42:37 -07:00
Robert Nishihara
c3f8fc1c44
Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. ( #4941 )
2019-06-06 17:22:45 -07:00
Eric Liang
7501ee51db
[rllib] Rename PolicyEvaluator => RolloutWorker ( #4820 )
2019-06-03 06:49:24 +08:00
Peter Schafhalter
c2ade075a3
[sgd] Distributed Training via PyTorch ( #4797 )
...
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang
1c073e92e4
[rllib] Fix documentation on custom policies ( #4910 )
...
* wip
* add docs
* lint
* todo sections
* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang
d7be5a5d36
[rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO
2019-05-27 17:24:45 -07:00
Devin Petersohn
a7d01aba9b
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. ( #4847 )
2019-05-24 16:49:13 -07:00
Eric Liang
351753aae5
[rllib] Remove dependency on TensorFlow ( #4764 )
...
* remove hard tf dep
* add test
* comment fix
* fix test
2019-05-10 20:36:18 -07:00
Devin Petersohn
edb8465910
[ray-core] Initial addition of performance integration testing files ( #4325 )
2019-05-08 13:40:54 -07:00
Eric Liang
ce66a552bf
Move large mem test to end ( #4664 )
2019-04-19 11:43:22 -07:00
Eric Liang
3fd9dea721
[rllib] Fix tune.run(Agent class) ( #4630 )
...
* update
* Update __init__.py
2019-04-15 09:12:23 -07:00
cfan
bb207a205b
[rllib] Support torch device and distributions. ( #4553 )
2019-04-12 11:39:14 -07:00
Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF ( #4554 )
2019-04-09 00:36:49 -07:00