Eric Liang
|
7e3e4cd321
|
[rllib] Execution plan API documentation (#10000)
* wip
* updte
* comments
|
2020-08-11 23:58:41 -07:00 |
|
Eric Liang
|
4b62a888cc
|
[rllib] Remove deprecated policy optimizer package. (#9262)
|
2020-07-02 14:39:40 -07:00 |
|
Eric Liang
|
9a83908c46
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
|
Richard Liaw
|
b506f87117
|
[tune] New Doc edits, add Concepts page (#8083)
Co-Authored-By: Sven Mika <sven@anyscale.io>
|
2020-04-25 18:25:56 -07:00 |
|
hubcity
|
3d0a8662b3
|
#7246 - Fixing broken links (#7247)
* #7246 - Fixing broken links
* Apply suggestions from code review
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
|
2020-03-25 21:46:13 -07:00 |
|
Eric Liang
|
dd70720578
|
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
* bulk rename
* deprecation warn
* update doc
* update fig
* line length
* rename
* make pytest comptaible
* fix test
* fi sys
* rename
* wip
* fix more
* lint
* update svg
* comments
* lint
* fix use of batch steps
|
2020-03-14 12:05:04 -07:00 |
|
Yutai Zhou
|
9b6794cbb0
|
[rllib] updated policy definition link (#6989)
|
2020-01-31 16:22:11 -08:00 |
|
Sven Mika
|
c957ed58ed
|
[RLlib] Implement PPO torch version. (#6826)
|
2020-01-20 23:06:50 -08:00 |
|
Sven Mika
|
e6227082bd
|
[RLlib] Add torch flag to train.py (#6807)
|
2020-01-17 18:48:44 -08:00 |
|
gehring
|
8903bcd0c3
|
[rllib] Tracing for eager tensorflow policies with tf.function (#5705)
* Added tracing of eager policies with `tf.function`
* lint
* add config option
* add docs
* wip
* tracing now works with a3c
* typo
* none
* file doc
* returns
* syntax error
* syntax error
|
2019-09-17 01:44:20 -07:00 |
|
Richard Liaw
|
34f6d2fc5c
|
[tune] Update trainable docs and support hparams (#5558)
|
2019-09-04 12:44:42 -07:00 |
|
gehring
|
b520f6141e
|
[rllib] Adds eager support with a generic TFEagerPolicy class (#5436)
|
2019-08-23 14:21:11 +08:00 |
|
Eric Liang
|
a1d2e17623
|
[rllib] Autoregressive action distributions (#5304)
|
2019-08-10 14:05:12 -07:00 |
|
Eric Liang
|
5d7afe8092
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
|
Richard Liaw
|
1eaa57c98f
|
[tune] Distributed example + walkthrough (#5157)
|
2019-08-02 09:17:20 -07:00 |
|
Eric Liang
|
20450a4e82
|
[rllib] Add rock paper scissors multi-agent example (#5336)
|
2019-08-01 13:03:59 -07:00 |
|
Eric Liang
|
a62c5f40f6
|
[rllib] Document ModelV2 and clean up the models/ directory (#5277)
|
2019-07-27 02:08:16 -07:00 |
|
Eric Liang
|
f9043cc49a
|
[rllib] Remove experimental eager support
|
2019-07-21 12:27:17 -07:00 |
|
Eric Liang
|
047f4ccd61
|
[rllib] Fix rollout.py with tuple action space (#5201)
* fix it
* update doc too
* fix rollout
|
2019-07-16 10:52:35 -07:00 |
|
Eric Liang
|
34d054ff19
|
[rllib] ModelV2 API (#4926)
|
2019-07-03 15:59:47 -07:00 |
|
Eric Liang
|
9e328fbe6f
|
[rllib] Add docs on how to use TF eager execution (#4927)
|
2019-06-07 16:42:37 -07:00 |
|
Eric Liang
|
7501ee51db
|
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)
|
2019-06-03 06:49:24 +08:00 |
|
Eric Liang
|
9aa1cd613d
|
[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894)
* fix torch extra out
* preserve setitem
* fix docs
|
2019-06-01 16:58:49 +08:00 |
|
Eric Liang
|
1c073e92e4
|
[rllib] Fix documentation on custom policies (#4910)
* wip
* add docs
* lint
* todo sections
* fix doc
|
2019-06-01 16:13:21 +08:00 |
|
Eric Liang
|
a45c61e19b
|
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821)
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
|
2019-05-27 14:17:32 -07:00 |
|
Eric Liang
|
02583a8598
|
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
|
2019-05-20 16:46:05 -07:00 |
|
Eric Liang
|
71b2dec3b4
|
[rllib] Fix bounds of space returned by preprocessor.observation_space (#4736)
|
2019-05-05 18:25:38 -07:00 |
|
Eric Liang
|
6848dfd179
|
[rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586)
|
2019-04-17 20:30:03 -04:00 |
|
Eric Liang
|
6e7680bf21
|
[rllib] Clean up concepts documentation and policy optimizer creation (#4592)
|
2019-04-12 21:03:26 -07:00 |
|
Eric Liang
|
59901a88a0
|
[rllib] Native support for Dict and Tuple spaces; fix Tuple action spaces; add prev a, r to LSTM (#3051)
|
2018-10-20 15:21:22 -07:00 |
|
Sergey Kolesnikov
|
05490b8cb9
|
[rllib] dqn/ddpg policy customization (#2445)
* dqn policy update - more customization
* docs for custom DQN graph
* Update rllib-training.rst
* Update rllib-models.rst
* Update rllib.rst
* Update rllib-training.rst
* Update rllib-concepts.rst
* yapf codestyle
|
2018-07-22 14:47:14 -07:00 |
|
Eric Liang
|
b316afeb43
|
[rllib] Add debug info back to PPO and fix optimizer compatibility (#2366)
|
2018-07-12 19:22:46 +02:00 |
|
Eric Liang
|
4ef9d15315
|
[rllib] Add concepts section of docs (#2373)
This fills in the rllib concepts documentation.
|
2018-07-08 18:46:52 -07:00 |
|