Eric Liang
|
9e328fbe6f
|
[rllib] Add docs on how to use TF eager execution (#4927)
|
2019-06-07 16:42:37 -07:00 |
|
Eric Liang
|
7501ee51db
|
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)
|
2019-06-03 06:49:24 +08:00 |
|
Eric Liang
|
9aa1cd613d
|
[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894)
* fix torch extra out
* preserve setitem
* fix docs
|
2019-06-01 16:58:49 +08:00 |
|
Eric Liang
|
1c073e92e4
|
[rllib] Fix documentation on custom policies (#4910)
* wip
* add docs
* lint
* todo sections
* fix doc
|
2019-06-01 16:13:21 +08:00 |
|
Eric Liang
|
a45c61e19b
|
[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821)
* wip
* fix index
* fix bugs
* todo
* add imports
* note on get ph
* note on get ph
* rename to building custom algs
* add rnn state info
|
2019-05-27 14:17:32 -07:00 |
|
Eric Liang
|
02583a8598
|
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
|
2019-05-20 16:46:05 -07:00 |
|
Eric Liang
|
71b2dec3b4
|
[rllib] Fix bounds of space returned by preprocessor.observation_space (#4736)
|
2019-05-05 18:25:38 -07:00 |
|
Eric Liang
|
6848dfd179
|
[rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586)
|
2019-04-17 20:30:03 -04:00 |
|
Eric Liang
|
6e7680bf21
|
[rllib] Clean up concepts documentation and policy optimizer creation (#4592)
|
2019-04-12 21:03:26 -07:00 |
|
Eric Liang
|
59901a88a0
|
[rllib] Native support for Dict and Tuple spaces; fix Tuple action spaces; add prev a, r to LSTM (#3051)
|
2018-10-20 15:21:22 -07:00 |
|
Sergey Kolesnikov
|
05490b8cb9
|
[rllib] dqn/ddpg policy customization (#2445)
* dqn policy update - more customization
* docs for custom DQN graph
* Update rllib-training.rst
* Update rllib-models.rst
* Update rllib.rst
* Update rllib-training.rst
* Update rllib-concepts.rst
* yapf codestyle
|
2018-07-22 14:47:14 -07:00 |
|
Eric Liang
|
b316afeb43
|
[rllib] Add debug info back to PPO and fix optimizer compatibility (#2366)
|
2018-07-12 19:22:46 +02:00 |
|
Eric Liang
|
4ef9d15315
|
[rllib] Add concepts section of docs (#2373)
This fills in the rllib concepts documentation.
|
2018-07-08 18:46:52 -07:00 |
|