Eric Liang
|
02583a8598
|
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813.
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
|
2019-05-20 16:46:05 -07:00 |
|
Eric Liang
|
71b2dec3b4
|
[rllib] Fix bounds of space returned by preprocessor.observation_space (#4736)
|
2019-05-05 18:25:38 -07:00 |
|
Eric Liang
|
6848dfd179
|
[rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586)
|
2019-04-17 20:30:03 -04:00 |
|
Eric Liang
|
6e7680bf21
|
[rllib] Clean up concepts documentation and policy optimizer creation (#4592)
|
2019-04-12 21:03:26 -07:00 |
|
Eric Liang
|
59901a88a0
|
[rllib] Native support for Dict and Tuple spaces; fix Tuple action spaces; add prev a, r to LSTM (#3051)
|
2018-10-20 15:21:22 -07:00 |
|
Sergey Kolesnikov
|
05490b8cb9
|
[rllib] dqn/ddpg policy customization (#2445)
* dqn policy update - more customization
* docs for custom DQN graph
* Update rllib-training.rst
* Update rllib-models.rst
* Update rllib.rst
* Update rllib-training.rst
* Update rllib-concepts.rst
* yapf codestyle
|
2018-07-22 14:47:14 -07:00 |
|
Eric Liang
|
b316afeb43
|
[rllib] Add debug info back to PPO and fix optimizer compatibility (#2366)
|
2018-07-12 19:22:46 +02:00 |
|
Eric Liang
|
4ef9d15315
|
[rllib] Add concepts section of docs (#2373)
This fills in the rllib concepts documentation.
|
2018-07-08 18:46:52 -07:00 |
|