Eric Liang
74abeab057
[rllib] Improve accessing model state docs ( #5656 )
...
* [rllib] better model docs
* fix
* s
2019-09-08 23:01:26 -07:00
Eric Liang
cf90394a09
[rllib] Fix TF2 import of EagerVariableStore ( #5625 )
2019-09-07 12:10:03 -07:00
Eric Liang
1455a19c85
Consolidate and clean up documentation ( #5645 )
2019-09-07 11:50:18 -07:00
Eric Liang
19bbf1eb4d
[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern ( #5626 )
2019-09-04 21:39:22 -07:00
Eric Liang
a101812b9f
Replace --redis-address with --address in test, docs, tune, rllib ( #5602 )
...
* wip
* add tests and tune
* add ci
* test fix
* lint
* fix tests
* wip
* sugar dep
2019-09-01 16:53:02 -07:00
Eric Liang
daf38c8723
[tune] Deprecate tune.function ( #5601 )
...
* remove tune function
* remove examples
* Update tune-usage.rst
2019-08-31 16:00:10 -07:00
Philipp Moritz
747daff2cb
Fix impala stress test ( #5596 )
2019-08-31 01:20:53 -07:00
Eric Liang
38231907f3
[rllib] Forgot to register param noise layer variables
2019-08-29 18:12:31 -07:00
Eric Liang
03a1b75852
[rllib] Fix some eager execution regressions with 1.13 ( #5537 )
...
* fix bugs with 1.13
* allow disable
2019-08-26 23:23:35 -07:00
Eric Liang
97ccd75952
[rllib] Enable object store memory limit by default ( #5534 )
2019-08-26 01:37:28 -07:00
gehring
b520f6141e
[rllib] Adds eager support with a generic TFEagerPolicy class ( #5436 )
2019-08-23 14:21:11 +08:00
Eric Liang
e2e30ca507
Ray, Tune, and RLlib support for memory, object_store_memory options ( #5226 )
2019-08-21 23:01:10 -07:00
jon-chuang
658e002cdf
[rllib] Add autoregressive KL ( #5469 )
2019-08-19 14:34:50 +08:00
Neil Lugovoy
1376f1ae60
[tune] Reporter crash fix ( #5426 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2019-08-13 14:10:22 -07:00
Eric Liang
79949fb8a0
[rllib] RLlib in 60 seconds documentation ( #5430 )
2019-08-12 17:39:02 -07:00
Eric Liang
cc86271cf8
[hotfix] fix Travis action dist test ( #5428 )
2019-08-10 17:59:54 -07:00
Eric Liang
a1d2e17623
[rllib] Autoregressive action distributions ( #5304 )
2019-08-10 14:05:12 -07:00
Eric Liang
8b6f0d3224
[rllib] Fix output API when lz4 not installed ( #5421 )
2019-08-10 13:53:27 -07:00
Eric Liang
df47bdf6c9
Allow address instead of redis_address ( #5412 )
...
* addr
* wip
* fix typo
* add to start
* switch to ray address for train
* say address
* disambiguate help
* comments 2
2019-08-10 00:18:41 -07:00
Eric Liang
592f313210
[rllib] Centralized critic / PPO example on TwoStepGame ( #5392 )
2019-08-08 14:03:28 -07:00
Wonseok Jeon
281829e712
MADDPG implementation in RLlib ( #5348 )
2019-08-06 16:22:06 -07:00
Matthew A. Wright
e3c9f7e83a
Custom action distributions ( #5164 )
...
* custom action dist wip
* Test case for custom action dist
* ActionDistribution.get_parameter_shape_for_action_space pattern
* Edit exception message to also suggest using a custom action distribution
* Clean up ModelCatalog.get_action_dist
* Pass model config to ActionDistribution constructors
* Update custom action distribution test case
* Name fix
* Autoformatter
* parameter shape static methods for torch distributions
* Fix docstring
* Generalize fake array for graph initialization
* Fix action dist constructors
* Correct parameter shape static methods for multicategorical and gaussian
* Make suggested changes to custom action dist's
* Correct instances of not passing model config to action dist
* Autoformatter
* fix tuple distribution constructor
* bugfix
2019-08-06 11:13:16 -07:00
Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00