Policy Gradient (PG)
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
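
As a rough illustration of what "vanilla policy gradient" means, the sketch below computes the usual surrogate loss, `-mean(log pi(a_t | s_t) * R_t)`, for one episode with a discrete action space. It is written in plain Python and is not the code in `pg_tf_policy.py` or `pg_torch_policy.py`. The helper names (`discounted_returns`, `log_softmax`, `pg_loss`) and the use of discounted reward-to-go as `R_t` are assumptions made for this example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Discounted reward-to-go for every step of a single episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each step accumulates future rewards.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def log_softmax(logits):
    """Numerically stable log-softmax over a list of action logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def pg_loss(logits_per_step, actions, returns):
    """Surrogate loss -mean(log pi(a_t | s_t) * R_t) (hypothetical helper)."""
    total = 0.0
    for logits, action, ret in zip(logits_per_step, actions, returns):
        total += log_softmax(logits)[action] * ret
    return -total / len(actions)


# Toy episode: three steps, two actions, a uniform policy (equal logits).
returns = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
loss = pg_loss([[0.0, 0.0]] * 3, [0, 1, 0], returns)
print(returns, loss)
```

In a TensorFlow or PyTorch policy the same quantity would be built from framework tensors so that its gradient with respect to the policy parameters can be taken automatically. Minimizing it raises the log-probability of actions that were followed by high returns.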