ray/rllib/algorithms/pg/README.md

Ignoring revisions in .git-blame-ignore-revs. Click here to bypass and see the normal blame view.

9 lines
311 B
Markdown
Raw Normal View History

Policy Gradient (PG)
====================
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
**[Detailed Documentation](https://docs.ray.io/en/master/rllib-algorithms.html#pg)**
**[Implementation](https://github.com/ray-project/ray/blob/master/rllib/algorithms/pg/pg.py)**