ray/rllib/agents/pg/README.md at 66ea09989791b6b7fee860f6fa0002fd667032c7 - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 04:46:38 -04:00

Eric Liang 9b8218aabd

[docs] Move all /latest links to /master (#11897 )

* use master link

* remae

* revert non-ray

* more

* mre

2020-11-10 10:53:28 -08:00

8 lines

307 B

Markdown

Raw Blame History

 Policy Gradient (PG)
 ====================
 An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
 **[Detailed Documentation](https://docs.ray.io/en/master/rllib-algorithms.html#pg)**
 **[Implementation](https://github.com/ray-project/ray/blob/master/rllib/agents/pg/pg.py)**