mirror of
https://github.com/vale981/ray
synced 2025-03-08 19:41:38 -05:00
9 lines
307 B
Markdown
9 lines
307 B
Markdown
![]() |
Policy Gradient (PG)
|
||
|
====================
|
||
|
|
||
|
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
|
||
|
|
||
|
**[Detailed Documentation](https://docs.ray.io/en/latest/rllib-algorithms.html#pg)**
|
||
|
|
||
|
**[Implementation](https://github.com/ray-project/ray/blob/master/rllib/agents/pg/pg.py)**
|