mirror of
https://github.com/vale981/ray
synced 2025-03-07 02:51:39 -05:00
307 B
307 B
Policy Gradient (PG)
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.