hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-12 14:16:39 -04:00

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
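
To make "vanilla policy gradient" concrete, the sketch below computes the standard REINFORCE-style surrogate loss for a single episode in plain Python. It is only an illustration and is not the code in this directory: the function names (`discounted_returns`, `softmax`, `pg_loss`), the discrete softmax policy, and the default `gamma=0.99` are all assumptions made for the example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return-to-go G_t = r_t + gamma * G_{t+1} for one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each step reuses the next step's return.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pg_loss(logits_per_step, actions, rewards, gamma=0.99):
    """Surrogate loss -mean_t[log pi(a_t | s_t) * G_t] (hypothetical helper).

    Minimizing this with respect to the policy parameters follows the
    vanilla policy gradient estimate, with no baseline or critic.
    """
    returns = discounted_returns(rewards, gamma)
    total = 0.0
    for logits, action, g in zip(logits_per_step, actions, returns):
        log_prob = math.log(softmax(logits)[action])
        total += log_prob * g
    return -total / len(rewards)


if __name__ == "__main__":
    # Two-step episode with two discrete actions and uniform logits.
    loss = pg_loss([[0.0, 0.0], [0.0, 0.0]], [0, 1], [1.0, 1.0], gamma=0.5)
    print(loss)
```

In a TensorFlow or PyTorch implementation the same quantity would be built from framework tensors so that automatic differentiation can supply the gradient; the arithmetic above only shows what is being differentiated.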