hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-11 13:46:40 -04:00

2020-11-10 10:53:28 -08:00

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
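
As a rough illustration of what "vanilla policy gradient" refers to, the sketch below computes the REINFORCE-style surrogate loss for one episode in plain Python: the negative log-probability of each action taken, weighted by the discounted return from that step onward. The function names (`discounted_returns`, `softmax`, `pg_loss`) and the default `gamma` are assumptions made for this example; they are not this repository's API, and the actual implementation builds the loss with TensorFlow or PyTorch tensors instead.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Reward-to-go for each step: G_t = r_t + gamma * G_{t+1}."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each step can reuse the next step's return.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pg_loss(logits_per_step, actions, rewards, gamma=0.99):
    """Mean of -log pi(a_t | s_t) * G_t over one episode (hypothetical helper)."""
    returns = discounted_returns(rewards, gamma)
    total = 0.0
    for logits, action, ret in zip(logits_per_step, actions, returns):
        log_prob = math.log(softmax(logits)[action])
        total += -log_prob * ret
    return total / len(actions)


if __name__ == "__main__":
    # Two steps, two actions, untrained (uniform) policy logits.
    print(pg_loss([[0.0, 0.0], [0.0, 0.0]], actions=[0, 1], rewards=[1.0, 1.0]))
```

Minimizing this quantity with respect to the policy parameters raises the probability of actions that were followed by high returns. In a TensorFlow or PyTorch version, the framework's automatic differentiation would supply the gradient.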