* use master link * remae * revert non-ray * more * mre
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
Detailed Documentation
Implementation