ray/rllib/algorithms/pg
2022-07-13 18:55:20 +02:00
..
tests [RLlib] PG policy subclassing conversion. (#25288) 2022-06-06 13:07:47 +02:00
__init__.py [RLlib] Fix a bunch of issues related to connectors. (#26510) 2022-07-13 18:55:20 +02:00
pg.py [RLlib] Fix a bunch of issues related to connectors. (#26510) 2022-07-13 18:55:20 +02:00
pg_tf_policy.py [RLlib] Fix a bunch of issues related to connectors. (#26510) 2022-07-13 18:55:20 +02:00
pg_torch_policy.py [RLlib] PG policy subclassing conversion. (#25288) 2022-06-06 13:07:47 +02:00
README.md [RLlib] Fix broken links from agent -> algo conversion. (#25014) 2022-05-20 11:37:11 +02:00
utils.py Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation