hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-05 18:11:42 -05:00

History

Jun Gong b383d987d1 [RLlib] Fix a bunch of issues related to connectors. (#26510 )		2022-07-13 18:55:20 +02:00
..
tests	[RLlib] PG policy subclassing conversion. (#25288 )	2022-06-06 13:07:47 +02:00
__init__.py	[RLlib] Fix a bunch of issues related to connectors. (#26510 )	2022-07-13 18:55:20 +02:00
pg.py	[RLlib] Fix a bunch of issues related to connectors. (#26510 )	2022-07-13 18:55:20 +02:00
pg_tf_policy.py	[RLlib] Fix a bunch of issues related to connectors. (#26510 )	2022-07-13 18:55:20 +02:00
pg_torch_policy.py	[RLlib] PG policy subclassing conversion. (#25288 )	2022-06-06 13:07:47 +02:00
README.md	[RLlib] Fix broken links from agent -> algo conversion. (#25014 )	2022-05-20 11:37:11 +02:00
utils.py	Clean up docstyle in python modules and add LINT rule (#25272 )	2022-06-01 11:27:54 -07:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation