Mirror of https://github.com/vale981/ray (synced 2025-03-07 02:51:39 -05:00)
Revert "Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284)"
This reverts commit
| Name |
|---|
| tests |
| `__init__.py` |
| `default_config.py` |
| `pg.py` |
| `pg_tf_policy.py` |
| `pg_torch_policy.py` |
| `README.md` |
| `utils.py` |
Policy Gradient (PG)
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
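As a rough illustration of what "vanilla policy gradient" usually means, the sketch below computes discounted returns for one episode and the loss `-mean(log pi(a_t | s_t) * G_t)` in plain Python. It is not the code in `pg_tf_policy.py` or `pg_torch_policy.py`; the function names `discounted_returns`, `log_softmax`, and `pg_loss`, the discrete-action assumption, and the default `gamma` are all hypothetical choices made for this example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return-to-go G_t = r_t + gamma * G_{t+1} for a single episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def log_softmax(logits):
    """Numerically stable log-probabilities for one set of action logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def pg_loss(logits_per_step, actions, rewards, gamma=0.99):
    """Vanilla policy gradient loss: -mean(log pi(a_t | s_t) * G_t).

    Assumes discrete actions and no baseline or critic (hypothetical setup).
    """
    returns = discounted_returns(rewards, gamma)
    total = 0.0
    for logits, action, g in zip(logits_per_step, actions, returns):
        total += log_softmax(logits)[action] * g
    return -total / len(rewards)


if __name__ == "__main__":
    # Three steps, two actions, uniform policy (all logits equal).
    logits = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
    print(pg_loss(logits, actions=[0, 1, 0], rewards=[1.0, 1.0, 1.0], gamma=0.5))
```

In a real TensorFlow or PyTorch policy the same quantity would be built from framework tensors so that gradients flow back into the network that produces the logits; the pure-Python version only shows the arithmetic.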