2020-09-02 14:03:01 +02:00
|
|
|
Proximal Policy Optimization (PPO)
|
|
|
|
==================================
|
|
|
|
|
|
|
|
Implementations of:
|
|
|
|
|
|
|
|
1) Proximal Policy Optimization (PPO).
|
|
|
|
|
2020-09-19 03:30:45 -04:00
|
|
|
**[Detailed Documentation](https://docs.ray.io/en/master/rllib-algorithms.html#ppo)**
|
2020-09-02 14:03:01 +02:00
|
|
|
|
|
|
|
**[Implementation](https://github.com/ray-project/ray/blob/master/rllib/agents/ppo/ppo.py)**
|
|
|
|
|
|
|
|
2) Asynchronous Proximal Policy Optimization (APPO).
|
|
|
|
|
2020-09-19 03:30:45 -04:00
|
|
|
**[Detailed Documentation](https://docs.ray.io/en/master/rllib-algorithms.html#appo)**
|
2020-09-02 14:03:01 +02:00
|
|
|
|
|
|
|
**[Implementation](https://github.com/ray-project/ray/blob/master/rllib/agents/ppo/appo.py)**
|
|
|
|
|
|
|
|
3) Decentralized Distributed Proximal Policy Optimization (DDPPO)
|
|
|
|
|
2020-09-19 03:30:45 -04:00
|
|
|
**[Detailed Documentation](https://docs.ray.io/en/master/rllib-algorithms.html#decentralized-distributed-proximal-policy-optimization-dd-ppo)**
|
2020-09-02 14:03:01 +02:00
|
|
|
|
|
|
|
**[Implementation](https://github.com/ray-project/ray/blob/master/rllib/agents/ppo/ddppo.py)**
|
|
|
|
|