hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Commit graph

Author

SHA1

Message

Date

gjoliver

99a0088233

[RLlib] Unify the way we create local replay buffer for all agents (#19627)

* [RLlib] Unify the way we create and use LocalReplayBuffer for all the agents.

This change
1. Get rid of the try...except clause when we call execution_plan(),
   and get rid of the Deprecation warning as a result.
2. Fix the execution_plan() call in Trainer._try_recover() too.
3. Most importantly, makes it much easier to create and use different types
   of local replay buffers for all our agents.
   E.g., allow us to easily create a reservoir sampling replay buffer for
   APPO agent for Riot in the near future.
* Introduce explicit configuration for replay buffer types.
* Fix is_training key error.
* actually deprecate buffer_size field.

2021-10-26 20:56:02 +02:00

Richard Liaw

a78a2263e5

[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896)

2021-07-22 10:55:07 -04:00

Pierre TASSEL

66605cfcbd

[RLLib] Random Parametric Trainer (#11366)

2020-11-04 11:12:51 +01:00

3 commits