hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

Fork 0

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Commit graph

Author	SHA1	Message	Date
Eric Liang	995ac24a2c	[rllib] clarify train batch size for PPO (#2793 ) It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.	2018-09-05 12:06:13 -07:00

Author

SHA1

Message

Date

Eric Liang

995ac24a2c

[rllib] clarify train batch size for PPO (#2793 )

It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.

2018-09-05 12:06:13 -07:00

1 commit