mirror of
https://github.com/vale981/ray
synced 2025-03-08 19:41:38 -05:00
The DDPPO LR scheduler test is broken: the learner_info_dictionary returned by the training iteration function does not consistently include learner info for every training iteration, but the test expects it to. We'll need to fix the test and then re-merge. Reverts #23906 |
||
---|---|---|
test_appo.py | ||
test_ddppo.py | ||
test_ppo.py |