ray/rllib/agents/ppo/tests at 0ddbce651866d686681915d418509353b4ad29e8 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

History

Avnish Narayan 0ddbce6518 Revert "[RLlib] DD-PPO training iteration fn (#23906 )" (#24030 ) The DDPPO LR scheduler test is broken because the learner_info_dictionary that is returned by the training iteration function does not consistently return a learner info for every training iteration, but the test expects that it does. We'll need to fix the test then re-merge Reverts #23906		2022-04-19 16:43:57 -07:00
..
test_appo.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
test_ddppo.py	Revert "[RLlib] DD-PPO training iteration fn (#23906 )" (#24030 )	2022-04-19 16:43:57 -07:00
test_ppo.py	[RLlib] POC: Config objects instead of dicts (PPO only). (#23491 )	2022-03-31 18:26:12 +02:00