ray/rllib/agents/ppo/tests
Avnish Narayan 0ddbce6518
Revert "[RLlib] DD-PPO training iteration fn (#23906)" (#24030)
The DDPPO LR scheduler test is broken because the learner_info_dictionary that is returned by the training iteration function does not consistently return a learner info for every training iteration, but the test expects that it does.

We'll need to fix the test then re-merge

Reverts #23906
2022-04-19 16:43:57 -07:00
..
test_appo.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
test_ddppo.py Revert "[RLlib] DD-PPO training iteration fn (#23906)" (#24030) 2022-04-19 16:43:57 -07:00
test_ppo.py [RLlib] POC: Config objects instead of dicts (PPO only). (#23491) 2022-03-31 18:26:12 +02:00