I think you are assuming that
classifyTestsP() adjusts for multiple testing across genes. Actually it adjusts only across contrasts, i.e., across rows rather than down columns. As the help says
"The adjustment for multiple testing is across the contrasts rather than the more usual control across genes."
You should use
decideTests() instead. It does something closer to what you are probably expecting.