I'm running a deferential expression pipeline that first calculates DE with DEseq2 and then looks for enticement in the DE genes. This pipeline seems somewhat common in the literature, so I'm sure it's fine.
While well represented in the literature, I'm wondering if this approach is not redundant? GOseq is supposed to prevent biases introduced by the greater power to detect DE in longer transcripts. However, since the variance stabilization approach of DEseq2 should already have effectively compensated for any gene length effects, it seems like the subsequent use of GOseq could result in more false
positives negatives by effectively subjecting longer genes to a second round of increased scrutiny (and possible devaluation rejection).
My feeling now is that if a gene makes it through DEseq2 and shows up as being DE, then it should remain in the GO enrichment analysis with no addition weighting based upon length.
EDIT: In my initial wording, I made some very poor word choices and have tried to amend them. My initial question was poorly formed and am hoping that it now is more to the point of what I am asking.