I am running DESeq2 comparing two cell types and exploring differential expression in a feature set with a bit more than 3 million features.
I am getting padj=1 or NA as the adjusted p-value.
Yet if I remove rows with the total sum less than 30 for example, the answer changes a lot, and I get many more adjusted p-values.
Does this have to do with the sheer number of features? Any suggestions? I thought that the independent filtering would allow me to bypass removing rows manually and that it would find the threshold with most power, but maybe so many features are problematic for this.
Any ideas?
Thank you very much for your response!! It helps a lot. I am pouring through your documents so thanks very much for providing those and for this very helpful answer.