Data filtering in DESeq2
1
0
Entering edit mode
@sallybadawi-16174
Last seen 5.1 years ago

Hello again,

Actually, we know that you havent recommended data filtration before running DESeq function, claiming that it only affects the speed of the function running. Interestingly, when setting a filtration strategy based on the percentage of samples having zero read counts in our data, we found that indeed the homogeneity of the data, the distribution and the normalization have been improved. The relation between the removed genes post-filtration proceeded for analysis and the number of DEG obtained wasnt linear, a peak of DEG was obtained post 71% filtration and then decreased. We see that this strategy has at least removed the experimental error coming from the low count genes that are at the threshold of detection in mRNA-seq. I would like to know what do you suggest and how can we explain these results, Is it really better to proceed without filtering the data?

Thank you

deseq2 data filtering • 650 views
ADD COMMENT
0
Entering edit mode
@mikelove
Last seen 3 hours ago
United States

It depends on the data of course. I never claimed the filtering only affects the speed, but I said that this and the reduced memory size of the object make some pre-filtering useful for most datasets.

If you have found that pre-filtering provides better results on your dataset, that is of course fine to perform before running DESeq().

ADD COMMENT

Login before adding your answer.

Traffic: 837 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6