Preferential sequencing of longer Genes in Illumina?
Entering edit mode
Last seen 9 months ago

I was reading this paper from 2010 :

It is mentioned that

One inherent bias of the illumina platform is the preferential sequencing of longer genes. Hence, longer genes are more likely declared as DE.

Is it true for the current Illumina platforms as well? And as a result, we observe low counts for some of the genes. For example, I am looking at the gene PYCR1, and after performing DEseq2 I have got good Log2foldchange (3.63) and Padj-value (1.95E-06), however, the basemean is about 23. Now, it is a known fact that in cancer this gene is upregulated, but the counts are not convincing. I am really confused about what to do here!

DESeq2 sequencing • 297 views
Entering edit mode
Last seen 4 hours ago
United States

Yes longer genes have higher counts typically. This is taken into account for gene set testing in goseq which can be ran downstream of DESeq2 for gene set results.

For per gene results, you don’t modify the standard pipeline. DESeq2 has been extensively benchmarked.


Login before adding your answer.

Traffic: 380 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6