Hi,
We are currently using the DESeq2 R package for analysis of differential expression of small-RNAs over genes and we would very much appreciate if you could help us with a question regarding the treatment of multiple reads:
As I understand it, when dealing with mRNA-seq the consensus is to count the one location which received the highest alignment score (whether by the aligner scoring system or by "the rich get richer" methods and so on) or to neglect these reads completely.
While dealing with small-RNAs, however, the situation might be a bit different, since small-RNAs act in trans and have the potential of targeting each location they align to.
One option is to consider and count all the locations a read aligns to, but this will obviously affect the statistics greatly.
Another option is - as in mRNA seq - to count only one read or to neglect the multiple-aligned reads completely. This option, though, is not ideal too since large portions of the small-RNA reads are actually derived from repetitive elements and ignoring them will also affect and shift the results.
This leads me to the question itself: with the intention to use DESeq2 for differential expression analysis, which counting method would you recommend to use while dealing with small-RNA seq?
Thank you!
Leah