Question: Non-integer counts from tximport OK for edgeR / DESeq2?
1
gravatar for Jenny Drnevich
2.6 years ago by
Jenny Drnevich1.9k
United States
Jenny Drnevich1.9k wrote:

Hi all,

I'm starting to use the tximport package to pull in salmon output for statistical analysis and I just wanted to verify that it is fine to use the non-integer counts with edgeR and DESeq2. Years ago, we needed to round the decimal counts from e.g., cuffdiff, but the tximport vignette implied the non-integer values were fine because there was no mention of rounding. I searched the support site and found these: C: Can I feed TCGA normalized count data to EdgeR for differential gene expression and non-integer counts for edgeR, the first of which has Steve Lianoglou asking for a separate post to highlight the qualified yes answer in regards to non-integer counts for edgeR now. That's mainly what this post is doing, and also linking it to the tximport package as more people should now be using it to pull in counts and length offsets from kallisto/sailfish/salmon/RSEM.

BTW - thanks for the great tximport package and the F1000 article on how transcript-level estimates improve gene-level inferences: https://f1000research.com/articles/4-1521/v2!

Jenny

edger deseq2 tximport • 2.0k views
ADD COMMENTlink modified 2.6 years ago by Michael Love23k • written 2.6 years ago by Jenny Drnevich1.9k

Thanks for posting this and tying those two threads together! To be honest I completely forgot about that comment ... which is why I guess I was hoping we could "pin" it somewhere :-)

ADD REPLYlink written 2.6 years ago by Steve Lianoglou12k
Answer: Non-integer counts from tximport OK for edgeR / DESeq2?
2
gravatar for Michael Love
2.6 years ago by
Michael Love23k
United States
Michael Love23k wrote:

hi Jenny,

Yes the examples in the tximport vignette show the intended downstream usage.

edgeR has support for non-integer counts, and the tximport-to-DESeqDataSet constructor function just rounds the estimated non-integer counts to integers. I'm not worried about any loss of precision for inference of log fold change in this rounding, because fractions of counts are tiny compared to the sampling and biological variation on counts in RNA-seq.

ADD COMMENTlink written 2.6 years ago by Michael Love23k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 197 users visited in the last hour