Matrix with Salmon gene-level quantifications in DESeq2
luca.s ▴ 50

Hi all. I need to analyze RNA-seq data that are available only as a matrix of Salmon quantifications summarized at the gene level (non-integer values). Can this matrix be used as input for DESeq2, and what would be the best way to do so? A sample of the table is below (apologies for the formatting). Thank you for the help. Luca

| Gene  | Sample_A | Sample_B | Sample_C |
|-------|----------|----------|----------|
| A1BG  | 131.084  | 124.366  | 172.048  |
| A1CF  | 3        | 6        | 8.856    |
| A2M   | 24.212   | 36.87    | 14.873   |
| A2ML1 | 1212.969 | 40.953   | 53.881   |

@james-w-macdonald-5106

Use the tximport package.


Thanks James for the quick answer, but does the tximport package also work for importing a single matrix that already contains the gene-level quantifications for all samples? Sorry for the naive question, but I understood from the vignette that it handles only the original Salmon output.


I feel like this is definitely covered in the vignette. We show importing multiple samples to gene level.

ATpoint ★ 4.0k

If you have quantifications at the transcript level and want to go to the gene level, then use the methods described in the tximport vignette.

If you already have a matrix with gene-level counts, then load it into DESeq2 with DESeq2::DESeqDataSetFromMatrix, which requires that rows are genes and columns are samples.
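A minimal sketch of the matrix route, using the four genes from the table in the question. DESeqDataSetFromMatrix only accepts integer counts, so non-integer estimated counts need to be rounded first. The `condition` grouping below is made up for illustration, since the real experimental design is not given in this thread.

```r
library(DESeq2)

# Gene-level estimated counts copied from the sample in the question
# (rows = genes, columns = samples).
cts <- matrix(
  c(131.084,  124.366, 172.048,
    3,        6,       8.856,
    24.212,   36.87,   14.873,
    1212.969, 40.953,  53.881),
  nrow = 4, byrow = TRUE,
  dimnames = list(c("A1BG", "A1CF", "A2M", "A2ML1"),
                  c("Sample_A", "Sample_B", "Sample_C"))
)

# ASSUMPTION: hypothetical sample annotation, not taken from the thread.
# Row names must match the column names of the count matrix.
coldata <- data.frame(
  condition = factor(c("control", "treated", "treated")),
  row.names = colnames(cts)
)

# Round the estimated counts and store them as integers,
# because DESeqDataSetFromMatrix rejects non-integer values.
cts_int <- round(cts)
storage.mode(cts_int) <- "integer"

dds <- DESeqDataSetFromMatrix(countData = cts_int,
                              colData   = coldata,
                              design    = ~ condition)
```

Note that this route does not carry the average transcript length information that DESeqDataSetFromTximport would pass along as an offset. Whether that matters depends on how the matrix was produced: according to the tximport vignette, counts generated with `countsFromAbundance = "lengthScaledTPM"` can be used directly as a regular count matrix.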

Given that you seem to have the original Salmon quantifications, I would recommend not taking any custom approach but simply aggregating the Salmon transcript-level abundance estimates (that is, the quant.sf files) to the gene level with tximport, as described at this link:

https://bioconductor.org/packages/release/bioc/vignettes/tximport/inst/doc/tximport.html#Salmon

...and then port it over to DESeq2 with DESeq2::DESeqDataSetFromTximport. That can be done in very few lines of code, and you can be sure the data (e.g. the length estimates) will end up in the required slots of the dds object so that they are used correctly downstream.
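Those few lines could look roughly like the sketch below. The helper name `build_dds_from_salmon`, the directory layout (one sub-directory per sample containing `quant.sf`), the `condition` column, and the `tx2gene` table are all assumptions for illustration; `tximport()` and `DESeqDataSetFromTximport()` are the actual functions from the tximport and DESeq2 packages.

```r
library(tximport)
library(DESeq2)

# Hypothetical helper, not part of either package.
#   salmon_dir: directory with one sub-directory per sample, each holding quant.sf
#   samples:    data.frame with sample names as row names and a `condition` column
#   tx2gene:    two-column data.frame mapping transcript IDs to gene IDs
build_dds_from_salmon <- function(salmon_dir, samples, tx2gene) {
  files <- file.path(salmon_dir, rownames(samples), "quant.sf")
  names(files) <- rownames(samples)
  stopifnot(all(file.exists(files)))

  # Summarize transcript-level estimates to the gene level.
  txi <- tximport(files, type = "salmon", tx2gene = tx2gene)

  # Counts and average transcript lengths are placed in the dds object.
  DESeqDataSetFromTximport(txi, colData = samples, design = ~ condition)
}

# Example call (paths and tables are placeholders):
# samples <- data.frame(condition = factor(c("control", "treated", "treated")),
#                       row.names = c("Sample_A", "Sample_B", "Sample_C"))
# tx2gene <- read.csv("tx2gene.csv")
# dds <- build_dds_from_salmon("salmon_out", samples, tx2gene)
```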


Thanks all for dissecting the point, and sorry for the late feedback. Unfortunately, I only had the matrix of gene-level counts, so I went through DESeqDataSetFromMatrix. I hope this does not impact the results too much.


How would you have a gene-level matrix from salmon?


It was generated with tximport by a collaborator.
