Question

salmon quantification - identical transcripts in fasta file


dmr210 ▴ 30

@dmr210-12497


Hi,

I am using Salmon to quantify transcript expression from RNA-seq data.

I am using the Ensembl annotation, and as I am interested in non-coding RNA (lncRNA in particular), I merged the "cdna.all" and the "ncrna" fasta files (see ftp://ftp.ensembl.org/pub/release-87/fasta/mus_musculus/).

After looking at these two transcriptomes in more detail, I found that 20 transcripts are present in both files, i.e. they have the same ID.

My question is relatively silly... but I wasn't able to answer it based on the documentation:

Would Salmon get 'confused' by this and consider the reads as ambiguous in some way, or would it 'notice' that the ID is identical and 'ignore' the repetition?
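
Independently of how Salmon treats repeated IDs, the overlap can be checked and removed before the index is built. Below is a minimal sketch, assuming the transcript ID is the first whitespace-delimited token of each FASTA header. The function names (`read_fasta`, `merge_unique`) and the toy records are hypothetical and only for illustration; this is not part of Salmon.

```python
from typing import Dict, Iterable, Iterator, List, Tuple


def _record_id(header: str) -> str:
    """Return the first whitespace-delimited token after '>' (assumed to be the ID)."""
    fields = header[1:].split()
    return fields[0] if fields else ""


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str, str]]:
    """Yield (transcript_id, header, sequence) for each FASTA record."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if line.startswith(">"):
            if header is not None:
                yield _record_id(header), header, "".join(chunks)
            header, chunks = line, []
        elif line and header is not None:
            chunks.append(line)
    if header is not None:
        yield _record_id(header), header, "".join(chunks)


def merge_unique(
    *sources: Iterable[str],
) -> Tuple[Dict[str, Tuple[str, str]], List[str]]:
    """Merge FASTA sources, keeping the first record per ID.

    Returns the kept records (ID -> (header, sequence)) and the list of
    IDs that were seen again and skipped.
    """
    kept: Dict[str, Tuple[str, str]] = {}
    repeated: List[str] = []
    for source in sources:
        for tid, header, seq in read_fasta(source):
            if tid in kept:
                repeated.append(tid)
            else:
                kept[tid] = (header, seq)
    return kept, repeated


# Toy stand-ins for the "cdna.all" and "ncrna" files (hypothetical IDs).
cdna = [">TX_A.1 cdna", "ACGT", "ACGT", ">TX_B.1 cdna", "GGCC"]
ncrna = [">TX_B.1 ncrna", "GGCC", ">TX_C.1 ncrna", "TTAA"]

kept, repeated = merge_unique(cdna, ncrna)
print("repeated IDs:", repeated)
for header, seq in kept.values():
    print(header)
    print(seq)
```

With real data, open file handles for the two decompressed FASTA files could be passed in place of the toy lists, and the printed records redirected to a merged file.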

Thanks!

salmon • 1.2k views

updated 7.0 years ago by James W. MacDonald 65k • written 7.0 years ago by dmr210 ▴ 30

score 0 · Answer 1 · 2017-04-03


James W. MacDonald 65k

@james-w-macdonald-5106


United States

This support site is intended for Bioconductor packages. Salmon isn't a Bioconductor package (it's not even an R package!), so you are in the wrong place. You should ask the developers, or look at their FAQ.

written 7.0 years ago by James W. MacDonald 65k