Remove "no_feature" and "alignment_not_unique" counts before using EdgeR or Deseq2
1
0
Entering edit mode
cottrellka • 0
@cottrellka-18933
Last seen 6.0 years ago

This is my first time using EdgeR or DeSeq2. I used HTSeq-Count to get counts for reads aligning to CDS (this is for ribosome profiling) for each gene. I obviously have a fair number of multi-aligning reads and reads that don't align to the feature (CDS). Should I remove these before running EdgeR or Deseq2?

 

 

 

edger deseq2 • 847 views
ADD COMMENT
0
Entering edit mode
@gordon-smyth
Last seen 29 minutes ago
WEHI, Melbourne, Australia

I think you will find that edgeR and DESeq2 both remove these rows of the htseq output automatically, so you don't have to do it yourself.

But htseq isn't Bioconductor software. You might consider using the Bioconductor package Rsubread instead, which has been benchmarked as faster and more accurate. Rsubread produces a matrix of counts directly as an R object, so the issue you dealing with doesn't arise.

https://www.biorxiv.org/content/early/2018/07/26/377762

ADD COMMENT
0
Entering edit mode

Thanks for the input.

ADD REPLY

Login before adding your answer.

Traffic: 1101 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6