Redundant meta-features (genes) using FeatureCounts
2
0
Entering edit mode
ccheung • 0
@ccheung-7248
Last seen 8.5 years ago
European Union

Hi,

I'm completely new to RNASeq analysis so I'm having a lot of difficulties analyzing the data. First of all, I got some raw FeatureCount data, and there are multiple entries of the same meta-feature (gene) all with the same or very similar counts. What does this mean?  How do I combine them or remove them so that I only have data for one meta-feature (gene)?

Also, the raw data includes novel unannotated "genes". How do I remove these so that I can analyze only the annotated genes (ie. ones in Ensembl).

Any help or advice would be greatly appreciated. Thanx.

carol

rna-seq featurecounts • 1.9k views
ADD COMMENT
0
Entering edit mode

Hi Carol,

In order for us to help you, we will need more information. For example, how did you 'get some raw FeatureCount data'? It's best if you show the code you used, preferably cut down to just what's relevant to your question.

ADD REPLY
0
Entering edit mode
@gordon-smyth
Last seen 1 hour ago
WEHI, Melbourne, Australia

When you say that you have FeatureCount data, do you mean mean that you have output from the featureCounts() function of the Rsubread package? If so, then the reason that you have multiple rows for the same gene is that your GFF file contains multiple meta-features for the same gene. For example, the meta-features might be Ensembl transcripts, and each gene may have several transcripts that contain almost the same exons.

ADD COMMENT
0
Entering edit mode
ccheung • 0
@ccheung-7248
Last seen 8.5 years ago
European Union

Thanx Gordon!

ADD COMMENT

Login before adding your answer.

Traffic: 750 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6