Question

Best way of presenting "absolute" expression values (edgeR)

0

Entering edit mode

Sindre ▴ 110

@sindre-6193

Last seen 3.7 years ago

Hi! We want to classify a new type of glands by ranking genes by expression level using RNAseq. We don't have any good controls, so we just want to see a ranked list of genes. I have used Cufflinks RPKM values, but if I want to use edgeR, is this a valid way of doing it using featureCounts: fc <- featureCounts(files=targets$Targets,nthreads=8, isGTFAnnotationFile=TRUE, GTF.attrType="gene_id", GTF.featureType="exon", useMetaFeatures=TRUE, annot.ext="genes.gtf") x <- DGEList(counts=fc$counts, genes=fc$annotation) expr <- calcNormFactors(x) expr_norm <- rpkm(expr, log=FALSE,gene.length=x$genes$Length) # Getting gene length from FeatureCounts, using rkpm() in the edgeR package, not Rsubread.. Then just write out this table.. Thanks!

RNASeq edgeR RNASeq edgeR • 2.2k views

ADD COMMENT • link updated 13 months ago by Gordon Smyth 50k • written 10.0 years ago by Sindre ▴ 110

0

Entering edit mode

Hi! I know a lot of people do the following: 1. Take a list of differentially expressed genes 2. Fetch the FASTA files for protein coding genes 3. Predict if secretory by using SignalP My question is, does it exist a list/database of known/predicted secretory protein coding genes from hg19? That would be much more efficient than many people predicting the same proteins a lot of times..

ADD REPLY • link 10.0 years ago Sindre ▴ 110

score 0 · Answer 1 · 2014-05-10

0

Entering edit mode

Gordon Smyth 50k

@gordon-smyth

Last seen 3 hours ago

WEHI, Melbourne, Australia

Yes, your code is fine for getting normalized RPKM from featureCounts and edgeR.

Your code is similar to the public case study: https://bioinf.wehi.edu.au/RNAseqCaseStudy

In the latest version of edgeR, you can even simplify the code to

expr_norm <- rpkm(expr)

Gordon

ADD COMMENT • link 10.0 years ago • updated 13 months ago Gordon Smyth 50k