Differential Gene Expression
2
0
Entering edit mode
fromhj304 ▴ 10
@fromhj304-11588
Last seen 8.2 years ago

Hi all, I'm right now working with a RNA-seq raw count data file (in .txt format). It's a matrix of ~50,000 genes (row) and 8 samples (column).

My assignment is to get differential gene expression analysis. But I'm not really sure what it means by that. I'm using DESeq2 package in R for this task, and since I'm really new to R and bioinformatics, I'm not really sure what they want as the result.

 

So my questions are:

1) What does one expect from differential expression analysis? I've got the list of genes that have p-values less than 0.05. Would that be sufficient for differential expression analysis? What I have right now is just the list of genes.

2) What is MA plot and what does it have to do with differential expression analysis? I've read some papers about RNA-Seq and R packages that help analysis, and most of them mentioned MA plot. What is its significance in DE analysis? And what can we see from it?

 

Here is part of my source code that I wrote to get the list of genes that have p-values less than 0.05:

#get differential expression results
res<-results(dds)

#sort by p-value
res<-res[order(res$pvalue),]
head(res)

#write a .csv file that contains information of genes
#that have p-value smaller than 0.05
write.csv(res[which(res$pvalue<0.05),], file="DEoutput.csv")

 

Thanks!

deseq2 bioinformatics rna-seq Tutorial • 1.7k views
ADD COMMENT
1
Entering edit mode
@gordon-smyth
Last seen 15 hours ago
WEHI, Melbourne, Australia

MA plots were originally an invention of Terry Speed's group at UC Berkeley for two-color microarrays. MA plots first appeared in the sma and affy packages back in 2002, and were a way to display the raw data from a two-color microarray or for a pair of Affymetrix microarrays. The idea of using MA plots to display the results of a differential expression analysis appeared in the limma package in 2004.

A short review of MA plots can be found in the section on "mean-difference plots" in this article:

http://nar.oxfordjournals.org/content/43/7/e47.full

ADD COMMENT
1
Entering edit mode
@mikelove
Last seen 3 days ago
United States

"1) What does one expect from differential expression analysis? I've got the list of genes that have p-values less than 0.05. Would that be sufficient for differential expression analysis? What I have right now is just the list of genes."

This is one possible "end point" for DE analysis. The list of genes with adjusted p-value less than X should satisfy the property that in expectation the false discovery rate in this set is less than X.

"2) What is MA plot and what does it have to do with differential expression analysis? I've read some papers about RNA-Seq and R packages that help analysis, and most of them mentioned MA plot. What is its significance in DE analysis? And what can we see from it?"

Check Gordon Smyth's pointer to a paper describing the plot.

You can also read about the MA plot in DESeq2 by reading Section 1.5.1 MA-plot, in the DESeq2 vignette:

vignette("DESeq2")

Or the help page for DESeq2's plotMA function:

?plotMA

Or the description of DESeq2's MA plot in our RNA-seq workflow:

http://www.bioconductor.org/help/workflows/rnaseqGene/#plotting-results

ADD COMMENT
0
Entering edit mode

Hi thank you so much for the answer!

But why did you use adjusted p-values instead of just p-values? What are their differences?

 

Thanks again!
 

ADD REPLY
0
Entering edit mode

See "Multiple testing" in our workflow.

ADD REPLY
0
Entering edit mode

Thank you so much!!

ADD REPLY

Login before adding your answer.

Traffic: 624 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6