Setting up model for time course analysis

0

Entering edit mode

Casper Shyr ▴ 140

@casper-shyr-4113

Last seen 9.6 years ago

Dear Bioc, I am trying to find DE genes in a time course experiment, and I am a bit confused at how to setup the matrix model. I wasn't able to follow the example in manual page 48-49 because I don't know what the variable "targets" refer to. But I have setup a model based on Section 7.2. My layout is: I have 10 time points, time 0,1,3,5,7,9... Each time point has two replicates. My aim is to find DE genes by comparing each time point to the reference point (i.e. at time 0). Below are my codes related to setting up the model. It would help me a lot if someone could confirm if I'm doing this right, or whether some changes that need to be done. Thanks a lot in advance! design<-model.matrix(~0+factor(c(1,1,2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,1 0,10))); ##1=at time 0, 2=at time 1...and so on colnames(design)<-c("WT0hr","WT1hr","WT2hr","WT6hr","WT12hr","WT18hr", "WT24hr","WT30hr","WT48hr","WT72hr"); fit<-lmFit(allDatagcRMA, design); ##allDatagcRMA contains the expressions normalized by gcrma contrast.matrix <- makeContrasts(WT1hr-WT0hr, WT2hr-WT0hr, WT6hr- WT0hr, WT12hr-WT0hr, WT18hr-WT0hr, WT24hr-WT0hr, WT30hr-WT0hr, WT48hr- WT0hr, WT72hr-WT0hr, levels=design); fit2<-contrasts.fit(fit,contrast.matrix); fit2<-eBayes(fit2); topTableF(fit2, adjust="BH"); ##To get the top 10 genes that respond to at least one of the comparisons above Thanks again! Sincerely, Casper [[alternative HTML version deleted]]

• 853 views

ADD COMMENT • link updated 13.6 years ago by Heidi Dvinge ★ 2.0k • written 13.6 years ago by Casper Shyr ▴ 140

0

Entering edit mode

Heidi Dvinge ★ 2.0k

@heidi-dvinge-2195

Last seen 9.6 years ago

Hi Casper, > > Dear Bioc, > > I am trying to find DE genes in a time course experiment, and I am a > bit confused at how to setup the matrix model. I wasn't able to follow > the example in manual page 48-49 because I don't know what the variable > "targets" refer to. But I have setup a model based on Section 7.2. > I guess you're referring to the limma users' guide. "targets" probably refer to the Targets.txt file that has been read in. Just do a search through the pdf and it should pop up. > > > My layout is: I have 10 time points, time 0,1,3,5,7,9... > > Each time point has two replicates. > > My aim is to find DE genes by comparing each time point to the reference > point (i.e. at time 0). > Not directly related to your question, but when you have such a long time series you might want to consider doing something else than pairwise comparisons in limma. For example, the "timecourse" package uses an approach similar to limma, but is specifically designed for time course experiments, and takes into account that the individual time points are not independent. Comparing all time points directly to t0 does give you a list of genes that respond during one or more time points, but the downstream interpretation might become more difficult. For example, what if you get a gene that is significant at 12, 18 and 30 hours but not at 24? Is there actually a dip in expression level, or is the p-value at t24 just slightly higher than whatever threshold you use? If you get a gene that is significant only at a single time point, is this then a genuine spike in expression level? By doing all these pairwise comparisons I suspect you'll get a lot of genes with a "mixed" behaviour like this. At least if you rely on some fixed p-value threshold and divide genes into differentially expressed/non-differentially expressed at each time point, rather than doing some clustering across the entire time series. HTH \Heidi > > > Below are my codes related to setting up the model. It would help me a lot > if someone could confirm if I'm doing this right, or whether some changes > that need to be done. > > Thanks a lot in advance! > > > > design<-model.matrix(~0+factor(c(1,1,2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9 ,10,10))); > ##1=at time 0, 2=at time 1...and so on > colnames(design)<-c("WT0hr","WT1hr","WT2hr","WT6hr","WT12hr","WT18hr ","WT24hr","WT30hr","WT48hr","WT72hr"); > fit<-lmFit(allDatagcRMA, design); ##allDatagcRMA contains the expressions > normalized by gcrma > contrast.matrix <- makeContrasts(WT1hr-WT0hr, WT2hr-WT0hr, WT6hr- WT0hr, > WT12hr-WT0hr, WT18hr-WT0hr, WT24hr-WT0hr, WT30hr-WT0hr, WT48hr- WT0hr, > WT72hr-WT0hr, levels=design); > fit2<-contrasts.fit(fit,contrast.matrix); > fit2<-eBayes(fit2); > topTableF(fit2, adjust="BH"); ##To get the top 10 genes that respond to at > least one of the comparisons above > > > > > > Thanks again! > > Sincerely, > > Casper > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor >

ADD COMMENT • link 13.6 years ago Heidi Dvinge ★ 2.0k

Login before adding your answer.