Question regarding handling technical replicates for Affy arrays
0
0
Entering edit mode
@gordon-smyth
Last seen 1 hour ago
WEHI, Melbourne, Australia
Hi Noel, Your code looks correct. Have you looked at the value of the consensus correlation, to check that it is reasonable? Best wishes Gordon > Date: Sat, 27 May 2006 17:56:51 -0700 (PDT) > From: "noel0925 at sbcglobal.net" <noel0925 at="" sbcglobal.net=""> > Subject: [BioC] Question regarding handling technical replicates for > Affy arrays > To: bioconductor at stat.math.ethz.ch > > Hi All, > > There seems to have been much discussion regarding > the statistical issues surrounding handling of > technical replicates in gene expression data using the > Limma package as pertains to two-color arrays, but > less so for Affy data. > > As such, I am still uncertain regarding how best to > handle a single technical replicate in a dataset 40 > Affy arrays from 3 tumor types. > > Clearly, I want to model the biological variation > between these three tumor types as a fixed effect. > > Limma, I understand, is only able to handle 1 random > effect. Unlike two-color arrays, single color arrays > do not necessitate allocating random effects to within > array spot replicates or dye swaps so, I should still > be able to model the lack of dependence between these > two technical reps as a random effect using the > duplicateCorrelation function. > > It is unclear to me how to specify this single > technical replicate however. The examples I have read > about in the limma manual or other desciptions of its > functions usually deal with cDNA arrays and usually > there is a nice structure to the technical > replication- eg pairs of technical replicates for each > sample (dye swap or otherwise). In my case, I only > have one technical rep (and 20, 10, and 10 biolgical > reps). > > What seemed to make sense to me was: > >>classes<- c(rep(1,20), rep(2,10), rep(3,10)) > >>f<- factor(targets$Target, levels = c("RNA1", "RNA2", > "RNA3")) >>design<- model.matrix(~0+f) >>colnames(design)<- c("RNA1", "RNA2", "RNA3") >>biolrep <- c(1,2,3,4,5,5,6,7,8,9??40) >>corfit<- duplicateCorrelation(eset, design, ndups=1, > block= biolrep) >>fit<-lmFit(eset,design, ndups=1, block=biolrep > ,cor=corfit$consensus) >>contrast.matrix<- makeContrasts(RNA1-RNA3, > RNA2-RNA3,RNA1-RAN2, levels=design) >>fit2<- contrasts.fit(fit, contrast.matrix) >>fit2<-eBayes(fit2) > > How else can I specify biolrep? > > > > Obviously it would be preferable to model the lack of > dependence rather than go with the alternatives. > > i.) Averaging the two technical reps so as to treat > them as primary data. > I assume this would be done post-normalization and > summarization? > > > ii.) I have also read that simply treating the > technical rep as a biological rep is not too dangerous > since- the measurement error can be larger than the > biological variation- but am hesitant since I want to > perform downstream analysis like clustering and don't > want an 'extra' sample that really comes from the same > patient. > > iii.) Discard the data from one of the technical reps > (worse still). > > I would greatly appreciate any insight into the best > way to handle this issue. > > Thanks in advance. > Noelle
GO Clustering affy limma GO Clustering affy limma • 828 views
ADD COMMENT

Login before adding your answer.

Traffic: 683 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6