Question: Really weird results when I filter this new data that I have, script has worked on two previous data sets.
This is my script, which I have used in the past: it filters the data and then produces my list of DEGs. With this new dataset the list contains only pos_controls from Affymetrix and no main probes. I do not understand this, because I thought the filter step removed the controls, and it does so on my other 2 datasets. Thanks!

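#packages used below: oligo (CEL reading, rma, getNetAffx), affycoretools (getMainProbes),
#genefilter (nsFilter) and limma (lmFit, eBayes, topTable)
library(oligo)
library(affycoretools)
library(genefilter)
library(limma)

mydir <- "C:\\Users\\hakim\\Documents\\hanane_data"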

#listing the files from directory using special CEL file read function
celList <- list.celfiles(mydir, full.names=TRUE)
#reading data from celList and setting the annotation package to the appropriate one for this microarray
rawData <- read.celfiles(celList, pkgname='')
#normalizing the data using RMA algorithm
normData <- rma(rawData, target="core")
#retrieving feature data
featureData(normData) <- getNetAffx(normData, "transcript")

#the respective experimental groups of your data
group <- factor(c((,4)),,4),,4)
#design and contrast matrix of the data
design <- model.matrix(~ 0 + group)
colnames(design) <- c("ID01","I01","ID005","I005", "ID3","I3","CtlID","CtlI")
contrast <- makeContrasts( "ID01-ID005","ID01-ID3","ID005-ID3","ID01-CtlID","ID005-CtlID","ID3-CtlID",
                           levels= design )
eset <- getMainProbes(normData)
normData.filtered <- nsFilter(eset, require.entrez = FALSE,
                              remove.dupEntrez = FALSE)

#fit the linear model, apply the contrasts, then moderate the statistics
normfit <- eBayes(contrasts.fit(lmFit(normData.filtered$eset, design), contrast))

#getting the list of probes
probeset.list <-topTable(normfit,coef="ID01-ID3",number=100000, adjust="BH", lfc=1)

written 19 months ago by hakimelakhrass • modified 19 months ago by James W. MacDonald
Debugging your own scripts is really up to you as an analyst. Things look OK to me, however:

> eset2 <- rma(read.celfiles(list.celfiles()))
> annotation(eset2)
 [1] "" 
> featureData(eset2) <- getNetAffx(eset2, "transcript")
> library(affycoretools)    
> eset2 <- getMainProbes(eset2)
> table(pData(featureData(eset2))$category)
           main normgene->exon  
          33793           1352             

And you should expect some normgene->exon probesets in this table, as they do double-duty as normgene->exon and main probesets on this array, and the Affy transcript csv labels them as normgene rather than main probesets.
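
The same kind of check can be run on the final gene list rather than on the ExpressionSet. The following is only a minimal sketch in base R: the data frame `top` is a made-up stand-in for `topTable()` output, and its probeset IDs, category labels and logFC values are invented for illustration. It assumes the annotation column `category` is carried through into the results table.

```r
# Toy stand-in for a topTable() result; every value here is hypothetical.
top <- data.frame(
  probesetid = c("p1", "p2", "p3", "p4"),
  category   = c("main", "normgene->exon", "main", "control->affx"),
  logFC      = c(1.8, -1.2, 2.4, 3.1),
  stringsAsFactors = FALSE
)

# Count how many probesets of each category ended up in the list.
cat_counts <- table(top$category)
print(cat_counts)

# Keep only the categories that are expected to survive the main-probe filter.
expected <- c("main", "normgene->exon")
main_like <- top[top$category %in% expected, ]

# Anything left over points at a filtering or annotation problem.
unexpected <- top[!top$category %in% expected, ]
print(unexpected$probesetid)
```

If rows show up in `unexpected` on real data, it is worth comparing `annotation()` and the category table for the new dataset against the two datasets where the script worked.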

written 19 months ago by James W. MacDonald
