Question

Correct annotation R package for human clariom array analysis

0

Entering edit mode

svlachavas ▴ 840

@svlachavas-7225

Last seen 5 days ago

Germany/Heidelberg/German Cancer Resear…

Dear Community,

i have tried recently to read, process and analyze some human clariom array CEL files in R-unfortunately, i have no information from my collaborators which specific platform has been used (for example S or D)-so i tried the following approach:


library(oligo)
library(affycoretools)
library(limma)
library(org.Hs.eg.db)

dat <- read.celfiles(list.celfiles())

# Platform design info loaded.
Reading in : 01_SUM-159.CEL
Reading in : 02_SUM-159-WT1.CEL
Reading in : 03_SUM-159-WT3.CEL
Reading in : 04_SUM-159-WT4.CEL
Reading in : 05_SUM-159-WT5.CEL
Reading in : 06_SUM-159-WT6.CEL
Reading in : 07_SUM-159-KO3.CEL
Reading in : 08_SUM-159-KO4.CEL
Reading in : 09_SUM-159-KO5.CEL
Reading in : 10_SUM-159-KO6.CEL
Reading in : 11_SUM-159-KO8.CEL
Reading in : 12_SUM-159-KO10.CEL

dat
GeneFeatureSet (storageMode: lockedEnvironment)
assayData: 2598544 features, 12 samples 
  element names: exprs 
protocolData
  rowNames: 01_SUM-159.CEL 02_SUM-159-WT1.CEL ...
    12_SUM-159-KO10.CEL (12 total)
  varLabels: exprs dates
  varMetadata: labelDescription channel
phenoData
  rowNames: 01_SUM-159.CEL 02_SUM-159-WT1.CEL ...
    12_SUM-159-KO10.CEL (12 total)
  varLabels: index
  varMetadata: labelDescription channel
featureData: none
experimentData: use 'experimentData(object)'
Annotation: pd.hugene.2.0.st

z <- pData(dat)

otherdat <- data.frame(Condition_Clariom = c("WT","WT","WT","WT","WT","WT",
"KO_clone","KO_clone","KO_clone","KO_clone","KO_clone","KO_clone"))

z <- data.frame(z, otherdat)
pData(dat) <- z
validObject(dat)

celfiles.rma <- rma(dat, target="core")
eset.rma <- getMainProbes(celfiles.rma)

# library(hugene20sttranscriptcluster.db)
eset.rma <- annotateEset(eset.rma, hugene20sttranscriptcluster.db).....

Thus, my main question is: as above after reading the raw CEL files, the annotation platform of the dat object is pd.hugene.2.0.st, could i use the above procedure with hugene20sttranscriptcluster.db as annotation ?

Or my above approach is incorrect and i should use another annotation R package ?

Thank you in advance,

Efstathios

clariom oligo affycoretools pd.hugene.2.0.st • 1.6k views

ADD COMMENT • link updated 6.0 years ago by James W. MacDonald 68k • written 6.0 years ago by svlachavas ▴ 840

score 1 · Answer 1 · 2019-03-12

1

Entering edit mode

James W. MacDonald 68k

@james-w-macdonald-5106

Last seen 2 hours ago

United States

That's the conventional thing to do. Why do you have doubts? Also, why do you think these are clariom arrays?

ADD COMMENT • link 6.0 years ago James W. MacDonald 68k

0

Entering edit mode

Dear James,

thanks for your reply-unfortunately, i have just an excel file-which has an extra description for these cel files with the abbreviation "Clariom_SUM159"-where SUM159 is a cancer cell line-and i can't currently have any other extra information-

thus, you believe that perhaps these are not even clariom and are indeed gene st arrays ? my initial doubt for the analysis was because i have never analyzed in the past clariom arrays (if indeed these are...)

ADD REPLY • link 6.0 years ago svlachavas ▴ 840

0

Entering edit mode

Wait. You show a bunch of code that makes it look like you have celfiles that you read in and processed using oligo, but now you say you just have an Excel spreadsheet? Why would you post a bunch of code that has no relation to what you are actually doing?

ADD REPLY • link 6.0 years ago James W. MacDonald 68k

0

Entering edit mode

Dear James, please excuse me for any unintended misunderstanding- indeed these are the relative CEL files for the experiment-i just meant that any information about the experiment, is on an extra excel file, that only mentions the above information-

ADD REPLY • link 6.0 years ago svlachavas ▴ 840

0

Entering edit mode

If the oligo package says they are hugene20 arrays, then you can be assured that is what they are.

ADD REPLY • link 6.0 years ago James W. MacDonald 68k