Question

Illumina HT-12 v4 Expression BeadChip

3

arisarkar88 ▴ 40

@arisarkar88-11464

Los Angeles

How do I normalize the data and identify differentially expressed genes for the Illumina HT-12 v4 Expression BeadChip?

microarray illumina human ht-12 v4 • 5.4k views

ADD COMMENT • link updated 8.7 years ago by Gordon Smyth 53k • written 8.7 years ago by arisarkar88 ▴ 40

1

Use limma and read the manual.

ADD REPLY • link 8.7 years ago chris86 ▴ 420

0

Thank you for your replies. I am trying to analyse GSE74629 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE74629). Can limma handle this kind of data, where only the non-normalized txt file is provided? The non-normalized file has only three kinds of columns: probe ID, sample intensity, and detection p-value. What input files does limma need for this analysis?

ADD REPLY • link 8.7 years ago arisarkar88 ▴ 40

0

Yes, limma will read and process those columns. See my answer below.

ADD REPLY • link 8.7 years ago Gordon Smyth 53k

score 7 · Answer 1 · 2017-02-21

limma processes this sort of data easily. The key is to use the arguments of read.ilmn() to tell limma how the columns are named in the data file from GEO. Here is a quick limma analysis of GSE74629:

> library(limma)
> x <- read.ilmn("GSE74629_non-normalized.txt",expr="SAMPLE ",probeid="ID_REF")
Reading file GSE74629_non-normalized.txt ... ...
> y <- neqc(x)
Note: inferring mean and variance of negative control probe intensities from the
detection p-values.
> Group <- rep(c("PDAC","Healthy"),c(36,14))
> Group <- factor(Group)
> design <- model.matrix(~Group)
> keep <- rowSums(y$E>5) >= 14
> y2 <- y[keep,]
> fit <- lmFit(y2,design)
> fit <- eBayes(fit,trend=TRUE,robust=TRUE)
> topTable(fit,coef=2)
              logFC AveExpr     t  P.Value adj.P.Val    B
ILMN_2079655 -1.504    8.54 -8.28 3.50e-11  4.79e-07 15.2
ILMN_1697268  0.879    8.31  7.92 1.09e-10  5.00e-07 14.1
ILMN_1784884  0.886   11.74  7.86 1.40e-10  5.00e-07 13.9
ILMN_1705892 -1.049    6.85 -7.85 1.46e-10  5.00e-07 13.9
ILMN_1804738  0.795    7.26  7.56 4.31e-10  1.11e-06 12.8
ILMN_3201663 -0.815    4.93 -7.53 4.87e-10  1.11e-06 12.7
ILMN_1652073 -0.939   11.04 -7.43 7.08e-10  1.33e-06 12.4
ILMN_3226875 -1.219    9.83 -7.41 8.33e-10  1.33e-06 12.2
ILMN_1811702  0.927    9.09  7.37 8.76e-10  1.33e-06 12.2
ILMN_1797522  0.807    9.04  7.27 1.32e-09  1.58e-06 11.8
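
The logFC values above are for coef=2 of the design matrix. As a minimal base-R sketch of what that coefficient represents (assuming, as the analysis above does, that the first 36 arrays in the file are PDAC and the remaining 14 are Healthy), the factor levels and design columns can be inspected directly:

```r
# Group labels as in the analysis above. The 36/14 ordering is taken from
# that analysis and is not verified against the GEO sample sheet here.
Group <- factor(rep(c("PDAC", "Healthy"), c(36, 14)))

# factor() sorts levels alphabetically, so "Healthy" is the reference level.
levels(Group)
# "Healthy" "PDAC"

# With a reference-level parametrization, column 1 is the intercept
# (mean of Healthy) and column 2 is the PDAC-minus-Healthy difference.
design <- model.matrix(~Group)
colnames(design)
# "(Intercept)" "GroupPDAC"

# Column 2 is 1 for PDAC arrays and 0 for Healthy arrays.
table(Group, design[, 2])
```

Under this coding, a positive logFC means higher log2 expression in PDAC than in Healthy, and a negative logFC means lower expression in PDAC.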

The neqc normalization method used above is described here: https://doi.org/10.1093/nar/gkq871
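
The filtering line in the analysis above, `keep <- rowSums(y$E>5) >= 14`, keeps probes whose normalized log2 intensity exceeds 5 on at least 14 arrays; 14 is the size of the smaller (Healthy) group, which is presumably why it was chosen. The sketch below shows the same rule in base R on a simulated matrix. The matrix `E`, the probe names and the simulated intensities are made up for illustration and are not the GSE74629 data:

```r
set.seed(1)

n_probes <- 1000
n_arrays <- 50

# Hypothetical log2 expression matrix: each probe gets its own mean level,
# with small array-to-array noise around it.
probe_mean <- runif(n_probes, min = 3, max = 12)
E <- matrix(
  rnorm(n_probes * n_arrays, mean = rep(probe_mean, n_arrays), sd = 0.5),
  nrow = n_probes, ncol = n_arrays,
  dimnames = list(paste0("probe", seq_len(n_probes)), NULL)
)

# For each probe, count the arrays with log2 intensity above 5 and
# keep the probe if at least 14 arrays pass.
keep <- rowSums(E > 5) >= 14
table(keep)

# Subset rows, leaving all arrays in place.
E2 <- E[keep, , drop = FALSE]
dim(E2)
```

The threshold (5) and the minimum number of arrays (14) are specific to this dataset; for other data they would need to be chosen from the intensity distribution and the group sizes.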

score 1 · Answer 2 · 2017-02-21

1

svlachavas ▴ 840

@svlachavas-7225

Germany/Heidelberg/German Cancer Resear…

Just to extend Chris's answer: the R package BeadArrayUseCases (http://bioconductor.org/packages/release/data/experiment/html/BeadArrayUseCases.html) has a very extensive vignette written specifically for Illumina platforms.

ADD COMMENT • link 8.7 years ago svlachavas ▴ 840