Question

Should I skip the eBayes step when using Limma for Affymetrix miRNA v1 chip?

0

Entering edit mode

Guest User ★ 13k

@guest-user-4897

Last seen 11.4 years ago

Dear List, I am working with Affymetrix's miRNA V1 chip, which uses very different probe sets for different molecule types, e.g. 4 identical probes for one miR, or 11 different probes for a snoRNA. I have read that the eBayes step assumes equal error variance between probe sets so it is not suitable for this kind of mixed set of probe set designs. To further complicate matters I am thinking about generating a custom CDF where the miR probe sets would have varied number of probes. http://pomelo2.bioinfo.cnio.es/help/pomelo2-help.html#toc10 Should I look at everything through Limma without the eBayes step (making it equivelant to a normal t-test?), or separate into several different analyses for different molecule types and only drop the eBayes step for the miRs (which will have varying sizes of probe sets)? Many thanks, Scott -- output of sessionInfo(): > sessionInfo() R version 3.0.2 (2013-09-25) Platform: x86_64-w64-mingw32/x64 (64-bit) locale: [1] LC_COLLATE=English_United Kingdom.1252 [2] LC_CTYPE=English_United Kingdom.1252 [3] LC_MONETARY=English_United Kingdom.1252 [4] LC_NUMERIC=C [5] LC_TIME=English_United Kingdom.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base -- Sent via the guest posting facility at bioconductor.org.

miRNA cdf probe limma miRNA cdf probe limma • 2.3k views

ADD COMMENT • link updated 11.4 years ago by Gordon Smyth 53k • written 11.5 years ago by Guest User ★ 13k

score 5 · Answer 1 · 2014-09-26

5

Entering edit mode

Gordon Smyth 53k

@gordon-smyth

Last seen 8 hours ago

WEHI, Melbourne, Australia

Dear Scott,

You are misinterpreting the empirical Bayes assumption of eBayes. eBayes doesn't assume equal variances, it only assumes that the variances can be considered as a random sample from the same distribution.

Omitting the eBayes step would would throw the baby out with the bath water.

I doubt that the error variance depends quite as directly on the number of probes in a probe-set as you might think. When we have analysed the miRNA Affymetrix chip, we have found that it has major problems from the point of view of normalization, while the issue that you raise is relatively minor.

I could suggest ways to take into account the number of probes per probe-set in the eBayes calculations, but I don't think this will be important.

Best wishes
Gordon

PS. If you have the choice, RNA-seq is cheaper and better.

[This answer was originally posted to the Bioconductor mailing list on 22 August 2014.]

ADD COMMENT • link 11.4 years ago Gordon Smyth 53k

1

Entering edit mode

Just for the record, here is how you can incorporate probe-set size into the eBayes step to see whether it is important. I will simulate a little toy example where we know there should be a trend:

library(limma)
n <- 1:100 # number of probes per probe-set
ID <- rep(1:100,n) # probe-set IDs
nprobes <- length(ID)
x <- matrix(rnorm(nprobes*3),nprobes,3)
y <- avereps(x,ID=ID)

# y has 100 rows, each row is an average of n probes

design <- matrix(1,3,1)
fit <- lmFit(y,design)
fit$Amean <- log(n)
fit <- eBayes(fit,trend=TRUE)
plotSA(fit)

Limma will estimate a decreasing trend of variance vs n as well as doing empirical Bayes squeezing around the trend. The x-axis label of the plot will say "average expression" but it is actually log(n).

Best wishes
Gordon

[This answer was originally posted to the Bioconductor mailing list on 24 August 2014.]

ADD REPLY • link 11.4 years ago Gordon Smyth 53k