Question

how to use `arrayWeightsQuick()`?

0

Entering edit mode

matthew • 0

@5bc3c081

Last seen 7 months ago

United States

I am using limma for a large analysis (>5,000 samples, >300 covariates) of single-channel microarray data without probe quality weights. I find that arrayWeights() is unusably slow in this large-sample, large-coefficient setting (arrayWeights() has been running for > 24 hours and is still going).

Therefore, I am interested in using arrayWeightsQuick() as an alternative to arrayWeights(). However, arrayWeightsQuick() cannot be a drop-in replacement for arrayWeights() because the function signature of arrayWeightsQuick() requires the output of lmFit(). I believe this means that arrayWeightsQuick() is used at a different stage of the workflow relative to arrayWeights().

Here is my current workflow using arrayWeights():

weights <- arrayWeights(expression, design=design)  # unusably slow!
fit <- lmFit(expression, design=design, weights=weights)

Is the following 2-step procedure a proper way to substitute arrayWeightsQuick() for arrayWeights()?

prefit <- lmFit(expression, design=design) # temporary
weights <- arrayWeightsQuick(expression, prefit)
fit <- lmFit(expression, design=design, weights=weights)

Relatedly: the user guide for limma mentions arrayWeightsSimple(), but no such function exists. Perhaps arrayWeightsSimple() was renamed to arrayWeightsQuick() in the code, but the documentation is not yet updated accordingly?

limma • 2.0k views

ADD COMMENT • link updated 8 months ago by TraceMorrow • 0 • written 9 months ago by matthew • 0

0

Entering edit mode

What sort of object is expression? Does it contain any missing values?

ADD REPLY • link 9 months ago Gordon Smyth 53k

0

Entering edit mode

In the specific use-case I have in mind, expression is a simple matrix of log2 gene expression per sample, no missing values.

ADD REPLY • link 9 months ago matthew • 0

0

Entering edit mode

Yes, your proposed approach is correct. Using arrayWeightsQuick() after an initial lmFit() is suitable for large datasets, providing faster computation compared to arrayWeights(). Ensure your expression matrix has no missing values for optimal performance.

ADD REPLY • link 8 months ago TraceMorrow • 0

score 2 · Accepted Answer · 2025-04-10

Yes, that is the correct way to use arrayWeightsQuick.

Thanks for pointing out that there is still a reference to arrayWeightsSimple in the User's Guide. That function was merged into arrayWeights in 2019 and was removed as redundant, as part of a major rewrite of arrayWeights. arrayWeightsSimple was always different from arrayWeightsQuick. If you are running arrayWeights now on a matrix with no missing values or prior weights, then it will run effectively the same algorithm that was previously in arrayWeightsSimple.

On my laptop, it takes just over an hour for arrayWeights() to run on a dataset with 10,000 probes, 5000 samples and a design matrix with 300 columns. Meanwhile, arrayWeightsQuick takes only a few seconds.