Question

Analyse Microarray Data - AVG_Signal

0

Entering edit mode

Alex Greenshields-Watson • 0

@alex-greenshields-watson-22187

Last seen 6.2 years ago

Cardiff

Hello,

I have a historic microarray dataset from an illumina bead based technology. For my data I only have an dataframe where the columns include, for each sample:

"AVG_Signal"
"NARRAYS"
"ARRAY_STDEV"
"Detection Pval"

I would like to run a comparison analysis using limma to get p adjusted values, however I am not sure what my data corresponds to based on instructions in the limma users guide.

I am more familiar with RNA-seq analysis using DESeq2. However have read posts which suggest Deseq2 is not applicable for microarray - is this correct?

What should I do regarding normalisation and comparison?

The within array normalisation function does not seem applicable.

I have proceeded straight to model.matrix(), lmFit() and makeContrasts() - without any normalisation - however I have only a very small number of differentially expressed genes as a result.

Any advice would be very helpful.

Kind Regards, Alex

limma microarray • 3.0k views

ADD COMMENT • link updated 6.2 years ago by Gordon Smyth 53k • written 6.2 years ago by Alex Greenshields-Watson • 0

1

Entering edit mode

Correct. DESeq2 is not appropriate for microarray.

ADD REPLY • link 6.2 years ago Michael Love 43k

0

Entering edit mode

James W. MacDonald 68k

@james-w-macdonald-5106

Last seen 2 hours ago

United States

If you just have averages, you can't do much with those data. You need the raw data.

ADD COMMENT • link 6.2 years ago James W. MacDonald 68k

0

Entering edit mode

Well, I think it's technically the average signal per bead for each gene in that sample. From looking online this seems like a standard output from illumina bead arrays and I think this is in fact the raw data (as a non-normalised bead summary), in a format created by bead studio, see publication below - raw data file.

http://www.damtp.cam.ac.uk/user/st321/CV&Publications_files/STpapers-pdf/DSTT06.pdf

As a follow up I've managed to get it through Limma and have now got a meaningful set of DEGs that make sense biologically. I will check and then post the code in a follow up.

ADD REPLY • link 6.2 years ago Alex Greenshields-Watson • 0

score 4 · Accepted Answer · 2019-11-12

4

Entering edit mode

Gordon Smyth 53k

@gordon-smyth

Last seen 2 hours ago

WEHI, Melbourne, Australia

The limma pipeline for Illumina Beadchips is described in Sections 4.6 and 17.3 of the limma User's Guide. Briefly, you use read.ilmn to read the file and neqc to background correct and normalize.

limma will read the intensities (AVE_Signal) and the detection p-values, and will use the detection p-values to infer the intensities of the background probes as part of the background correction step. Usually the NARRAYS entries will be all 1.

ADD COMMENT • link 6.2 years ago Gordon Smyth 53k

0

Entering edit mode

Many thanks. Apologies I missed this in the guide ;) Kind Regards, Alex

ADD REPLY • link 6.2 years ago Alex Greenshields-Watson • 0

0

Entering edit mode

I know this is a slightly separate question - but if I want to then use this normalised Limma data in visualisation, PCA or distance analysis - should you sum or average multiple probe values that correspond to the same gene? Or just leave them as they are?

ADD REPLY • link 6.2 years ago Alex Greenshields-Watson • 0

0

Entering edit mode

Found this post:

https://support.bioconductor.org/p/43745/

Sorted thanks.

ADD REPLY • link 6.2 years ago Alex Greenshields-Watson • 0