Hello!
I have to analyze data obtained from Affymetrix miRNA 4.0 microarray. It is the first time I have to analyze such kind of data and I wanted to be sure that I was doing it right. My main concerns are about the normalization of the probe's intensities and the filtering in limma.
# Normalization library(oligo) library(pd.mirna.4.0) celFiles <- list.celfiles("~/Desktop/Affymetrix_miRNA/RawData_miRNA", full.names=TRUE) rawData <- read.celfiles(celFiles, pkgname="pd.mirna.4.0") eset <- rma(rawData) # I did some plots and everything look really great after normalization. # Limma # I directly use the eset data to calculte the miRNAs differentially expressed.
Question:
1. Normalization: Should I do something else for the normalization or apply rma is usually enough? Should I do something with the information of the spike-in probes?
2. Filtering: I have seen that among the probes on this micro-array some are unrelevant for my analysis:
snoRNA, spike-in,...
should I remove such probes before computing all the statistics from limma?
3. More globally, I found many tutorials talking about normalization of microarrays for genes but not for miRNAs. Are they differences in the processing of those two types of microarrays that I should know?
4. From what I have seen, some miRNAs are represented by several probes on the chip. They seem to be clustered during the normalization step and the creation of the expression dataset. Can someone explain me how it is done or point me to article/post that explain such thing?
Thanks in advance!
Thanks a lot!
For the last question I was talking about the multiple probes into single probeset so you answered my question.
I have another quick question related to the previous one.
I see that usually when a particular miR is differentially expressed, it is in most of the represented species but not in all. I supposed it is due to the conservation of those miRNAs accros species but I was wondering if in general people tends to restricted their differential expression analysis to the species they are interested in? Like a prefiltering of the data for only human probes is better than use the whole information if I am working with human samples?
Thanks!
That sort of depends on what you are after. In general I restrict to the species under consideration, but you could make the argument that some of the other miRNA transcripts (non-human) for which there isn't a human transcript on the array are actually expressed in humans, and we just don't know about it yet. In that case, a differentially expressed non-human miRNA may indicate that the miRNA is expressed in humans, and differentially so in your experiment.
As usual, there are trade-offs, and you have to decide what trade-offs you want to make.
Thanks this is what I thought!
Hello, Before applying limma it is recommended to remove low expressed probes. I am not sure about how to perform this filtering step based on intensities in this chip, which cutoff to select. Any advice would be much appreciated. Thanks in advance.