Question: Appropriate merge of batches of the same array to create a final total ExpressionSet
gravatar for svlachavas
4.1 years ago by
Greece/Athens/National Hellenic Research Foundation
svlachavas660 wrote:

Dear ALL,

based on an experimental design of testing 4 different substances("treatments") , dataset grows incrementally, as about on average a month period i get small batches of 5 CELs each(one control and 4 "treatments") which grow as beside the first batch, now i got the second batch which are "technical replicates" of the first batch. The platform is Affymetrix PrimeView(PM-only). So my main and very important question is:

as there is a "batch effect"(but on the same array batches) regarding the time of "creating" getting and each batch, should i preprossess each batch with the fRMA algorithm like this:

i.e. batch1 < ReadAffy(..)

norm1 <- frma(batch_1, background="rma", normalize="quantile", summarize="robust_weighted_average", target="probeset) ?

and then merge all the collected batches with some package such as inSilicoMerging which has a batch effect option ?

Or also there are other options in the frma() command that can also account for this ?

Please excuse me for my naive questions, but im a beginner in R, and i believe that this question is important, as in the end i want to unite these small batches i collect to have a final ExpressionSet

ADD COMMENTlink modified 4.1 years ago by Matthew McCall830 • written 4.1 years ago by svlachavas660


I have a couple questions that will help me answer your question:

1. Do you really mean "technical replicates" -- same biological sample analyzed multiple times? From your description of how the data arise, it sounds like you probably have "biological replicates". 

2. Does each batch also represent a different biological unit (e.g. different patient)?

Best, Matt


ADD REPLYlink written 4.1 years ago by Matthew McCall830

Dear Mathiew,

please excuse me for the naive description. Actually, i checked my notes and there are indeed biological replicates: we use a specific cancer cell line and 4 different substances with a control to evaluate posssible apoptotic and other effects. So each batch represent the same procedure with the specific cell line and these substances

ADD REPLYlink written 4.1 years ago by svlachavas660
Answer: Appropriate merge of batches of the same array to create a final total Expressio
gravatar for Matthew McCall
4.1 years ago by
United States
Matthew McCall830 wrote:

You can preprocess your data using fRMA, in the manner you suggest, on each batch of samples and then combine the data for analysis. This will allow you to avoid repeatedly preprocessing your data and still be able to make comparisons across batches. However, fRMA only addresses one specific type of batch effects (ones affecting individual probes), so you may need to perform additional batch effect correction / modeling. There are several ways to do this (searching the BioC help archives will turn up multiple options).

ADD COMMENTlink written 4.1 years ago by Matthew McCall830

Thank you for your answer. After reading the vignette for fRMA, i found that is appropriate to perform a additional batch effect correction, while i merge/combine the different batches to have the complete dataset. Thats why i found the package inSilicoMerging package and the function merge(): merge(esets, method='NONE'); method:   
Merging method aimed at removing inter-study bias. Possible options are: BMC, COMBAT, GENENORM and XPN.

But i wil also search other packages for a similar approach. Moreover, i would like to ask you one more specific question:

regarding the summarization method of fRMA, should i use the "robust_weighted_average" option or another option is more apropriate ?

Best Regards

ADD REPLYlink written 4.1 years ago by svlachavas660

The default summarization for fRMA, "robust_weighted_average", works well in most cases. If the probe effects differ across your batches, you might want to try "random_effect" summarization. However, in most cases the two will provide very similar results.

ADD REPLYlink written 4.1 years ago by Matthew McCall830
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 111 users visited in the last hour