normalization for custom chip

0

Entering edit mode

Deng, Shibing ▴ 10

@deng-shibing-1006

Last seen 11.5 years ago

Hi, We are designing a custom Affymetrix chip with about 1700 genes. By design, a large number of genes on the chip will be differentially expressed between our treatment and control samples. The assumption for quantile normalization and other distribution-based normalization methods will not hold for these chips. To normalize them, we plan to put some "house-keeping" genes or "invariant" genes covering a wide range of intensities on the chip. We are not sure how many house-keeping genes we should have to get a good normalization? I will appreciate your input on this issue. Shibing LEGAL NOTICE\ Unless expressly stated otherwise, this messag...{{dropped}}

Normalization Normalization • 1.4k views

ADD COMMENT • link updated 21.3 years ago by Wolfgang Huber ★ 13k • written 21.3 years ago by Deng, Shibing ▴ 10

0

Entering edit mode

Hinnerk Boriss ▴ 220

@hinnerk-boriss-111

Last seen 11.5 years ago

Dear Shibing, I do not recommend using house keeping genes for normalization. In several experiments they turn out being differentially expressed. A better approach would be to use a normalization method that searches for an invariant set of genes in the sample. "VSN" and Li & Wong's "invariant set" do that. The methods have limits though regarding the minimum proportion of not differentially expressed genes. Below 30% things become difficult. Another aspect you should be aware of is that a bias in the treatment effect, i.e. treatment causes either mostly up- or down-regulation of genes, will distort your normalization. VSN is most robust against this bias. Just an idea for you chip design: make a list of all genes that you think could react to the planned treatment for 70-80% of your probe sets, then take a random sample from all the remaining genes (of which you have no prior evidence for differential expression) to design the remaining 20-30% of the chip. This should get you a way out your normalization problem typical for custom chips. In fact, you could restrict the invariant set algorithm to search only in the random selection of genes. Cheers, Hinnerk -----Original Message----- From: bioconductor-bounces@stat.math.ethz.ch [mailto:bioconductor-bounces@stat.math.ethz.ch] On Behalf Of Deng, Shibing Sent: Thursday, November 11, 2004 10:08 PM To: 'bioconductor@stat.math.ethz.ch' Subject: [BioC] normalization for custom chip Hi, We are designing a custom Affymetrix chip with about 1700 genes. By design, a large number of genes on the chip will be differentially expressed between our treatment and control samples. The assumption for quantile normalization and other distribution-based normalization methods will not hold for these chips. To normalize them, we plan to put some "house-keeping" genes or "invariant" genes covering a wide range of intensities on the chip. We are not sure how many house-keeping genes we should have to get a good normalization? I will appreciate your input on this issue. Shibing LEGAL NOTICE\ Unless expressly stated otherwise, this messag...{{dropped}} _______________________________________________ Bioconductor mailing list Bioconductor@stat.math.ethz.ch https://stat.ethz.ch/mailman/listinfo/bioconductor

ADD COMMENT • link 21.3 years ago Hinnerk Boriss ▴ 220

0

Entering edit mode

Hi Hinnerk, I am also tryying to normalize a custom chip. Could you please tell me what VSN stands for and where can I find that method for normalization and a reference for it? Thanks a lot Inma On Fri, 12 Nov 2004, Hinnerk Boriss wrote: >_Dear Shibing, >_ >_I do not recommend using house keeping genes for normalization. In several >_experiments they turn out being differentially expressed. A better approach >_would be to use a normalization method that searches for an invariant set of >_genes in the sample. "VSN" and Li & Wong's "invariant set" do that. The >_methods have limits though regarding the minimum proportion of not >_differentially expressed genes. Below 30% things become difficult. Another >_aspect you should be aware of is that a bias in the treatment effect, i.e. >_treatment causes either mostly up- or down-regulation of genes, will distort >_your normalization. VSN is most robust against this bias. >_ >_Just an idea for you chip design: make a list of all genes that you think >_could react to the planned treatment for 70-80% of your probe sets, then >_take a random sample from all the remaining genes (of which you have no >_prior evidence for differential expression) to design the remaining 20-30% >_of the chip. This should get you a way out your normalization problem >_typical for custom chips. In fact, you could restrict the invariant set >_algorithm to search only in the random selection of genes. >_ >_Cheers, >_Hinnerk >_ >_ >_-----Original Message----- >_From: bioconductor-bounces@stat.math.ethz.ch >_[mailto:bioconductor-bounces@stat.math.ethz.ch] On Behalf Of Deng, Shibing >_Sent: Thursday, November 11, 2004 10:08 PM >_To: 'bioconductor@stat.math.ethz.ch' >_Subject: [BioC] normalization for custom chip >_ >_Hi, >_We are designing a custom Affymetrix chip with about 1700 genes. By design, >_a large number of genes on the chip will be differentially expressed between >_our treatment and control samples. The assumption for quantile normalization >_and other distribution-based normalization methods will not hold for these >_chips. To normalize them, we plan to put some "house-keeping" genes or >_"invariant" genes covering a wide range of intensities on the chip. We are >_not sure how many house-keeping genes we should have to get a good >_normalization? I will appreciate your input on this issue. >_ >_Shibing >_ >_ >_LEGAL NOTICE\ Unless expressly stated otherwise, this messag...{{dropped}} >_ >________________________________________________ >_Bioconductor mailing list >_Bioconductor@stat.math.ethz.ch >_https://stat.ethz.ch/mailman/listinfo/bioconductor >_ >________________________________________________ >_Bioconductor mailing list >_Bioconductor@stat.math.ethz.ch >_https://stat.ethz.ch/mailman/listinfo/bioconductor >_ --

ADD REPLY • link 21.3 years ago M Inmaculada Barrasa ▴ 50

0

Entering edit mode

Hi Inma, you can check the list of released packages on the Bioconductor webpage, and have a look at the package called vsn. In there, you'll also find a vignette and some literature references. library(reposTools) install.packages2("vsn", develOK=TRUE) library("vsn") openVignette("vsn") Best wishes Wolfgang ------------------------------------- Wolfgang Huber European Bioinformatics Institute European Molecular Biology Laboratory Wellcome Trust Genome Campus Cambridge CB10 1SD England Phone: +44 1223 494642 Http: www.dkfz.de/abt0840/whuber ------------------------------------- M Inmaculada Barrasa wrote: > Hi Hinnerk, > > I am also tryying to normalize a custom chip. > Could you please tell me what VSN stands for and where can I find that > method for normalization and a reference for it? > > Thanks a lot > > Inma > > > > > > On Fri, 12 Nov 2004, Hinnerk Boriss wrote: > > >>_Dear Shibing, >>_ >>_I do not recommend using house keeping genes for normalization. In several >>_experiments they turn out being differentially expressed. A better approach >>_would be to use a normalization method that searches for an invariant set of >>_genes in the sample. "VSN" and Li & Wong's "invariant set" do that. The >>_methods have limits though regarding the minimum proportion of not >>_differentially expressed genes. Below 30% things become difficult. Another >>_aspect you should be aware of is that a bias in the treatment effect, i.e. >>_treatment causes either mostly up- or down-regulation of genes, will distort >>_your normalization. VSN is most robust against this bias. >>_ >>_Just an idea for you chip design: make a list of all genes that you think >>_could react to the planned treatment for 70-80% of your probe sets, then >>_take a random sample from all the remaining genes (of which you have no >>_prior evidence for differential expression) to design the remaining 20-30% >>_of the chip. This should get you a way out your normalization problem >>_typical for custom chips. In fact, you could restrict the invariant set >>_algorithm to search only in the random selection of genes. >>_ >>_Cheers, >>_Hinnerk >>_ >>_ >>_-----Original Message----- >>_From: bioconductor-bounces@stat.math.ethz.ch >>_[mailto:bioconductor-bounces@stat.math.ethz.ch] On Behalf Of Deng, Shibing >>_Sent: Thursday, November 11, 2004 10:08 PM >>_To: 'bioconductor@stat.math.ethz.ch' >>_Subject: [BioC] normalization for custom chip >>_ >>_Hi, >>_We are designing a custom Affymetrix chip with about 1700 genes. By design, >>_a large number of genes on the chip will be differentially expressed between >>_our treatment and control samples. The assumption for quantile normalization >>_and other distribution-based normalization methods will not hold for these >>_chips. To normalize them, we plan to put some "house-keeping" genes or >>_"invariant" genes covering a wide range of intensities on the chip. We are >>_not sure how many house-keeping genes we should have to get a good >>_normalization? I will appreciate your input on this issue. >>_ >>_Shibing

ADD REPLY • link 21.3 years ago Wolfgang Huber ★ 13k

0

Entering edit mode

Wolfgang Huber ★ 13k

@wolfgang-huber-3550

Last seen 4 months ago

EMBL European Molecular Biology Laborat…

M Inmaculada Barrasa wrote: > Hi Wolfram, > I read this in a previous e-mail on this thread. > >> I do not recommend using house keeping genes for normalization. In >> several experiments they turn out being differentially expressed. >> A better approach would be to use a normalization method that searches >> for an invariant set of genes in the sample. "VSN" and Li & Wong's >> "invariant set" do that. The methods have limits though regarding >> the minimum proportion of not differentially expressed genes. >> Below 30% things become difficult. .... > > > Can you please give more details on how would you apply VSN to search for > invariant set of genes that can be later used to normalize the array. > > Sorry if I am making so sense. > I will read your paper carefully > > Inma Hi Inma, vsn does not directly search for an "invariant set". Rather, it regards the fitting of scale factors and background offsets as a parameter estimation problem and uses a standard robust estimation technique for this. The performance of this estimator, and in particular the sensitivity to total number and asymmetric proportions of up- and down-regulated genes, is discussed in (hopefully exhaustive) detail in the paper Parameter estimation for the calibration and variance stabilization of microarray data. W. Huber, A. von Heydebreck, H. S?ltmann, A. Poustka, M. Vingron. Statistical Applications in Genetics and Molecular Biology 2003 Vol. 2: No. 1, Article 3 http://www.bepress.com/sagmb/vol2/iss1/art3 in particular, Fig. 8 shows that up to about 30% of differentially expressed genes are OK that are all up or all down, and more, if they are more symmetric. -- Best wishes Wolfgang ------------------------------------- Wolfgang Huber European Bioinformatics Institute European Molecular Biology Laboratory Wellcome Trust Genome Campus Cambridge CB10 1SD England Phone: +44 1223 494642 Http: www.dkfz.de/abt0840/whuber

ADD COMMENT • link 21.3 years ago Wolfgang Huber ★ 13k

Login before adding your answer.