vennDiagram Statistics Advice

0

Entering edit mode

Noah Dowell ▴ 410

@noah-dowell-3791

Last seen 9.6 years ago

Dear All, I have used the excellent limma package to analyze my 2-color Agilent Yeast microarray data and have determined the differentially expressed genes (as compared to wild-type expression) in four independent experiments. I am showing the summary of my results below: > results <- decideTests(fit2, method="global") > summary(results) # mutant1 mutant2 mutant3 deletionstrain #-1 147 126 40 252 # 0 5924 6033 6171 5600 # 1 185 97 45 404 The three mutant experiments represent expression data from cells expressing different point mutants in the gene that is deleted in the expression strain. The point mutation in Mutant3 is a control that should not affect the protein's function given our current knowledge of how the protein works therefore the relatively small number of genes differentially expressed as compared to wild-type is consistent with our using that strain as a "control." Mutants1 and 2 are similar in their biological defects. They are also hypomorphic alleles as compared to the complete deletion strain so the smaller number of differentially expressed genes is again consistent with our working model. I have created vennDiagrams of these four experiments to look at the overlap of differential expression between experiments. My question is if there is a test I can run on the vennDiagrams (or simply on the overlapping gene lists) to show that there is significantly more overlap between mutant 1 or mutant 2 and the deletion strain when compared to mutant 3? Thank you for your time and input! Noah

Microarray limma Microarray limma • 1.1k views

ADD COMMENT • link updated 14.0 years ago by Wolfgang Huber ★ 13k • written 14.0 years ago by Noah Dowell ▴ 410

0

Entering edit mode

Wolfgang Huber ★ 13k

@wolfgang-huber-3550

Last seen 18 days ago

EMBL European Molecular Biology Laborat…

Dear Noah a first, immediate answer is to look at ? fisher.test Some light reading on its background can be found here: http://en.wikipedia.org/wiki/Fisher%27s_exact_test and see e.g. Agresti's excellent book 'Categorial Data Analysis' for something more substantial. The second, more conceptual and perhaps more useful answer is to use heatmaps and clustering of the expression profiles to show the desired similarity relationships between the 4 expression profiles. Simply using Euclidean distance, say on the subset of genes that is the union of all differentially expressed genes, on the fold changes should be a good start. Best wishes Wolfgang Noah Dowell scripsit 27/04/10 00:37: > Dear All, > > I have used the excellent limma package to analyze my 2-color Agilent Yeast microarray data and have determined the differentially expressed genes (as compared to wild-type expression) in four independent experiments. I am showing the summary of my results below: > >> results <- decideTests(fit2, method="global") > >> summary(results) > > # mutant1 mutant2 mutant3 deletionstrain > #-1 147 126 40 252 > # 0 5924 6033 6171 5600 > # 1 185 97 45 404 > > > The three mutant experiments represent expression data from cells expressing different point mutants in the gene that is deleted in the expression strain. The point mutation in Mutant3 is a control that should not affect the protein's function given our current knowledge of how the protein works therefore the relatively small number of genes differentially expressed as compared to wild-type is consistent with our using that strain as a "control." > > Mutants1 and 2 are similar in their biological defects. They are also hypomorphic alleles as compared to the complete deletion strain so the smaller number of differentially expressed genes is again consistent with our working model. > > I have created vennDiagrams of these four experiments to look at the overlap of differential expression between experiments. > > My question is if there is a test I can run on the vennDiagrams (or simply on the overlapping gene lists) to show that there is significantly more overlap between mutant 1 or mutant 2 and the deletion strain when compared to mutant 3? > > Thank you for your time and input! > > Noah > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Wolfgang Huber EMBL http://www.embl.de/research/units/genome_biology/huber

ADD COMMENT • link 14.0 years ago Wolfgang Huber ★ 13k

0

Entering edit mode

Thank you Wolfgang!! The Fisher test looks like it will work nicely. I will try to get my hands on the Agresti book to make sure I am using it correctly. I will take another stab at a heatmap approach on specific subsets. My initial attempts at clustering and heatmap presentation were less than convincing. My statistician friends suggested it was due to less than robust data and my biologist friends suggested that it was inherent to looking at subtle (single point mutants versus complete knock outs) biological effects. Hopefully we can all meet somewhere in the middle:) Best, noah On Apr 27, 2010, at 4:56 AM, Wolfgang Huber wrote: > > Dear Noah > > a first, immediate answer is to look at > ? fisher.test > > Some light reading on its background can be found here: http://en.wikipedia.org/wiki/Fisher%27s_exact_test > and see e.g. Agresti's excellent book 'Categorial Data Analysis' for something more substantial. > > The second, more conceptual and perhaps more useful answer is to use heatmaps and clustering of the expression profiles to show the desired similarity relationships between the 4 expression profiles. > > Simply using Euclidean distance, say on the subset of genes that is the union of all differentially expressed genes, on the fold changes should be a good start. > > Best wishes > Wolfgang > > > Noah Dowell scripsit 27/04/10 00:37: >> Dear All, >> I have used the excellent limma package to analyze my 2-color Agilent Yeast microarray data and have determined the differentially expressed genes (as compared to wild-type expression) in four independent experiments. I am showing the summary of my results below: >>> results <- decideTests(fit2, method="global") >>> summary(results) >> # mutant1 mutant2 mutant3 deletionstrain >> #-1 147 126 40 252 >> # 0 5924 6033 6171 5600 >> # 1 185 97 45 404 >> The three mutant experiments represent expression data from cells expressing different point mutants in the gene that is deleted in the expression strain. The point mutation in Mutant3 is a control that should not affect the protein's function given our current knowledge of how the protein works therefore the relatively small number of genes differentially expressed as compared to wild-type is consistent with our using that strain as a "control." >> Mutants1 and 2 are similar in their biological defects. They are also hypomorphic alleles as compared to the complete deletion strain so the smaller number of differentially expressed genes is again consistent with our working model. >> I have created vennDiagrams of these four experiments to look at the overlap of differential expression between experiments. >> My question is if there is a test I can run on the vennDiagrams (or simply on the overlapping gene lists) to show that there is significantly more overlap between mutant 1 or mutant 2 and the deletion strain when compared to mutant 3? >> Thank you for your time and input! >> Noah >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at stat.math.ethz.ch >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor > > > -- > > > Wolfgang Huber > EMBL > http://www.embl.de/research/units/genome_biology/huber > >

ADD REPLY • link 14.0 years ago Noah Dowell ▴ 410

Login before adding your answer.