Question

Re: multiple comparisons in limma [was: Now I get it(almost)]

0

Entering edit mode

Gordon Smyth 50k

@gordon-smyth

Last seen 1 hour ago

WEHI, Melbourne, Australia

At 05:48 AM 7/08/2004, Richard Friedman wrote: >Gordon, > > I now believe that I understand your answer. In order to do > adjust for both multiple comparisons and >multiple tests I use classifyTestsF() with method="fdr". If I understand >the documentation correctly, the fdr part >refers to the multiple test correction across genes on top of the multiple >comparison adjustment across levels performed > if no method were to be specified. > > Do I have it straight? No, not quite. classifyTestsF() doesn't actually have a 'method' option, so you can't use it with method="fdr". Please understand that making multiple adjustments simutaneously across genes and across contrasts (comparisons between levels for example) is still an unsolved problem, so it may be too much to expect neat solutions from limma or concise explanations from me. One way that you could proceed is to compute an F-test p-value for each gene, testing for any differences betwen the levels of your factor, and use p.adjust() to apply "fdr" adjustment to those p-values across genes. Then for those genes which are chosen as showing some differences, you can examine the individual contrast p-values more closely to decide which levels are different. Notice that the F-test p-value is computed automatically by eBayes() and is stored as fit$F.p.value in your MArrayLM linear model data object. Here is some example code which applies "fdr" adjustment in a hierarchical fashion, first at the overall gene level, and then at the contrast level after deciding on the gene cutoff: fit <- eBayes(fit) selectedgenes <- p.adjust(fit$F.p.value, method="fdr") < 0.05 pmax <- pmin(fit$F.p.value[!selectedgenes], na.rm=TRUE) results <- classifyTestsP(fit, p.value=pmax, method="fdr") Gordon >Thanks and best wishes, >Rich >On Aug 6, 2004, at 9:57 AM, Gordon Smyth wrote: > >>At 11:36 PM 6/08/2004, Richard Friedman wrote: >>>Gordon, >>> >>> Thank you for answering my questions. The last equation in your >>> paper makes intuitive sense to me. >>> >>> I'm wondering if you can take the time to answer two more >>> questions: >>> >>>1. Say I have the following case: >>> >>>Level A (3 replicates) >>>Level B (2 replicates) >>>Level C(1 replicate) >>>Level D(1 replicate) >>> >>>Can I legitimately calculate a P value for the contrast Level A to level >>>C in the linear model even though I have only one replicate on Level C. >>>I am not talking about just Limma here. I am talking about the linear >>>model in general. >> >>Given assumption of common variance across levels, yes. >> >>> Also, >>>I realize that one replicate is poor experimental design. This is what I >>>was given to analyze. >>> >>>2. If I wished to apply a multiple test correction to the pvalues from >>>non-orthogonal contrasts, would the following procedure be legitimate:: >>> >>> 1. Generate a pvalue for each contrast in the set of >>> nonothogonal contrasts for each gene using classifyTestsF(). >>> 2. Correct the pvalues using a multiple test correction such as >>> FDR. >> >>Nothing special about this design. All usual things, e.g. in limma, apply. >> >>Gordon >> >>>I realize that no multiple-test correction is entirely satisfactory, I >>>just want to get an approximate estimate of the p-values for each >>>contrast as a guide to further experimentation and literature searching. >>> >>>Best wishes, >>>Rich

limma limma • 1.5k views

ADD COMMENT • link updated 19.7 years ago by Richard Friedman ★ 2.0k • written 19.7 years ago by Gordon Smyth 50k

score 0 · Answer 1 · 2004-08-09

Gordon, Thank you very much. Although the problem is not neatly solved I would still appreciate your advice in the matter. Do you also suggest applying the FDR correction to the contrast p-values? Or should I use the uncorrected contrast p-values? Best wishes, Rich On Aug 7, 2004, at 2:51 AM, Gordon Smyth wrote: > At 05:48 AM 7/08/2004, Richard Friedman wrote: >> Gordon, >> >> I now believe that I understand your answer. In order to do >> adjust for both multiple comparisons and >> multiple tests I use classifyTestsF() with method="fdr". If I >> understand the documentation correctly, the fdr part >> refers to the multiple test correction across genes on top of the >> multiple comparison adjustment across levels performed >> if no method were to be specified. >> >> Do I have it straight? > > No, not quite. classifyTestsF() doesn't actually have a 'method' > option, so you can't use it with method="fdr". > > Please understand that making multiple adjustments simutaneously > across genes and across contrasts (comparisons between levels for > example) is still an unsolved problem, so it may be too much to expect > neat solutions from limma or concise explanations from me. > > One way that you could proceed is to compute an F-test p-value for > each gene, testing for any differences betwen the levels of your > factor, and use p.adjust() to apply "fdr" adjustment to those p-values > across genes. Then for those genes which are chosen as showing some > differences, you can examine the individual contrast p-values more > closely to decide which levels are different. > > Notice that the F-test p-value is computed automatically by eBayes() > and is stored as fit$F.p.value in your MArrayLM linear model data > object. > > Here is some example code which applies "fdr" adjustment in a > hierarchical fashion, first at the overall gene level, and then at the > contrast level after deciding on the gene cutoff: > > fit <- eBayes(fit) > selectedgenes <- p.adjust(fit$F.p.value, method="fdr") < 0.05 > pmax <- pmin(fit$F.p.value[!selectedgenes], na.rm=TRUE) > results <- classifyTestsP(fit, p.value=pmax, method="fdr") > > Gordon > >> Thanks and best wishes, >> Rich >> On Aug 6, 2004, at 9:57 AM, Gordon Smyth wrote: >> >>> At 11:36 PM 6/08/2004, Richard Friedman wrote: >>>> Gordon, >>>> >>>> Thank you for answering my questions. The last equation in >>>> your paper makes intuitive sense to me. >>>> >>>> I'm wondering if you can take the time to answer two more >>>> questions: >>>> >>>> 1. Say I have the following case: >>>> >>>> Level A (3 replicates) >>>> Level B (2 replicates) >>>> Level C(1 replicate) >>>> Level D(1 replicate) >>>> >>>> Can I legitimately calculate a P value for the contrast Level A to >>>> level C in the linear model even though I have only one replicate >>>> on Level C. I am not talking about just Limma here. I am talking >>>> about the linear model in general. >>> >>> Given assumption of common variance across levels, yes. >>> >>>> Also, >>>> I realize that one replicate is poor experimental design. This is >>>> what I was given to analyze. >>>> >>>> 2. If I wished to apply a multiple test correction to the pvalues >>>> from non-orthogonal contrasts, would the following procedure be >>>> legitimate:: >>>> >>>> 1. Generate a pvalue for each contrast in the set of >>>> nonothogonal contrasts for each gene using classifyTestsF(). >>>> 2. Correct the pvalues using a multiple test correction >>>> such as FDR. >>> >>> Nothing special about this design. All usual things, e.g. in limma, >>> apply. >>> >>> Gordon >>> >>>> I realize that no multiple-test correction is entirely >>>> satisfactory, I just want to get an approximate estimate of the >>>> p-values for each contrast as a guide to further experimentation >>>> and literature searching. >>>> >>>> Best wishes, >>>> Rich > > ------------------------------------------------------------ Richard A. Friedman, PhD Associate Research Scientist Herbert Irving Comprehensive Cancer Center Oncoinformatics Core Lecturer Department of Biomedical Informatics Box 95, Room 130BB or P&S 1-420C Columbia University Medical Center 630 W. 168th St. New York, NY 10032 (212)305-6901 (5-6901) (voice) friedman@cancercenter.columbia.edu http://cancercenter.columbia.edu/~friedman/ In Memoriam, Francis Crick