(no subject)

0

Entering edit mode

abdul rawoof ▴ 60

@abdul-rawoof-5869

Last seen 5.4 years ago

United States

Hello, I want to do an intergrated network analysis for my Differential Expressed genes. So could anyone tell me how can I do. Which package should I use. Thanks, Abdul Rawoof [[alternative HTML version deleted]]

Network Network • 1.2k views

ADD COMMENT • link updated 11.0 years ago by Gordon Smyth 50k • written 11.0 years ago by abdul rawoof ▴ 60

0

Entering edit mode

Sam McInturf ▴ 300

@sam-mcinturf-5291

Last seen 8.6 years ago

United States

Abdul, Can you tell more about your project and what exactly you mean by integrated network analysis. What are you integrating into your DE genes? what organism are you working with? What are you trying to find in the data. Is the data RNA seq or micro array (and which platform). bioconductor has many packages that you can use to handle networks, but the `best` package depends on your goals / aims / resources. Best On Tue, May 7, 2013 at 5:57 AM, Abdul Rawoof <abdul87.edu@gmail.com> wrote: > Hello, > > I want to do an intergrated network analysis for my Differential Expressed > genes. So could anyone tell me how can I do. Which package should I use. > > Thanks, > Abdul Rawoof > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor@r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > -- Sam McInturf [[alternative HTML version deleted]]

ADD COMMENT • link 11.0 years ago Sam McInturf ▴ 300

0

Entering edit mode

Conversation went off the list: Dear Sam, I am working with Microarray Data analysis for illumina's WG-DASL assay based Human sample. Actually I want to know Gene-Gene interaction between my differentially expressed genes. Thanks, Abdul Rawoof ______________________________________________________________________ ___________ Abdul, When you say gene-gene interaction, what exactly do you mean? Are you looking for protein-protein binding data for your DE genes? Or are you after other genes that have been shown to (or may) be co-regulated? There are many ways to do analysis, and everything has pros and cons, and if you aren't careful your analysis will mean nothing because you applying the wrong method/models. In order to answer your question, we must know exactly what you want to do and find out. Integrated network analysis and gene-gene interaction can have several interpretations. Please, be verbose Best, On Tue, May 7, 2013 at 9:59 AM, Sam McInturf <smcinturf@gmail.com> wrote: > Abdul, > Can you tell more about your project and what exactly you mean > by integrated network analysis. What are you integrating into your DE > genes? what organism are you working with? What are you trying to find in > the data. Is the data RNA seq or micro array (and which platform). > > bioconductor has many packages that you can use to handle networks, but > the `best` package depends on your goals / aims / resources. > > Best > > > On Tue, May 7, 2013 at 5:57 AM, Abdul Rawoof <abdul87.edu@gmail.com>wrote: > >> Hello, >> >> I want to do an intergrated network analysis for my Differential Expressed >> genes. So could anyone tell me how can I do. Which package should I use. >> >> Thanks, >> Abdul Rawoof >> >> [[alternative HTML version deleted]] >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor@r-project.org >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: >> http://news.gmane.org/gmane.science.biology.informatics.conductor >> > > > > -- > Sam McInturf > -- Sam McInturf [[alternative HTML version deleted]]

ADD REPLY • link 11.0 years ago Sam McInturf ▴ 300

0

Entering edit mode

Gordon Smyth 50k

@gordon-smyth

Last seen 6 minutes ago

WEHI, Melbourne, Australia

Dear Hilary, Your test for the 3-way interaction is correct, although 3-way interactions are pretty hard to interpret. However testing for the 2-way interaction in the presence of a 3-way interaction does not make statistical sense. This is because the parametrization of the 2-way interaction as a subset of the 3-way is somewhat arbitrary. Before you can test the 2-way interaction species*treatment in a meaningful way you would need to accept that the 3-way interaction is not necessary and remove it from the model. In general, I am of the opinion that classical statistical factorial interation models do not usually provide the most meaningful parametrizations for genomic experiments. In most cases, I prefer to fit the saturated model (a different level for each treatment combination) and make specific contrasts. There is some discussion of this in the limma User's Guide. In your case, I guess that you might want to test for species*treatment interaction separately at each time point. It is almost impossible to do this within the classical 3-way factorial setup. However it is easy with the one-way approach I just mentioned, or else you could use: ~Age + Age:Species + Age:Treatment + Age:Species:Treatment Best wishes Gordon > Date: Thu, 9 May 2013 14:55:46 -0400 > From: Hilary Smith <hilary.a.smith.964 at="" nd.edu=""> > To: "bioconductor at r-project.org" <bioconductor at="" r-project.org=""> > Subject: [BioC] Statistics question for multi-factor interaction test > in edgeR > > Hi. I need to generate two GLM tests of a factorial design with RNA- Seq > count data. I have 3 factors with 2 levels apiece (2 species X 2 > treatments X 2 times), and 4 separate replicates each (i.e., we made a > total of 2*2*2*4 = 32 separate libraries). Our main interest is in the > interaction of species*treatment, as we think species A will alter gene > expression in the treatment stress vs. treatment benign, whereas > species B is expected to show little change. However, we?d like to also > do another test of species*treatment*time, because it is possible that > the ability of species A to alter gene expression in response to the > stress treatment may differ at the 1st versus 2nd time point. > > I think the way to set this up, is to create a design matrix as follows, > with the lrt test with coef 5 giving the differentially expressed genes > for the species*treatment test, and coef 8 giving the the differentially > expressed gene for the species*treatment*time test (after calling > topTags that is). Yet to ensure I have the statistics correct, my > questions are: (1) is this thinking correct, as I don?t see many 3x2 > factorial models to follow, and (2) do I need to set up a reference > somehow (which I assume would be the set of four samples with > TreatmentBenign*SpeciesB*Time2, but I?m not fully sure if that is > correct or needed). > > Many thanks in advance for your insight! > ~Hilary > >> designFF <- model.matrix(~Treatment*Species*Age) >> colnames(designFF) > [1] "(Intercept)" > [2] " TreatmentStress" > [3] "SpeciesA " > [4] "Time1" > [5] "TreatmentStress:SpeciesA" > [6] "TreatmentStress:Time1" > [7] "SpeciesA:Time1" > [8] "TreatmentStress:SpeciesA:Time1" > > And then to run tests with: >> fit <- glmFit(y, designFF) > >> lrtInteractionStressSpecies <- glmLRT(fitFF, coef=5) >> lrtInteractionStressSpeciesTime <- glmLRT(fitFF, coef=8) > ______________________________________________________________________ The information in this email is confidential and intend...{{dropped:5}}

ADD COMMENT • link 11.0 years ago Gordon Smyth 50k

0

Entering edit mode

Hi Gordon, In a previous email on the list you said: > testing for the 2-way interaction in the presence of a 3-way > interaction does not make statistical sense. This is because the > parametrization of the 2-way interaction as a subset of the 3-way is > somewhat arbitrary. Before you can test the 2-way interaction > species*treatment in a meaningful way you would need to accept that > the 3-way interaction is not necessary and remove it from the model. Does this mean that it is impossible to test for a 2-way interaction when your model includes a 3-way interaction term? Or does it just mean that the parametrization provided by "model.matrix(~1+factor1*factor2*factor3)" is such that the 2-way interaction is not represented by any coefficient, but rather by a complex contrast? > I prefer to fit the saturated model (a different level for each > treatment combination) and make specific contrasts. There is some > discussion of this in the limma User's Guide. If I understand correctly here, you are saying that one can fit a model where each coefficient represents the abundance for one specific combination of the 3 factors, as in the Limma User's Guide section 8.5.2 "Analysing as for a Single Factor". In other words, one could do "model.matrix(~0+factor1:factor2:factor3)" and this would be an alternate parametrization of the same design. And with this parametrization, all the 2- and 3-way interaction terms (and simple pairwise comparisons) can easily be tested from the single full 3-way interaction by specifying the appropriate contrasts. Do I understand correctly? -Ryan Thompson [[alternative HTML version deleted]]

ADD REPLY • link 11.0 years ago Ryan C. Thompson ★ 7.9k

0

Entering edit mode

On Sun, 12 May 2013, Ryan C. Thompson wrote: > Hi Gordon, > > In a previous email on the list you said: > >> testing for the 2-way interaction in the presence of a 3-way >> interaction does not make statistical sense. This is because the >> parametrization of the 2-way interaction as a subset of the 3-way is >> somewhat arbitrary. Before you can test the 2-way interaction >> species*treatment in a meaningful way you would need to accept that the >> 3-way interaction is not necessary and remove it from the model. > > Does this mean that it is impossible to test for a 2-way interaction > when your model includes a 3-way interaction term? It is mathematically possible but has no scientific meaning. This is called the marginality principle in linear models: http://en.wikipedia.org/wiki/Principle_of_marginality > Or does it just mean that the parametrization provided by > "model.matrix(~1+factor1*factor2*factor3)" is such that the 2-way > interaction is not represented by any coefficient, but rather by a > complex contrast? The same principle applies regardless of the parametrization. >> I prefer to fit the saturated model (a different level for each treatment >> combination) and make specific contrasts. There is some discussion of this >> in the limma User's Guide. > If I understand correctly here, you are saying that one can fit a model > where each coefficient represents the abundance for one specific > combination of the 3 factors, as in the Limma User's Guide section 8.5.2 > "Analysing as for a Single Factor". In other words, one could do > "model.matrix(~0+factor1:factor2:factor3)" and this would be an > alternate parametrization of the same design. And with this > parametrization, all the 2- and 3-way interaction terms (and simple > pairwise comparisons) can easily be tested from the single full 3-way > interaction by specifying the appropriate contrasts. Do I understand > correctly? Yes, I am recommending the group mean parametrization, as in the limma User's Guide Section 8.5.2 or edgeR User's Guide Section 3.3.1. I recommend this parametrization because each contrast that is drawn has an explicit meaning in terms of comparisons of groups and can be interpretted on its own terms. The original poster did what I intended. Best Gordon > -Ryan Thompson > ______________________________________________________________________ The information in this email is confidential and intend...{{dropped:4}}

ADD REPLY • link 11.0 years ago Gordon Smyth 50k

0

Entering edit mode

Dear Ryan, The marginality principle is most easily understood in a 2-way factorial model. Suppose you have a 2x2 factorial experiment with two genotypes and two treatments (active vs control). If a two-way interaction exists, then this means that the treatment effect is different for the genotypes. It makes no sense to test for a "treatment effect" in this situation (even though mathematical models allow you to do so) because there is no consistent treatment effect without specifying the genotype. On the other hand, it always meaningful to test for a treatment effect in the two genotypes separately, and then to ask whether the two treatment effects are consistent or different. In a 3-way factorial model, a 3-way interaction means that the experiment cannot be reduced to 2-way marginals in any meaningful way. Best wishes Gordon On Mon, 13 May 2013, Gordon K Smyth wrote: > On Sun, 12 May 2013, Ryan C. Thompson wrote: > >> Hi Gordon, >> >> In a previous email on the list you said: >> >>> testing for the 2-way interaction in the presence of a 3-way interaction >>> does not make statistical sense. This is because the parametrization of >>> the 2-way interaction as a subset of the 3-way is somewhat arbitrary. >>> Before you can test the 2-way interaction species*treatment in a >>> meaningful way you would need to accept that the 3-way interaction is not >>> necessary and remove it from the model. >> >> Does this mean that it is impossible to test for a 2-way interaction when >> your model includes a 3-way interaction term? > > It is mathematically possible but has no scientific meaning. This is called > the marginality principle in linear models: > > http://en.wikipedia.org/wiki/Principle_of_marginality > >> Or does it just mean that the parametrization provided by >> "model.matrix(~1+factor1*factor2*factor3)" is such that the 2-way >> interaction is not represented by any coefficient, but rather by a complex >> contrast? > > The same principle applies regardless of the parametrization. > >>> I prefer to fit the saturated model (a different level for each treatment >>> combination) and make specific contrasts. There is some discussion of this >>> in the limma User's Guide. > >> If I understand correctly here, you are saying that one can fit a model >> where each coefficient represents the abundance for one specific >> combination of the 3 factors, as in the Limma User's Guide section 8.5.2 >> "Analysing as for a Single Factor". In other words, one could do >> "model.matrix(~0+factor1:factor2:factor3)" and this would be an alternate >> parametrization of the same design. And with this parametrization, all the >> 2- and 3-way interaction terms (and simple pairwise comparisons) can easily >> be tested from the single full 3-way interaction by specifying the >> appropriate contrasts. Do I understand correctly? > > Yes, I am recommending the group mean parametrization, as in the limma User's > Guide Section 8.5.2 or edgeR User's Guide Section 3.3.1. > > I recommend this parametrization because each contrast that is drawn has an > explicit meaning in terms of comparisons of groups and can be interpretted on > its own terms. > > The original poster did what I intended. > > Best > Gordon > >> -Ryan Thompson >> > ______________________________________________________________________ The information in this email is confidential and intend...{{dropped:4}}

ADD REPLY • link 11.0 years ago Gordon Smyth 50k

0

Entering edit mode

Dear Gordon, I understand now. The 3-way experiment cannot be reduced to 2-way marginals, but one can still test for 2-way interactions within each level of the 3rd factor. That matches with my own previous understanding, and my confusion was just one of terminology. Thank you for the clarification! -Ryan On Sun May 12 18:29:20 2013, Gordon K Smyth wrote: > Dear Ryan, > > The marginality principle is most easily understood in a 2-way > factorial model. Suppose you have a 2x2 factorial experiment with two > genotypes and two treatments (active vs control). > > If a two-way interaction exists, then this means that the treatment > effect is different for the genotypes. It makes no sense to test for > a "treatment effect" in this situation (even though mathematical > models allow you to do so) because there is no consistent treatment > effect without specifying the genotype. > > On the other hand, it always meaningful to test for a treatment effect > in the two genotypes separately, and then to ask whether the two > treatment effects are consistent or different. > > In a 3-way factorial model, a 3-way interaction means that the > experiment cannot be reduced to 2-way marginals in any meaningful way. > > Best wishes > Gordon > > > On Mon, 13 May 2013, Gordon K Smyth wrote: > >> On Sun, 12 May 2013, Ryan C. Thompson wrote: >> >>> Hi Gordon, >>> >>> In a previous email on the list you said: >>> >>>> testing for the 2-way interaction in the presence of a 3-way >>>> interaction does not make statistical sense. This is because the >>>> parametrization of the 2-way interaction as a subset of the 3-way >>>> is somewhat arbitrary. Before you can test the 2-way interaction >>>> species*treatment in a meaningful way you would need to accept that >>>> the 3-way interaction is not necessary and remove it from the model. >>> >>> Does this mean that it is impossible to test for a 2-way interaction >>> when your model includes a 3-way interaction term? >> >> It is mathematically possible but has no scientific meaning. This is >> called the marginality principle in linear models: >> >> http://en.wikipedia.org/wiki/Principle_of_marginality >> >>> Or does it just mean that the parametrization provided by >>> "model.matrix(~1+factor1*factor2*factor3)" is such that the 2-way >>> interaction is not represented by any coefficient, but rather by a >>> complex contrast? >> >> The same principle applies regardless of the parametrization. >> >>>> I prefer to fit the saturated model (a different level for each >>>> treatment combination) and make specific contrasts. There is some >>>> discussion of this in the limma User's Guide. >> >>> If I understand correctly here, you are saying that one can fit a >>> model where each coefficient represents the abundance for one >>> specific combination of the 3 factors, as in the Limma User's Guide >>> section 8.5.2 "Analysing as for a Single Factor". In other words, >>> one could do "model.matrix(~0+factor1:factor2:factor3)" and this >>> would be an alternate parametrization of the same design. And with >>> this parametrization, all the 2- and 3-way interaction terms (and >>> simple pairwise comparisons) can easily be tested from the single >>> full 3-way interaction by specifying the appropriate contrasts. Do I >>> understand correctly? >> >> Yes, I am recommending the group mean parametrization, as in the >> limma User's Guide Section 8.5.2 or edgeR User's Guide Section 3.3.1. >> >> I recommend this parametrization because each contrast that is drawn >> has an explicit meaning in terms of comparisons of groups and can be >> interpretted on its own terms. >> >> The original poster did what I intended. >> >> Best >> Gordon >> >>> -Ryan Thompson >>> >> > > ______________________________________________________________________ > The information in this email is confidential and inte...{{dropped:6}}

ADD REPLY • link 11.0 years ago Ryan C. Thompson ★ 7.9k

Login before adding your answer.