4 main questions about making complicated model matrix and fitting the model
0
0
Entering edit mode
@gordon-smyth
Last seen 53 minutes ago
WEHI, Melbourne, Australia
Dear Mingkwan, I'll make some short responses below. I think though that your design is getting too complex to give help by email like this. This is the sort of data problem for which it is usually beneficial to establish a collaborative relationship with a statistical bioinformatics specialist, at your own institution or nearby, rather than trying to do it all yourself, unless you are a statistician yourself. > Date: Tue, 7 Dec 2010 19:05:45 +0100 > From: Mingkwan Nipitwattanaphon <mingkwan.nipitwattanaphon at="" unil.ch=""> > To: bioconductor at r-project.org > Subject: [BioC] 4 main questions about making complicated model matrix > and fitting the model > Message-ID: <f3479687-f88d-41f2-b00e-e18295c5fd7b at="" unil.ch=""> > Content-Type: text/plain > > Dear BioC users, > > I am using 2 color-spotted microarrays. My samples are queen ants > with different genotypes (BB=D, Bb=H, bb=R), social forms (Monogyne, > Polygyne) and ages (young virgin queen=2d, mature virgin queen=11d, > and mated/reproductive/ mother queen=mom). They are hybridized to the > reference RNA. > > I would like to analyze my data by making model matrix like > y~ Age + Genotype *nested within* Social form + Age : Genotype *nested > within* Social form + fixed factor (Batch) > > I have tried to analyze my data as show below and I got many questions. > > 1. Differences between different commands of model.matrix. From the > examples below, I know that ~0 or ~1 is just whether I want the > intercept or not but when there is an intercept, one of the groups is > disappeared from the model (in this case, batchI or M11dD is > disappeared depending on which factor comes first). Would this affect > the result? Is the order of the factors in the model important? Yes, it does affect the result. Yes, it is important. You shouldn't normally be trying to remove the intercept here. Doing so doesn't change the model that is fitted, so the total number of coefficients doesn't change, but it does change the parametrization. To see how this works, try experimenting on your own with a oneway anova type data set, to get familiar with how model.matrix works. > 2. Although I have 3 different genotypes, 2 social forms and 3 time > points, some combinations are biologically impossible, e.g. M11dH, > M2dH, MmomH, M11dR, M2dR, MmomR, PmomD, PmomR. When I use the command > below, limma gives me all of the combinations but only some do exist. > So, the ones with do not exist, have only 0 value in all rows. These > groups will result in NA values for all the spots after the model fit. > Is it OK to have NA values or it is better to remove these groups > from the model matrix before fitting the model? I don't know what your data looks like, but this suggests to me that the linear model formula is not appropriate for your data. It will therefore be difficult to interpret the resulting coefficients. You probably need a different approach. > 3. Problem with model fit > After fitting the model. I got NA values for all spots of all the > groups that do not exist. This could be normal, but the group "P11dR" > do exist. Why did I also get NAs from this group? > > isGene <- MA$genes$Status == "cDNA" > > fitDesign1 <- lmFit(MA[isGene,], design1) > Coefficients not estimable: M11dH M2dH MmomH M11dR M2dR MmomR PmomD > P11dR P2dR PmomR Your linear model contains more terms than can be estimated from your data. You probably need a different approach. Best to show your data to a statistician, who can work with you, to consider how your data should be analysed. > 4. After fitting the model, do I need to make contrasts between > interesting comparisons? Although all slides are compared to reference, > with this complicated linear model, I am not sure that if I do not make > contrast, the limma will give me the coefficient of each group compared > to reference or something else. The fact that not everything can be estimated makes the interpretation of your model difficult. Best wishes Gordon > I truly appreciate all the answers in advance! > > best regards, > > Mingkwan Nipitwattanaphon ______________________________________________________________________ The information in this email is confidential and intend...{{dropped:4}}
limma limma • 929 views
ADD COMMENT

Login before adding your answer.

Traffic: 695 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6