Hi there,
When trying to see tumor vs normal diferences in a subgroup fashion regarding the tumors but using all normals (tumor1 vs all normals, tumor2 vs all normals...etc) I am getting this error:
> vdata <- voom(all, design, plot=TRUE)
Coefficients not estimable: tumorSCC_4
Warning message:
Partial NA coefficients for 395 probe(s)
Design matrix looks like this
> design %>% head
normal tumor normalADC_2 normalADC_3 normalADC_4 normalLCC_1 normalLCC_2 normalSCC_1 normalSCC_2 normalSCC_3 normalSCC_4
T1 0 1 0 0 0 0 0 0 0 0 0
T2 0 1 0 0 0 0 0 0 0 0 0
T3 0 1 0 0 0 0 0 0 0 0 0
T4 0 1 0 0 0 0 0 0 0 0 0
T5 0 1 0 0 0 0 0 0 0 0 0
T6 0 1 0 0 0 0 0 0 0 0 0
tumorADC_1 tumorADC_2 tumorADC_3 tumorADC_4 tumorLCC_1 tumorLCC_2 tumorSCC_1 tumorSCC_2 tumorSCC_3 tumorSCC_4
T1 0 1 0 0 0 0 0 0 0 0
T2 0 0 1 0 0 0 0 0 0 0
T3 1 0 0 0 0 0 0 0 0 0
T4 1 0 0 0 0 0 0 0 0 0
T5 0 1 0 0 0 0 0 0 0 0
T6 0 0 1 0 0 0 0 0 0 0
So, for this specific analysis, I would be only interested in the columns "normal" and all tumors (tumorADC1 tumorADC2 tumorADC3 tumorADC4 tumorLCC1 tumorLCC2 tumorSCC1 tumorSCC2 tumorSCC3 tumorSCC4).
Any thoughts?
Thank you
Hi Gordon,
Sorry for being a little short on info in my previous answer. I'll try to make it clearer by answering those:
BUT, we also would like to check the effect of NORMAL tissues in any possible form:
And, in this particular case I was asking now:
Each of the tumor subgroups at a time compared to all normal samples and not only against the corresponding (by cluster) normal samples (as we know all normal samples behave more or less similarly between them by looking at a PCA plot).
Whereas the metadata looks like:
What im doing wrong in this example?
Additionally, do you think doing a paired analysis makes sense when actually the correlation between tumor and normal is very low?:
PD: All data come from a expression panel of only 395 genes, not the typical RNA-Seq.
The
type
variable is redundant because you have already coded the difference between tumor and normal into thegroup
variable. So you need to removetype
from the model.But then how do I specifically contrast each of the ADC tumors (4 groups) to normal, found in columnd type? I can't find the way of doing that.
I still can't tell how many groups you have or what you want to compare. What does
normalADC_2
mean for example?Anyway, just use contrasts in the usual way. It's straightforward to compare one group to the average of several others, or the average or normals to the average of tumors, or anything you need really.
Each number at the end of every column name in the design matrix means the cluster number of each sample for every tumor.
With your reply I just realized I was assigning a cluster also to normal samples (eg: "normalADC_2"), when actually every normal sample should remain as "normal".
Now I am able to run every comparison without the "Coefficients not estimable" warning.
Thank you!