Hi All,
I have been using Deseq2 to analyze a dataset I have and ran into a problem I am not sure how to solve.
I have been using the following code to run deseq2 on my dataset:
dds <-DESeqDataSetFromMatrix(countData = ep,colData = cp,design = ~Risk)
dds <- estimateSizeFactors(dds)
rld <- rlog(dds)
plotPCA(rld, intgroup="Risk")
vsd <- varianceStabilizingTransformation(dds)
plotPCA(vsd, intgroup="Risk")
The two PCA plots I got look completely different, so I am not sure which transformation I should rely on for further analysis.
Any help?
Thanks
Thanks I have 2 groups that I want to compare in my dataset (of rna seq data) - one group contains 6 samples the other group contains 100 samples.
When I run Deseq2 I get more than 1000 DE genes. But for some reason when I plot pca using vsd and again using rlog I see different separation of the groups.
Interestingly, when I narrowed my analysis to 6 vs 6 the plots do look similar.
Is it a known problem comparing highly unequal number of groups?
Thanks!
Try blind=FALSE. This is recommended in the vignette when there are many large differences
Thank you. I tried to use it with the top 30 genes and it didn't work. I was wondering if the highly unequal size of the two compared groups bias the pca and the clustering, because when I narrow down to equal size of groups I do see clear separation (with both vsd and rlog).
I would really like to upload the plots but I don't know to which URL I should upload it.