-After i downloaded the TCGA raw gene counts. I created a DESeq object using Design=~1
-To visualize the normalized effect I ran this code lon <- log2(counts(dds, normalized=TRUE) + 1)
-I also ran this code r<-rlog(dds)
clustering both matrices results in different outcome.
Any insight???
"Zero inflated". This means, within a biological condition, observing counts like: {0,0,0,1000,0,2000,0,0,3000}. This is not a good match for the negative binomial, but is better modeled by a combination of two distributions: one component with a spike at 0 and another component for the large counts.