Heatmap of the Count Matrix
1
0
Entering edit mode
@b39b3713
Last seen 3 months ago
United Kingdom

When following your workflow in https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#standard-workflow, why do you use rowMeans to display the Heatmap of count matrix? Can I use data from PCA to select the top 20 genes? In my case, I have 6,770 genes. If I choose only the top 20 genes, I assume it does not accurately represent the genes, so I want to use PCA loadings. Does it make sense?

library("pheatmap")
select <- order(rowMeans(counts(dds,normalized=TRUE)),
                decreasing=TRUE)[1:20]
df <- as.data.frame(colData(dds)[,c("condition","type")])
pheatmap(assay(ntd)[select,], cluster_rows=FALSE, show_rownames=FALSE,
         cluster_cols=FALSE, annotation_col=df)

sessionInfo( )
rnaseqGene • 2.1k views
ADD COMMENT
0
Entering edit mode

Perhaps PCA loadings offer a more comprehensive perspective. Now, thinking back, I recall struggling with a similar data reduction dilemma in my student days, trying to represent complex network traffic patterns using only a few key metrics. It felt like navigating a tricky Slither io level, trying to capture the essence without getting swallowed by the details.

ADD REPLY
1
Entering edit mode
ATpoint ★ 4.9k
@atpoint-13662
Last seen 9 hours ago
Germany

This section of the vignette is merely a very basic data exploration and by no means set into stone. The rowMeans is just an ordering function here, do whatever you feel is appropriate to explore your data properly.

ADD COMMENT

Login before adding your answer.

Traffic: 691 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6