Question

Heatmap of rnaseq data with z-score scale

0

Entering edit mode

giovanna ▴ 10

@a090cf12

Last seen 5 months ago

Italy

I have a doubt/question regarding the heatmap visualization of gene expression data obtained with bulk RNA-seq technology from different datasets, with z-score row scaling. By using the same list of genes, when the heatmap generated by using only samples from the same datasets heatmap highlights difference in the gene expression between patients vs controls (Figure1) but when the matrix include also samples from different datasets differences between patients and controls seem to disappear, while it seems to be opposite expression trends between samples from different datasets (Figure2). can you give me some suggestions on how to solve this problem?

heatmaps RNASeq • 1.3k views

ADD COMMENT • link updated 5 months ago by Paul Harrison ▴ 110 • written 5 months ago by giovanna ▴ 10

0

Entering edit mode

ADD REPLY • link 5 months ago giovanna ▴ 10

0

Entering edit mode

There can be considerable differences between datasets just due to different handling, sequencing method, batch effects, etc etc, and this might be dominating the differential expression you are seeing within your original dataset.

For a heatmap, you could consider trying to subtract out the dataset effect, for example with limma::removeBatchEffect.

I'm not sure if you have both conditions in both datasets? If not, there might not be much value in combining the datasets.

ADD REPLY • link 5 months ago Paul Harrison ▴ 110

score 0 · Answer 1 · 2025-05-21

0

Entering edit mode

James W. MacDonald 68k

@james-w-macdonald-5106

Last seen 1 day ago

United States

Without including the row names for the heatmap, it's impossible to know if the directionality is switched or not. Do note that the ordering of the rows will vary depending on what samples you include. In addition, the colors will vary depending on the number of samples as well, as the z-score is based on the mean and variance of all the samples.

Also, this question is off-topic for this site, as it has nothing to do with any particular Bioconductor packages. In future please direct general questions like this to biostars.org instead.

ADD COMMENT • link 5 months ago James W. MacDonald 68k

1

Entering edit mode

thank you for your reply. I'm sorry if the question is off topic. the analysis and relative construction of the heatmap was done using bioconductor packages (deseq2, heatmap.2), for this reason I inserted the discussion here. in any case here I have reported only one example but I have noticed that this observation is maintained with a notable number of different gene lists.

ADD REPLY • link 5 months ago giovanna ▴ 10

1

Entering edit mode

heatmap.2 is a CRAN package, and that's the relevant issue for this plot. It's got nothing to do with DESeq2 or any other Bioconductor packages.

ADD REPLY • link 5 months ago James W. MacDonald 68k