Question

Regarding scRNA seq vs Bulk RNA seq

naman.sep • 0

@namansep-13879

Last seen 6.7 years ago

Hi

I have scRNA-seq and bulk RNA-seq data from the same cells/tissue. I want to know why certain transcripts that show up in single cells do not show up in the bulk RNA-seq data. Is this normal? I would assume that all transcripts detected in single cells should be represented in the bulk data, while the reverse will not be true because many more transcripts are detected in bulk data. But it was strange to see that the single-cell RNA-seq data had a few transcripts which never showed up in the bulk RNA-seq data. Of course, I would expect single cells to differ from each other in the number of transcripts detected, but why do we see this variation between single-cell and bulk data? Is this technical noise that we should ignore?

Regards

single cell rnaseq • 7.5k views

updated 6.7 years ago by galib36 ▴ 10 • written 6.7 years ago by naman.sep • 0

score 0 · Answer 1 · 2017-09-01

Because they involve different protocols? Bulk RNA-seq tends to use (ribosome-depleted) total RNA protocols nowadays, while most single-cell RNA-seq uses poly-A-based approaches. I can imagine that this would result in different biases and preferences for particular transcripts.

There are also considerations with cell dissociation and size. For example, if a tissue contains some fragile cell types, these would lyse and not show up in the single-cell data. In comparison, the fragile cell types would still be present in the bulk data where no dissociation is required, only lysis of the entire tissue. The resulting bulk-only transcripts would compete with and suppress the coverage of transcripts unique to other cell types, resulting in counts that are only observed in single-cell data. A similar effect occurs with large and small cells in the same bulk population, where transcripts unique to small cells get suppressed in bulk data.
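To make the large-cell/small-cell dilution effect concrete, here is a small arithmetic sketch. Every number in it (cell counts, mRNA content per cell, copies of the gene) is hypothetical and chosen only for illustration. It assumes that a library simply reflects the proportion of molecules present.

```python
def library_fraction(molecules_of_gene, total_molecules):
    """Fraction of a library's molecules that come from one gene."""
    return molecules_of_gene / total_molecules

# Hypothetical tissue: a few large cells and many small cells.
n_large, mrna_per_large = 10, 500_000
n_small, mrna_per_small = 40, 50_000

# Hypothetical gene expressed only in small cells, 5 copies per cell.
gene_copies_per_small_cell = 5

# Bulk library: all cells are lysed together, so large cells dominate.
bulk_total = n_large * mrna_per_large + n_small * mrna_per_small
bulk_gene = n_small * gene_copies_per_small_cell
frac_bulk = library_fraction(bulk_gene, bulk_total)

# Single-cell library made from one small cell.
frac_single = library_fraction(gene_copies_per_small_cell, mrna_per_small)

# How much rarer the gene is, per read, in bulk than in a small cell.
dilution = frac_single / frac_bulk

print(f"fraction in bulk library:       {frac_bulk:.2e}")
print(f"fraction in small-cell library: {frac_single:.2e}")
print(f"dilution in bulk:               {dilution:.1f}x")
```

With these made-up numbers, each read from the bulk library is 3.5 times less likely to come from the small-cell gene than a read from a small-cell library, so at the same depth the gene is easier to miss in bulk.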

Finally, there is always sampling noise, which means that transcripts for lowly expressed genes may be sampled in a few cells on a plate but not in the bulk sample. You would have to have equal total sequencing depth between the bulk sample and all single-cell samples for the counts to be fully comparable. Obviously you will miss transcripts if the bulk sample is not sequenced to the same depth.
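As a rough illustration of the depth argument, the sketch below computes the chance that a transcript gets zero reads under a deliberately simple model: each read is drawn independently, and the transcript makes up the same fixed fraction of every library. The fraction and the depths are hypothetical, and real libraries deviate from this model (amplification bias, dropouts, differences in composition).

```python
import math

def prob_zero_reads(fraction, depth):
    """Probability that a transcript making up `fraction` of the library
    receives zero reads when `depth` reads are drawn independently.

    This is (1 - fraction) ** depth, computed via log1p for precision
    when `fraction` is tiny.
    """
    return math.exp(depth * math.log1p(-fraction))

# Hypothetical values, for illustration only.
fraction = 1e-7                  # transcript is 1 in 10 million molecules
bulk_depth = 5_000_000           # reads in the bulk library
sc_total_depth = 50 * 1_000_000  # 50 cells at 1 million reads each

p_missing_bulk = prob_zero_reads(fraction, bulk_depth)
p_missing_sc = prob_zero_reads(fraction, sc_total_depth)

print(f"P(no reads in bulk):             {p_missing_bulk:.3f}")
print(f"P(no reads in any single cell):  {p_missing_sc:.4f}")
```

Under these assumptions the transcript is missed in the bulk sample about 61% of the time, but it is missed across all single cells combined less than 1% of the time, purely because the single cells have ten times the total depth.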

P.S. This question is more suited for a general forum like SEQAnswers, it doesn't seem to involve any Bioconductor packages.

score 0 · Answer 2 · 2017-09-01

galib36 ▴ 10

@galib36-9138

Last seen 6.3 years ago

United Kingdom

Are these lowly expressed genes? It could be that, because a fixed number of reads is shared across all mRNAs, the highly expressed genes take the majority of the reads and the lowly expressed genes do not get any.

6.7 years ago by galib36 ▴ 10

Thank you, galib, for the reply. But how do we explain lowly expressed genes being present in single cells and not in the bulk data? My problem is that there are a few genes which show up in single cells but not in the bulk data. But I understood your point. Logically, genes detected in single cells should all be present in the bulk data (50 cells in my case), right?

6.7 years ago by naman.sep • 0

The last paragraph of Aaron Lun's modified reply answers the question. In the bulk population, some very highly expressed genes dominate and take up most of the reads. This domination is weaker in single cells, because in some cells those genes are only moderately expressed, which leaves some reads for the lowly expressed genes and lets them show up. One way to check is to look at the expression distribution or variance of the highly and lowly expressed genes.
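One simple way to run that check is to compare the share of reads taken by the top genes in each library. The sketch below is only an illustration: the gene names and counts are made up, and `top_gene_share` is a hypothetical helper, not a function from any package.

```python
def top_gene_share(counts, n_top):
    """Share of all reads that go to the `n_top` most highly counted genes.

    `counts` maps gene name -> read count for one library.
    """
    total = sum(counts.values())
    if total == 0:
        return 0.0
    top = sorted(counts.values(), reverse=True)[:n_top]
    return sum(top) / total

# Hypothetical counts: one gene dominates the bulk library,
# while reads are spread more evenly in a single cell.
bulk_counts = {"geneA": 9000, "geneB": 800, "geneC": 200, "geneD": 0}
cell_counts = {"geneA": 300, "geneB": 400, "geneC": 250, "geneD": 50}

bulk_share = top_gene_share(bulk_counts, 1)
cell_share = top_gene_share(cell_counts, 1)

print(f"top-gene share in bulk:        {bulk_share:.2f}")
print(f"top-gene share in single cell: {cell_share:.2f}")
```

If the top genes take a much larger share of the bulk library than of the single-cell libraries, that is consistent with the competition-for-reads explanation. On real data you would compute this per library from the count matrix and use more than one top gene.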

6.7 years ago by galib36 ▴ 10