Deconvolution with CIBERSORT: input data
1
1
Entering edit mode
Matan G. ▴ 40
@matan-g-22483
Last seen 4 weeks ago

Hi,

I'd like to perform a deconvolution on my bulk RNA seq data. I'm testing CIBERSORT as an option.

1. Is the VST transformation on raw counts appropriate for use as CIBERSORT input? It is said that the data used as input should not be log-space normalized

2. If VST is not appropriate how can I actually normalize the data appropriately?

3. Since it is declared by CIBERSORT it seems that it is more appropriate for Chip data rather than RNAseq. Does anyone know an validated alternative (maybe a good R package)

Best

normalization r vst cibersort • 1.4k views
0
Entering edit mode
@kevin
Last seen 11 hours ago
Republic of Ireland

It seems that CIBERSORT prefers the input from RNA-seq to be normalised and follow a Poisson / Negative Binomial. So, please use the 'normalised' counts, i.e., not transformed via logCPM(), rlog(), or vst()

Regarding your point #3, I am not sure that it is true (?) - can you share the resource where you have read this? CIBERSORT has been around a long time and was initially developed on cDNA microarray data.

Kevin

0
Entering edit mode

Are you familiar with an R package that can help with normalization methods as the one you suggested?

Regarding my comment on CIBERSORT RNA seq deconvolution, it is taken from the FAQs section in the official website. I'm attaching both a screenshot and a ref links.

Do you recommend CIBERSORT for RNA seq bulk data? Do recommend to try any other source for this application?

Best

Refs: https://ibb.co/tczTQ57 https://cibersort.stanford.edu/faq.php see "Can I use LM22 with data from other platforms, such as RNA-Seq? "

0
Entering edit mode

Thanks, I can now [slightly] better interpret what they mean. They indicate that, if you use RNA-seq data, you should disable quantile normalisation in their options; however, it is not clear if they expect you to use RNA-seq normalised counts or normalised+transformed (vst, rlog, logCPM) expression levels. For full clarity, it may be better to contact the CIBERSORT group.

Fundamentally, though, keep in mind that CIBERSORT was designed on Affymetrix and Illumina microarray data, leading to the next point: Methods to predict cell-types in bulk RNA-seq data are still coming out. I believe there are a few post on Biostars about this.

Note also that this forum is for problems related to R / Bioconductor packages.

1
Entering edit mode

Hi Kevin,

I'm aware of the fact that this forum is less suitable thank you for pointing it out.

All the best