Deconvolution with CIBERSORT: input data
1
1
Entering edit mode
Matan G. ▴ 60
@matan-g-22483
Last seen 2.7 years ago

Hi,

I'd like to perform a deconvolution on my bulk RNA seq data. I'm testing CIBERSORT as an option.

  1. Is the VST transformation on raw counts appropriate for use as CIBERSORT input? It is said that the data used as input should not be log-space normalized

  2. If VST is not appropriate how can I actually normalize the data appropriately?

  3. Since it is declared by CIBERSORT it seems that it is more appropriate for Chip data rather than RNAseq. Does anyone know an validated alternative (maybe a good R package)

Best

normalization r vst cibersort • 7.9k views
ADD COMMENT
0
Entering edit mode
Kevin Blighe ★ 3.9k
@kevin
Last seen 7 days ago
Republic of Ireland

It seems that CIBERSORT prefers the input from RNA-seq to be normalised and follow a Poisson / Negative Binomial. So, please use the 'normalised' counts, i.e., not transformed via logCPM(), rlog(), or vst()

Regarding your point #3, I am not sure that it is true (?) - can you share the resource where you have read this? CIBERSORT has been around a long time and was initially developed on cDNA microarray data.

Kevin

ADD COMMENT
0
Entering edit mode

Hi, thank you for answering.

Are you familiar with an R package that can help with normalization methods as the one you suggested?

Regarding my comment on CIBERSORT RNA seq deconvolution, it is taken from the FAQs section in the official website. I'm attaching both a screenshot and a ref links.

Do you recommend CIBERSORT for RNA seq bulk data? Do recommend to try any other source for this application?

Best

Refs: https://ibb.co/tczTQ57 https://cibersort.stanford.edu/faq.php see "Can I use LM22 with data from other platforms, such as RNA-Seq? "

ADD REPLY
0
Entering edit mode

Thanks, I can now [slightly] better interpret what they mean. They indicate that, if you use RNA-seq data, you should disable quantile normalisation in their options; however, it is not clear if they expect you to use RNA-seq normalised counts or normalised+transformed (vst, rlog, logCPM) expression levels. For full clarity, it may be better to contact the CIBERSORT group.

Fundamentally, though, keep in mind that CIBERSORT was designed on Affymetrix and Illumina microarray data, leading to the next point: Methods to predict cell-types in bulk RNA-seq data are still coming out. I believe there are a few post on Biostars about this.

Note also that this forum is for problems related to R / Bioconductor packages.

ADD REPLY
1
Entering edit mode

Hi Kevin,

Thank you for your answer. It has helped me.

I'm aware of the fact that this forum is less suitable thank you for pointing it out.

All the best

ADD REPLY

Login before adding your answer.

Traffic: 798 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6