Question

Complete chromosome losses and polyploidies

mikko.arvas • 0

Dear cn.mops crew,

I am trying cn.mops on a set of yeast genomes. Some issues:

Sometimes a strain completely lacks a certain chromosome, so the read counts for that chromosome are zero. Of course I can deal with this in preprocessing, but it would be very nice if cn.MOPS did not give an error and instead did something smarter.
I have an estimate, based on lab work, of the ploidy level of each sample. They vary from 2n to 4n. It would be very nice if this information could be used in the CNV calling instead of giving a single ploidy level for all the samples.

Do you have recommendations on settings to use with this kind of sample? The normalisation step in particular is prone to fail with the whole-chromosome losses I am seeing.

cn.mops polyploidy • 1.6k views

Posted 9.1 years ago by mikko.arvas

score 0 · Answer 1 · 2015-11-04

Hello Mikko,

Thanks for your interest in my work and for using cn.MOPS! I apologise for the difficulties you are having. cn.MOPS was originally developed for germline CNVs of diploid organisms (WGS), but because of its high performance people started using it for other types of data: it worked really well for exome sequencing data, for tumor samples, and for haploid genomes. However, these additional features were added to the package later and are still not automatic. I would like to give the cn.MOPS package a larger revision as soon as possible.

The function "normalizeChromosomes" has an argument "ploidy", to which you can pass your per-sample ploidy estimates. Please note that after this step the parameter "norm" of cn.MOPS has to be set to 0 ("norm=0"), so that the read counts are not normalized a second time. We also have another normalization function that takes different ploidies and large CNVs into account. I will email this function to you as soon as possible.
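To make the idea of ploidy-aware normalization concrete, here is a minimal, language-neutral sketch in Python. It is not cn.mops code and does not claim to reproduce what "normalizeChromosomes" does internally. It assumes one plausible scheme: bring every sample to a common per-window depth, then rescale each sample by ploidy/2, so that a tetraploid sample ends up with twice the counts of a diploid one. The function name and the toy data are hypothetical.

```python
from statistics import median

def normalize_with_ploidy(counts, ploidy):
    """Scale read counts so that depth per genome copy is comparable.

    counts: dict mapping sample name -> list of read counts per window
    ploidy: dict mapping sample name -> estimated ploidy (e.g. 2, 3, 4)
    Assumption (not taken from cn.mops): size factor = median depth.
    """
    depth = {s: median(c) for s, c in counts.items()}
    target = median(depth.values())  # common reference depth
    normalized = {}
    for sample, c in counts.items():
        if depth[sample] == 0:
            raise ValueError(f"sample {sample!r} has zero median depth")
        # First equalize coverage, then restore the expected ploidy offset.
        factor = target / depth[sample] * ploidy[sample] / 2
        normalized[sample] = [x * factor for x in c]
    return normalized

# Toy data: three samples sequenced at different depths.
toy_counts = {"a": [100, 100, 100, 100],
              "b": [200, 200, 200, 200],
              "c": [50, 50, 50, 50]}
toy_ploidy = {"a": 2, "b": 2, "c": 4}
toy_normalized = normalize_with_ploidy(toy_counts, toy_ploidy)
```

Because the median is used as the depth summary, a single chromosome with zero reads does not drive the size factor to zero as long as most windows are covered. Counts prepared this way already carry the ploidy information, which is why a second normalization pass would undo the adjustment.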

Regards,
Günter

score 0 · Answer 2 · 2015-11-06

mikko.arvas • 0

Thanks Günter, I look forward to testing your function!

Posted 9.1 years ago by mikko.arvas

Hello Mikko,

I have put the normalization functions, which one of my students developed, on http://www.bioinf.jku.at/software/cnmops/ (see "Additional normalization functions"). These functions take different ploidies or large CNVs into account during normalization. I hope the documentation in the code is sufficient. Otherwise please contact us again!

Regards,

Günter

Reply posted 9.1 years ago by Günter Klambauer

Hi Günter,

I am also examining individuals that carry whole-chromosome dosage polymorphisms. Many of these individuals also differ in sequencing coverage. I am not having success with "normalizeChromosomes" (passing their relative ploidy estimates): a few individuals that I know have extra or missing chromosomes do not show the expected increased or decreased log values for the respective chromosomes (via segplot).

Would it be best to use the normalizeTumor functions in this case?

If so, I'm a little confused about how to set it up. The first function, toGR, takes as input a data frame of chromosomes and their copy numbers, correct? I see there isn't a "sample" column, so does this function need to be run separately for each sample's BAM file? Should this data frame include the estimated copy number of EVERY chromosome in each sample? If so, how am I supposed to know the estimated copy number? Isn't that supposed to be the output of cn.mops? Alternatively, would I just need to provide a few chromosomes I suspect to be 2N to act as a baseline?

Thanks,

Mike

Reply posted 7.5 years ago by mmats010

Hi Mike,

Before you can use normalizeTumor(), it is necessary to estimate the copy number for large segments and the purity of your data. One way to get these estimates would be to use PyLOH: https://github.com/uci-cbcl/PyLOH or my improved version: https://github.com/patrick-praher/PyLOH_Opt

Based on the CN estimates, normalizeTumor takes the segments with an estimated CN of 2 as the basis for the normalization. Unfortunately, http://www.bioinf.jku.at/software/cnmops/code/normalizeTumorSample.R does not show that the variable cnvs_CCL is read from the PyLOH results file. If you like, I can provide you with a function to read the PyLOH results into R.
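As an illustration of normalizing on copy-number-2 segments only, here is a small Python sketch. It is not the code from normalizeTumorSample.R. It assumes per-window read counts for a sample and a reference plus a prior copy-number estimate per window (for example derived from PyLOH output), and it uses the median ratio as the scaling statistic, which is my own choice. All names and data are hypothetical.

```python
from statistics import median

def scale_factor_from_cn2(sample, reference, copy_number):
    """Return the factor that maps sample counts onto the reference,
    estimated only from windows whose prior copy number is 2."""
    ratios = [s / r
              for s, r, cn in zip(sample, reference, copy_number)
              if cn == 2 and r > 0]
    if not ratios:
        raise ValueError("no usable windows with copy number 2")
    return 1.0 / median(ratios)

# Toy data: the sample is sequenced 1.5x deeper than the reference and
# carries one gained (CN 3) and one lost (CN 1) region.
ref_counts = [100, 100, 100, 100, 100, 100]
sample_counts = [150, 150, 150, 225, 225, 75]
prior_cn = [2, 2, 2, 3, 3, 1]

factor = scale_factor_from_cn2(sample_counts, ref_counts, prior_cn)
fold_change = [s * factor / r for s, r in zip(sample_counts, ref_counts)]
```

Restricting the estimate to CN 2 windows keeps large gains or losses from shifting the baseline, which is the failure mode of whole-genome scaling when entire chromosomes are gained or lost.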

The fold-change vector is then corrected using the purity estimate before cn.mops is applied, as shown in normalizeTumorSample.R.
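For readers unfamiliar with purity correction, the following Python sketch shows one common form of it. It is an assumption on my part, not the formula taken from normalizeTumorSample.R: the sample is modelled as a mixture of a fraction p of tumor cells and 1 - p of diploid normal cells, so an observed fold change f relates to the tumor-only fold change t by f = p * t + (1 - p). The function name is hypothetical.

```python
def correct_for_purity(fold_changes, purity):
    """Invert the mixture model f = p * t + (1 - p) to recover the
    tumor-only fold change t from the observed fold change f."""
    if not 0 < purity <= 1:
        raise ValueError("purity must be in the interval (0, 1]")
    # Clamp at zero: noise can push the corrected value below zero.
    return [max((f - (1 - purity)) / purity, 0.0) for f in fold_changes]

# With 50% purity, an observed 1.25-fold signal corresponds to a
# 1.5-fold change in the tumor cells (copy number 3 instead of 2).
observed = [1.0, 1.25, 0.75]
corrected = correct_for_purity(observed, purity=0.5)
```

The correction stretches fold changes away from 1, so lower purity estimates amplify both real signal and noise.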

I hope this short explanation was helpful. Please feel free to ask if you need anything.

Cheers,

Reply posted 7.5 years ago by patrick.praher