Question

How to overcome negative values obtained from vst for downstream analysis ( calculating bray-curtis and PCoA plot)?

0

Entering edit mode

zcbthug • 0

@zcbthug-22428

Last seen 4.4 years ago

My data set is a large gene count table which contains 500,000 genes as rows and 6 columns as samples ( 1:3 are control and 4:6 are disease). Whilst some of my samples have a few 0 values, no column contains all 0 values.

I am trying to run a PCoA plot following Bray-Curtis dissimilarity calculation.

Here is my code; gctab <- read.csv("final.gene.count.table.nonzero.csv", row.names=1)

DF = data.frame(id=colnames(gctab),type=rep(c("ctrl","disease"),each=3)) dds = DESeqDataSetFromMatrix(gctab,DF,~type)

vsd <- vst(dds, blind=TRUE) vegDistOut=vegdist(t(assay(vsd)),"bray")

vegDistOut=vegdist(t(assay(vsd) + min(assay(vsd))),"bray") ### this does not work either.

I cannot proceed with making the PCoA, because the error message I get is: In vegdist(t(assay(vsd)), "bray") : results may be meaningless because data have negative entries in method “bray”.

I am not too sure how to overcome this error. Please could anybody advise?

Edit; vegdist is part of the Vegan package in R. The problem is arising because the vst is producing negative values, and bray curtis dissimilarity can only be calculated with positive values. Not too sure whether anybody would suggest using the zinbwave package for preprocessing rather than vst?

Thank-you

zinbwave deseq2 vegan • 2.2k views

ADD COMMENT • link updated 4.4 years ago by James W. MacDonald 65k • written 4.4 years ago by zcbthug • 0

0

Entering edit mode

It's not clear what package vegdist comes from, but you should add that as a tag so the maintainer will get an email. I don't think it's a function in DESeq2, so Mike Love is the only one who is getting an email.

ADD REPLY • link 4.4 years ago James W. MacDonald 65k

score 0 · Answer 1 · 2019-11-25

0

Entering edit mode

James W. MacDonald 65k

@james-w-macdonald-5106

Last seen 23 hours ago

United States

The vegan package isn't part of Bioconductor, so your question is probably more appropriate for R-help or maybe biostars.org. That said, the vegan package is in general intended to be used for ecological counts, so why are you converting your count data using vst?

ADD COMMENT • link 4.4 years ago James W. MacDonald 65k