Question

some values in assay are not integers

0

Entering edit mode

Rim Khlifa • 0

@rim-khlifa-17020

Last seen 5.7 years ago

Hi,

I used Tax4Fun for functional community profiling based on 16S rRNA data. I would like to go further in my analyses using DESeq2. However, I got the following error message: "Error in DESeqDataSet(se, design = design, ignoreRank) : some values in assay are not integers". I read the details on vignette("DESeq2"). But I still have two questions:

1) Is it possible to overcome this issue by transforming Tax4Fun data?

2) If the answer is yes, then, what transformation would be most appropriate?

Here are the code that I used, and an overview of my Tax4Fun data:

ddsMT<- DESeqDataSetFromMatrix(countData = t(data.dd2biom.tax4fun.corr.deseq),
                                colData = sam2,
                                design= ~ treatment_N + treatment_H + treatment_N:treatment_H)

> head(t(data.dd2biom.tax4fun.corr.deseq))

                                                                                     BER203   BER211

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001202 1.001175

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000021 1.000020

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000440 1.000513

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000032 1.000039

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000039 1.000030

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000043 1.000049

                                                                                     BER220   BER226

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001026 1.000934

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000024 1.000024

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000519 1.000343

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000033 1.000029

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000052 1.000094

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000058 1.000063

                                                                                     BER233   BER237

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001285 1.001052

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000011 1.000019

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000419 1.000393

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000020 1.000032

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000031 1.000053

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000028 1.000053

                                                                                     BER241   BER247

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001144 1.001156

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000023 1.000019

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000470 1.000469

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000036 1.000039

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000026 1.000039

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000051 1.000045

                                                                                     BER251   BER257

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001110 1.001266

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000017 1.000016

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000448 1.000466

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000033 1.000031

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000036 1.000038

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000056 1.000040

                                                                                     BER262   BER263

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001235 1.001239

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000018 1.000018

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000490 1.000487

K00004...R.R..butanediol.dehydrogenase...diacetyl.reductase..EC.1.1.1.4.1.1.1.303. 1.000042 1.000043

K00005..glycerol.dehydrogenase..EC.1.1.1.6.                                        1.000019 1.000064

K00007..D.arabinitol.4.dehydrogenase..EC.1.1.1.11.                                 1.000058 1.000032

                                                                                     SIM403   SIM413

K00001..alcohol.dehydrogenase..EC.1.1.1.1.                                         1.001376 1.001229

K00002..alcohol.dehydrogenase..NADP....EC.1.1.1.2.                                 1.000014 1.000019

K00003..homoserine.dehydrogenase..EC.1.1.1.3.                                      1.000455 1.000462

DESeq2 Tax4Fun • 3.5k views

ADD COMMENT • link updated 5.7 years ago by Michael Love 42k • written 5.7 years ago by Rim Khlifa • 0

score 0 · Answer 1 · 2018-08-22

0

Entering edit mode

Michael Love 42k

@mikelove

Last seen 15 hours ago

United States

What are the values you are trying to input to DESeq2? Note that DESeq2 is only designed for count data. E.g. the (i, j) element in the matrix is the count of observations of feature i in sample j.

ADD COMMENT • link 5.7 years ago Michael Love 42k

0

Entering edit mode

Thank you for your reply. The values that I'm trying to input to DESeq2 come from Tax4Fun output, which is a list of enzymes with abundances scores for each sample (= numbers with a decimal. So I think that DESeq2 is not designed for this kind of data, am I right?). They are as follows:

	BER203	BER211	BER220	BER226
K00001; alcohol dehydrogenase [EC:1.1.1.1]	0.00120189931368663	0.00117478838929879	0.00102598869027412	0.000934206125155835
K00002; alcohol dehydrogenase (NADP+) [EC:1.1.1.2]	2.06533288497377e-05	1.96294977525732e-05	2.38030569150464e-05	2.35263803557737e-05
K00003; homoserine dehydrogenase [EC:1.1.1.3]	0.000440183655887781	0.000512802322993243	0.000519348753567777	0.000343234823239996

ADD REPLY • link 5.7 years ago Rim Khlifa • 0

1

Entering edit mode

Yes, these do not look like appropriate input if they aren’t counts / observations.

ADD REPLY • link 5.7 years ago Michael Love 42k