Question

Accounting for breadth of coverage while using DESeq2

0

Entering edit mode

Peter • 0

@2770a49e

Last seen 13 months ago

United States

Hello,

I'm trying to incorporate breadth of coverage into my metatranscriptomic analysis. For each gene I have the # of mapped reads and the percentage of bps that are covered (breadth of coverage). In the example gene set:

gene counts breadth 
A.      10.      0.8
B.      20.      0.1
C.      15.      1.0

Normally just the counts would go into DESeq, but I'm wondering if it could work to adjust these counts based on breadth of coverage. In this example gene B has many counts, but it is only covering a small portion of that gene—likely due to sequencing bias or ambiguous mapping. I was thinking about scaling the count values by breadth. It would just be breadth_adjusted = counts * breadth, which would essential give the average number of counts per bp in a gene. So in the above example:

gene counts breadth   breadth_adjusted
A.      10.      0.8.          8
B.      20.      0.1.          2
C.      15.      1.0.         15

I'm completely aware that DESeq is meant for non-transformed count values. That said, could this breadth adjusted value be used with DESeq? I think this could be a good way of accounting for breadth of coverage prior to normalization. I thought about setting all genes with low breadth coverage to zero, but this would influence both the scaling as well as over-inflate expression of genes.

Thanks Pete

DESeq2 • 473 views

ADD COMMENT • link updated 13 months ago by Michael Love 41k • written 13 months ago by Peter • 0

score 1 · Answer 1 · 2023-03-14

1

Entering edit mode

Michael Love 41k

@mikelove

Last seen 8 hours ago

United States

You can't transform the counts and provide to DESeq2.

I would try RUV or sva, and then look to see if the factors that are estimated correlate with breadth of coverage for certain genes.

ADD COMMENT • link 13 months ago Michael Love 41k