Hi, I am confused about the calculation of the size factors when the gene has at least one zero count in the samples when using the default params. Though I have checked the source code of deseq2, I did not get the point. Because I found that if there is at least one gene without zero among all of the samples, the deseq2 runs smoothly. When I changed the sfType = "poscounts", which I think is proper for the data that have a lot of zero, the size factors are similar to the default param (sfType = "ratio", right?). In theory, the genes with at least one zero among samples, deseq2 will ignore them for the calculation of size factors.
So my ultimate question is, Can you be kind to explain why the size factors results of sfType = "poscounts" and the default param are similar when there are a lot of genes with at least one zero among samples?
Thanks for your quick reply. I understand and have noticed the code you mentioned above.
later I found that this is true for just some project. I think I know the reason, if there are only very little rows that have no zero among samples, the size factors should be biased.