I'll leave others to answer number 1, but I'll poke my head in for question 2.
Consider the cause of the zero inflation in single-cell RNA-seq data. It's due to a combination of low input RNA, low capture efficiency and strong amplification of captured transcripts to obtain enough cDNA for sequencing. This means that successful capture (and subsequent amplification) of a small number of transcripts in a few cells results in a separate distribution of large counts along with lots of zero counts corresponding to cells in which the transcripts failed to be captured. Indeed, if you get rid of the amplification effects with UMIs, you find that the zero inflation is greatly attenuated - possibly still present due to biological heterogeneity, but that's another story.
In bulk RNA-seq, these considerations are not particularly relevant as you have high input quantities of RNA. This results in high-complexity libraries, reducing the chance of sequencing multiple amplicons of the same original cDNA molecule. Bulk populations also have more stable average expression profiles than single cells, so there's less chance of getting one replicate with zero and the others with large non-zero counts. Obviously, you will always get some zeros when your mean is close to zero - this is already handled by count-based models with no need for an extra zero inflation term.
Any zero inflation would manifest as large dispersion estimates in negative binomial models. I haven't seen that in bulk RNA-seq data - or specifically, I haven't seen that in a way that is caused by an excess of zeros (as large dispersions tend to be caused by a spread of non-zero counts); or when I have seen it, it's usually caused by something biological, e.g., I forgot to block on sex and Xist is now "zero-inflated". Clearly, the better solution in the latter case would be to block on that factor, or find hidden factors... with RUVseq.
Thanks for replying. One way to rephrase 2) is that when the number of observations is low (typical for bulk RNA) then it's hard to say whether zeros are due to zero inflation or just to NB/Poisson distribution having a low mean. That's one good reason why you haven't seen compelling evidence in favor of zero-inflated models. I believe if it were possible to increase the sample size in bulk RNA to the same level as in scRNA, zero-inflated models would quickly gain popularity in bulk RNA studies.
I'm not sure that sample size has much to do with this. Low amount of RNA and amplification bias seems more relevant. The first scRNA-seq dataset that I worked with had 10 cells and there was no doubt a large amount of zero inflation.
Anyway, re: zero-inflation and bulk RNA-seq, this is an interesting paper:
https://academic.oup.com/biostatistics/article/14/1/113/250560/Bayesian-analysis-of-RNA-sequencing-data-by?keytype=ref&ijkey=GDoEiRTJTP8Ed3o
What tool did you use to measure zero inflation?
We did a few goodness-of-fit plot and it seemed that the negative binomial model was underestimating the number of zeros.
It must be very hard to get compelling evidence in favor of any particular distribution with just 10 observations. In zero inflated case the only scenario I can think of is having 9 zeros out of 10, with the 10th observation being very large.
Well it's not quite like 10 observations since you have data for ~10,000 genes and you can take advantage of the mean-variance relation expected from the negative binomial distribution to look at goodness-of-fit. Anyway, the data are public so you can play around with the data yourself: https://www.ncbi.nlm.nih.gov/pubmed/24299736
It has a nice pool/split experiment that can be used to tell apart biological and technical variation.
Would a zero-inflated model fit better? Maybe, maybe not. Would it quickly gain popularity? This discussion is somewhat academic, but I doubt it. Current experimental designs for RNA-seq with low (3-5) numbers of replicates are very cost-effective for their intended purpose; to screen for interesting candidate genes for further functional studies. I would have a tough time convincing collaborators to generate 20-50 replicates for a bulk RNA-seq experiment, just so I could model zero inflation. (This wouldn't just be 20 separate library preps; it would be 20 separate cell cultures/mice/treatments/etc., which is probably the most expensive part of the process nowadays.) Scientifically speaking, the money is better spent elsewhere.