Hello,
Sorry for asking a lot of questions about clusterProfiler. I try to fully understand the results and graphs I want to use.
When using dotplot() on the result of enrichGO() and enrichDO(), I always observe that the most significant terms are the one with the biggest number of genes and the highest gene ratio. Whereas, we can imagine to get a very significant term with very high gene ratio but small (at least not the biggest) number of genes...
I observed that also on all the examples I saw on the vignette and on the web.
What is the rational behind this? This way, the small gene sets can never appear significant...
Jane
gene ratio is k/n and gene count is k, they are indeed positive related. https://bioconductor.org/packages/devel/bioc/vignettes/DOSE/inst/doc/enrichmentAnalysis.html#over-representation-analysis