Question

topGO - Two-tailed (depletion/enrichment) analysis?

0

Entering edit mode

nk ▴ 10

@nk-7193

Last seen 7.0 years ago

United Kingdom

I would like to do a simple GO enrichment analysis with topGO. In theory I believe this should be possible with a simple "classic fisher" test.

However, I am not only interested in GO terms that are significantly enriched in significant genes, but also GO terms that are significantly depleted. While I could not find any details on this in the documentation, it seems that topGO only does a single-tailed test, so these don't seem to come up.

I did find a workaround by reversing the geneSelectionFun, essentially checking for an enrichment of certain categories in non-significant genes. However, I am not sure whether this doesn't violate some assumptions in the underlying statistics, so I would prefer to just do a proper two-tailed test.

Is this possible, and if so, how?

topgo go • 2.7k views

ADD COMMENT • link updated 10.1 years ago by Gordon Smyth 52k • written 10.1 years ago by nk ▴ 10

score 0 · Answer 1 · 2014-12-20

0

Entering edit mode

Gordon Smyth 52k

@gordon-smyth

Last seen 1 hour ago

WEHI, Melbourne, Australia

Surely it would make more sense to test for depletion and enrichment separately, rather than combined in a two tailed test.

Anyway, the GOstats package allows you to test for depletion as well as enrichment.

ADD COMMENT • link 10.1 years ago Gordon Smyth 52k

0

Entering edit mode

Why would it make more sense to test separately? Wouldn't two separate one-tailed tests actually overestimate significance, which we would have to correct for again? It definitely feels "statistically cleaner" to me to do a two-tailed test for significance first and then check the direction afterwards.

I'll have a look at GOstats, thanks. However, if there is a way to do this with topGO I'd be happy to hear about it, since I'd prefer to use this instead.

ADD REPLY • link 10.1 years ago nk ▴ 10

0

Entering edit mode

Fair enough, it would be logical to do a two-tailed test, provided that you check the direction for significant terms afterwards.

I was concerned that the two-tailed Fisher test gives no output other than a p-value and therefore doesn't distinguish between enrichment and depletion.

ADD REPLY • link 10.1 years ago Gordon Smyth 52k