Hey,
for my GO enrichment analyses I am using the topGO package. In my latest analysis I noticed a discrepancy between the number of annotated genes topGO reports for a GO term and the number of genes actually annotated to that term in my input. For instance, for the term 'defense response' topGO reports 36 annotated genes, of which 7 are significantly regulated, but when I look these 36 genes up in my original input table (which maps my gene IDs to GO IDs), I find only 10 of them annotated to this term. Does anyone have an idea what could cause this discrepancy? I am thankful for any clue! Thank you in advance.
Kind regards.
Could it be that the other genes are annotated to a more specific GO term that implies this "defense response" term? I don't recall right now whether the number of genes reported in this case counts only those directly annotated to a term or all genes linked to it through such terms.
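
To make the idea concrete, here is a minimal sketch in plain Python (not topGO code) of how counting a gene for every ancestor of its annotated terms can make a term look larger than the input table suggests. The term names, gene names and the tiny hierarchy are all made up for illustration, and whether topGO counts genes this way is exactly the open question above.

```python
from collections import defaultdict

# Toy hierarchy: term -> its parent terms (hypothetical, not real GO data).
PARENTS = {
    "defense response": ["response to stress"],
    "child term A": ["defense response"],
    "child term B": ["child term A"],
}

# Toy input table: gene -> terms it is directly annotated to (hypothetical).
DIRECT = {
    "gene1": ["defense response"],
    "gene2": ["child term A"],
    "gene3": ["child term B"],
    "gene4": ["response to stress"],
}

def ancestors(term):
    """Return every term reachable from `term` by following parent links."""
    seen = set()
    stack = [term]
    while stack:
        for parent in PARENTS.get(stack.pop(), []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen

def propagate(direct):
    """Map each term to the genes annotated to it or to any of its descendants."""
    genes_in_term = defaultdict(set)
    for gene, terms in direct.items():
        for term in terms:
            genes_in_term[term].add(gene)
            for anc in ancestors(term):
                genes_in_term[anc].add(gene)
    return genes_in_term

# Genes found by searching the input table for the term itself.
direct_hits = sorted(g for g, ts in DIRECT.items() if "defense response" in ts)
# Genes counted for the term once descendant annotations are included.
propagated = sorted(propagate(DIRECT)["defense response"])

print(direct_hits)  # ['gene1']
print(propagated)   # ['gene1', 'gene2', 'gene3']
```

In this toy case a search of the table finds one gene for "defense response", while the propagated count is three. If I remember correctly, topGO has a genesInTerm() function that lists the genes it counts for a given term, so comparing that list against the input table should show whether the extra genes come from more specific terms.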