GO.db package GOBPPARENTS
2
1
Entering edit mode
1frcn ▴ 20
@1frcn-6501
Last seen 9.5 years ago
Germany

Hi all,

I've got a question concerning the GO.db package, in particular about the annotation of GO identifiers to their parents. I read the pdf that describes the functions of GOMFPARENTS, GOCCPARENTS and GOBPPARENTS and was wondering if there is a little mistake: At the GOBPPARENTS description it says "This data set describes associations between GO molecular function (BP) terms and [...]". Shouldn't it be "This data set describes associations between GO biological process (BP) terms and [...]". The same mistake(?) is at the description of GOCCPARENTS.

Then, there is another question:
I looked up the node "GO:0016301" at the gene ontology consortium website. There is a graph that shows, that this node has two parents: node number "GO:0016772" and number "GO:0016310". Now, if I do in R:
 

library(GO.db)
x<-as.list(GOMFPARENTS)
xx<-as.list(GOCCPARENTS)
xxx<-as.list(GOBPPARENTS)
x['GO:0016301']
#$`GO:0016301`
#       is_a
#"GO:0016772"
xx['GO:0016301']
#$<NA>
#NULL
xxx['GO:0016301']
#$<NA>
#NULL


I can't find the parent-node number "GO:0016310", which has -following AmiGO - a "part_of"- relation to the node "GO:0016301". And in the details of GO_BP/MF/CC_PARENTS it says, that "[t]he name associated with the, parent term will be either isa, hasa or partof". Hence, I thought the term "GO:0016310" should be in the data. I could imagine that it is a data base problem. Maybe GO.db isn't as up-to-date as AmiGO. Do you think this actually is the reason or is there another problem?

Thanks in advance,
1frcn

 

 version
#
#platform       x86_64-w64-mingw32         
#arch           x86_64                     
#os             mingw32                    
#system         x86_64, mingw32            
#status                                    
#major          3                          
#minor          1.1                        
#year           2014                       
#month          07                         
#day            10                         
#svn rev        66115                      
#language       R                          
#version.string R version 3.1.1 (2014-07-10)
go annotation • 1.7k views
ADD COMMENT
0
Entering edit mode
@laurent-gatto-5645
Last seen 1 day ago
Belgium

I don't think that GO.db captures information across name spaces, such as the part_of that you describe or occurs_in from Annotation Extension for GO terms post. I think it would definitely be interesting to have such functionality in Bioc.

ADD COMMENT
0
Entering edit mode
Marc Carlson ★ 7.2k
@marc-carlson-2264
Last seen 7.7 years ago
United States

Hi,

So thank you for mentioning the typo in the documentation.  I will make an effort to tidy those up. 

It turns out that the missing term you mentioned ("GO:0016310") is not actually missing from the GO.db database.  You can test that idea out directly by using select like this:

library(GO.db)
select(GO.db, 'GO:0016310', columns = c('TERM','ONTOLOGY'), keytype='GOID')

So the reason it doesn't show up in the GOMFPARENTS mapping is because that mapping is an 'MF' mapping.  Whereas the "GO:0016310" term you were expecting to see as a parent is actually a 'BP' term. 

What is probably confusing you is the image from Amigo (drawn below).  And the reason it's confusing is that they are drawing all the different GO terms interlaced together as a single graph (when in fact GO is actually three separate ontologies).  If you look closely at the image you will notice that at the top you can see the roots of all three major ontologies represented...  In contrast, the MFPARENTS mapping that you were searching in can only get you parents from the 'MF' ontology. 

 

 

 

http://amigo.geneontology.org/visualize?mode=amigo&term_data_type=string&inline=false&format=png&term_data=GO%3A0016301

ADD COMMENT
0
Entering edit mode

Hi Marc,

thank you for your answer. I'm really glad that it was just a typo in the documentation - this would have confused me a lot...

To the second part of my question: In my code above I actually looked up the "GO:0016310" term in all three PARENT data bases. So I was confused why it didn't appear at all and especially not in the GOBPPARENT data base. I'm sorry that my question was that imprecise...

 

ADD REPLY

Login before adding your answer.

Traffic: 666 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6