Uniprot and UniProt.ws
1
0
Entering edit mode
Juliet Hannah ▴ 360
@juliet-hannah-4531
Last seen 4.9 years ago
United States

I would like to find the protein sequence for human APOB.

I was not able to query by "APOB" so I first looked up the Entrez ID, 338.

libraryUniProt.ws)
human <- UniProt.ws(9606)

keys <- c("338")
columns <- c("SEQUENCE","UNIPROTKB","PDB")
kt <- "ENTREZ_GENE"
res <- select(human, keys, columns, kt)

This returns the main protein, but also two variants. One variant is labeled "Apolipoprotein B variant". How can I add this label so that I am able to determine the one I want.

Also, can I search "ABOB" directly rather than going through the Entrez ID? Thanks!

James MacDonald's suggestion worked for this part.

keys <- c("APOB")
columns <- c("SEQUENCE","UNIPROTKB","PDB","GENECARDS")
kt <- "GENECARDS"
res <- select(human, keys, columns, kt)
Uniprot Annotation Identifiers • 994 views
ADD COMMENT
2
Entering edit mode
@james-w-macdonald-5106
Last seen 1 hour ago
United States

I think you can use the GENECARDS keytype:

> select(ws, "APOB", c("SEQUENCE","UNIPROTKB","PDB"), "GENECARDS")
Getting mapping data for APOB ... and ACC
Getting mapping data for P04114 ... and PDB_ID
Getting extra data for P04114
'select()' returned 1:1 mapping between keys and columns
  GENECARDS
1      APOB
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             SEQUENCE
1 MDP<snip>KLAPGELTIIL
  UNIPROTKB  PDB
1    P04114 <NA>
> 
ADD COMMENT
0
Entering edit mode

Thanks James! "GENECARDS" did indeed link the name. Do you have any suggestions about adding the variant information I describe above?

ADD REPLY
0
Entering edit mode
> z <- select(ws, "338", c("PROTEIN-NAMES","UNIPROTKB","PDB"), "ENTREZ_GENE")
Getting mapping data for 338 ... and ACC
Getting mapping data for P04114 ... and PDB_ID
Getting extra data for P04114, Q59HB3, Q7Z7Q0
'select()' returned 1:many mapping between keys and columns
> z
  ENTREZ_GENE
1         338
2         338
3         338
                                                                    PROTEIN-NAMES
1 Apolipoprotein B-100 (Apo B-100) [Cleaved into: Apolipoprotein B-48 (Apo B-48)]
2                                             Apolipoprotein B variant (Fragment)
3                                                                    APOB protein
  UNIPROTKB  PDB
1    P04114 <NA>
2    Q59HB3 <NA>
3    Q7Z7Q0 <NA>

ADD REPLY

Login before adding your answer.

Traffic: 772 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6