Question

Annotating the results with associated gene symbols

0

Entering edit mode

sss1991.f • 0

@sss1991f-6940

Last seen 9.5 years ago

United States

hi

i am new to R.I am using a windows base system. I am following the tut by Daniel Swan. After writing the following command > gene.symbols <- getSYMBOL(probeset.list$ID, "hgu133plus2") get this eror:

Error in unlist(lookUp(x, data, "SYMBOL")) : 
  error in evaluating the argument 'x' in selecting a method for function
'unlist': Error in lookUp(x, data, "SYMBOL") : No keys provided

microarray limma • 1.7k views

ADD COMMENT • link updated 8.9 years ago by Gordon Smyth 50k • written 9.5 years ago by sss1991.f • 0

score 2 · Answer 1 · 2014-10-28

You have to ensure that probeset.list$ID actually contains some data. The error indicates that it does not. In addition, you should probably use the more modern select() function. Note that you need to also ensure that you are passing in character values, rather than factors.

gns <- select(hgu133plus2.db, as.character(probeset.list$ID), "SYMBOL")

score 1 · Answer 2 · 2015-03-31

1

Entering edit mode

arontommi ▴ 10

@arontommi-7540

Last seen 9.1 years ago

Sweden

I know this is late but if someone is googling, this will come up so here goes :
you dont have a probeset.list$ID

if you look at our dataframe you can see that there is no ID column

to fix this you use rownames as the ID column like his :
probeset.list$ID = rownames(probeset.list)

now it should run

ADD COMMENT • link 9.1 years ago arontommi ▴ 10

score 0 · Answer 3 · 2015-05-13

This question apparently relates to Daniel Swan's 2011 tutorial about using the simpleaffy and limma packages to analyse Affymetrix microarray data:

http://bioinformatics.knowledgeblog.org/2011/06/20/analysing-microarray-data-in-bioconductor/

Exactly the same question has been asked before on this site:

annotating the results with association gene symbol

The reason why Daniel Swan's code doesn't run exactly as is because the limma topTable now (in response to user requests) puts the probe IDs into the rownames of the output table rather than into a column called "ID". You can easily see this for yourself just by looking at the top table. So you just have to use rownames(probe.list) instead of probe.list$ID.

Alternatively you could follow the case study (Section 17.2) in the limma User's Guide, which is somewhat better because it puts the gene symbols in the data object right at the beginning of the analysis pipeline instead of the adding them later after the top table has been produced.