Annotating the results with associated gene symbols
3
0
Entering edit mode
sss1991.f • 0
@sss1991f-6940
Last seen 9.5 years ago
United States

hi

i am new to R.I am using a windows base system. I am following the tut by Daniel Swan. After writing the following command > gene.symbols <- getSYMBOL(probeset.list$ID, "hgu133plus2") get this eror: 

Error in unlist(lookUp(x, data, "SYMBOL")) : 
  error in evaluating the argument 'x' in selecting a method for function
'unlist': Error in lookUp(x, data, "SYMBOL") : No keys provided
microarray limma • 1.7k views
ADD COMMENT
2
Entering edit mode
@james-w-macdonald-5106
Last seen 6 hours ago
United States

You have to ensure that probeset.list$ID actually contains some data. The error indicates that it does not. In addition, you should probably use the more modern select() function. Note that you need to also ensure that you are passing in character values, rather than factors.

gns <- select(hgu133plus2.db, as.character(probeset.list$ID), "SYMBOL")
ADD COMMENT
1
Entering edit mode
arontommi ▴ 10
@arontommi-7540
Last seen 9.1 years ago
Sweden

I know this is late but if someone is googling, this will come up so here goes :
you dont have a probeset.list$ID

if you look at our dataframe you can see that there is no ID column

to fix this you use rownames as the ID column like his :
probeset.list$ID = rownames(probeset.list)

now it should run

ADD COMMENT
0
Entering edit mode
@gordon-smyth
Last seen 17 minutes ago
WEHI, Melbourne, Australia

This question apparently relates to Daniel Swan's 2011 tutorial about using the simpleaffy and limma packages to analyse Affymetrix microarray data:

  http://bioinformatics.knowledgeblog.org/2011/06/20/analysing-microarray-data-in-bioconductor/

Exactly the same question has been asked before on this site:

  annotating the results with association gene symbol

The reason why Daniel Swan's code doesn't run exactly as is because the limma topTable now (in response to user requests) puts the probe IDs into the rownames of the output table rather than into a column called "ID". You can easily see this for yourself just by looking at the top table. So you just have to use rownames(probe.list) instead of probe.list$ID.

Alternatively you could follow the case study (Section 17.2) in the limma User's Guide, which is somewhat better because it puts the gene symbols in the data object right at the beginning of the analysis pipeline instead of the adding them later after the top table has been produced.

ADD COMMENT

Login before adding your answer.

Traffic: 560 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6