Question

Annotating the Human Gene 2.0 ST Array annotate with hugene20sttranscriptcluster.db hugene20sthsentrezg or hugene20stprobeset

1

Entering edit mode

sofia.figueiredo ▴ 10

@sofiafigueiredo-11041

Last seen 7.9 years ago

Dear Bioconductor community,

I am working with Human Gene 2.0 ST Array. At the annotation process, I have doubts which .db library to use:

- hugene20sttranscriptcluster.db

- hugene20sthsentrezg.db or

- hugene20stprobeset.db?

Can you please indicate what are the differences between these three and in which situations should I use one or the other .db library?

Many thanks in advance

Sofia

---

Ana Sofia Figueiredo

Bioinformatician, University of Heidelberg

affymetrix microarrays hugene20sttranscriptcluster.db • 2.4k views

ADD COMMENT • link updated 8.0 years ago by James W. MacDonald 66k • written 8.0 years ago by sofia.figueiredo ▴ 10

0

Entering edit mode

you may try the pd.hugene.2.0.st which provide all the annotations for the platform, I found it easier as a non expert. But for the explication, I will suggest you look at the reference manual of each of theses package, it tell exactly what they do.

ADD REPLY • link 8.0 years ago giroudpaul ▴ 40

score 2 · Answer 1 · 2016-07-13

2

Entering edit mode

James W. MacDonald 66k

@james-w-macdonald-5106

Last seen 3 hours ago

United States

The hugene20sttranscriptcluster.db package is for when you have summarized the data at the 'core' level, which is by transcript. The hugene20stprobeset.db package is for when you have summarized at the 'probeset' level, which is roughly the exon level. The hugene20sthsentrezg.db package is for when you summarized using the MBNI re-mapped probesets.

In the vast majority of cases you should use the hugene20sttranscriptcluster.db package. Also, while the pd.hugene.2.0.st package does contain annotation data, those data are simply parsed versions of the Affymetrix annotation csv files, and while giroudpaul may find those easier to deal with, I think that is probably a minority opinion.

ADD COMMENT • link 8.0 years ago James W. MacDonald 66k

0

Entering edit mode

Many thanks for your clarifications. I normalized my data again, adding the option target="core" to the rma function and did the annotations with hugene20sttranscriptcluster.db.

Everything is working fine :)

Sofia

ADD REPLY • link 8.0 years ago sofia.figueiredo ▴ 10