I have a question concerning SCAN normalization for the Affymetrix HTA 2.0 platform. When I normalize .CEL files from this platform using SCAN.UPC, everything works smoothly. However, the resulting exprs object contains data on the probeset/exon level (analogous to the Gene Expression Omnibus annotation file GPL17585), rather than on the gene/transcript level (as in GPL17586).
How could I use SCAN.UPC to generate transcript-level data? The number of probesets (925032) in the data returned by SCAN is huge, and I would prefer the transcript-level mappings. The annotation information in hta20transcriptcluster.db is on the gene/transcript level.
Thank you for your help!
I will have to go into the code to remember why that is the case and see if there is a better way to handle it. It might take me a few days (due to the US holidays) to get to it.
Does hta20transcriptcluster.db have probe information?
What do you mean by 'have probe information'? The hta20transcriptcluster.db package maps the transcript-level probeset IDs to annotation data like Entrez Gene, etc.
The pdInfo package for this array is the pd.hta.2.0 package, which is probably what you are after.
Nice, that would be great!