Question: problem with custom cdf
gravatar for tadoktor
3.5 years ago by
tadoktor0 wrote:


I am new in R and i trying to re-annotate some Affy cell files to Ensemle ID and then afterwards normalize the data with RMA. The problem  is that as an end result in stead of having  a reduced number of IDs (from the normal 54 000 Affy IDs to something around 20-15 000 ENSE IDs) which is the logic step, i end up with a list of around 77 000 IDs/genes. Does anyone has an idea what i am doing wrong?

Here is the code:

CELs.start <- ReadAffy(celfile.path = "/Users/dimiter/genomark/NewTest/Working",cdfname="hgu133plus2hsensecdf")
eset.rma <- rma(CELs.start)
matrix <- exprs(eset.rma)

Thanks in advance!



ADD COMMENTlink modified 3.5 years ago by James W. MacDonald46k • written 3.5 years ago by tadoktor0
gravatar for James W. MacDonald
3.5 years ago by
United States
James W. MacDonald46k wrote:

You haven't done anything wrong, except perhaps for being confused as to what ENSE means. Please note that ENSE is Ensembl Exon IDs, whereas ENSG is Ensembl Gene IDs. So if you want to summarize at the gene level, based on Ensembl genes, then you need the hgu133plus2hsensgcdf, which has 20,049 genes as compared to the ENSE version, which has 77,898 exons.

ADD COMMENTlink written 3.5 years ago by James W. MacDonald46k

Many thanks for the answer!  :)

ADD REPLYlink written 3.5 years ago by tadoktor0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 141 users visited in the last hour