I performed differential expression on Codelink data; the output is shown below. Now I want to annotate the probes, using the `ACCN` column from hwgcod.db. I am trying to derive the ACCNs from the `probeName` column, but about 7000 probe names do not match any ACCN, which leads to a loss of information. Can I get the ACCN instead of `probeName` as the second column of the output below?

Note: I am using the information in `probeName` to get an ACCN that I can use for annotation, for example:

    "1691167CB1" "NM_002575" "3251556CB1" "NM_001045" "201678" "8187031CB1" "1019621" "1023071"

Output:

                   probeName probeType logicalRow logicalCol   meanSNR     logFC
    17143  1691167CB1_PROBE1 DISCOVERY        278         26 0.5780484 -6.767287
    1460  NM_002575.1_PROBE1 DISCOVERY         26         23 0.6308975  6.655497
    19529  3251556CB1_PROBE1 DISCOVERY        315         57 0.6364621 -4.456800
    19025 NM_001045.1_PROBE1 DISCOVERY        307         66 0.5722841  5.984381
    5311    201678.10_PROBE1 DISCOVERY         87         40 0.5945943 -5.464377
    7264   8187031CB1_PROBE1 DISCOVERY        118         23 0.6700745  6.160234
    9024    1019621.1_PROBE1 DISCOVERY        146          5 0.7937480 -7.693983
    6481    1023071.1_PROBE1 DISCOVERY        105         69 0.7680499 -6.333866

           AveExpr         t      P.Value adj.P.Val         B
    17143 1.379396 -19.89355 8.231218e-06 0.1629205 -2.990755
    1460  1.468584  14.45828 3.746444e-05 0.3707668 -3.020015
    19529 2.990133 -13.15169 5.856035e-05 0.3806185 -3.032661
    19025 1.065737  11.83323 9.614977e-05 0.3806185 -3.049624
    5311  1.250057 -11.28423 1.200560e-04 0.3960448 -3.058378
    7264  2.368912  10.32842 1.812488e-04 0.4675345 -3.076801
    9024  4.481637 -17.38061 8.927544e-05 0.3806185 -3.659384
    6481  4.973425 -14.24512 1.889696e-04 0.4675345 -3.668655
My sample data is here: https://www.dropbox.com/s/kz12cwgqes10oqu/GSM108290.TXT?dl=0. It has an ACCN column.
My second question: is there a single annotation package that covers the Affymetrix, Illumina and Codelink platforms? Thanks
In the file you linked there is no ACCN column. Are you reading this file with readCodelinkSet() or from GEO with GEOquery? How are you getting the ACCN information (i.e. please post the actual code)? Also, this question is not about differential expression (as the title and tags suggest), only about annotation.
I can find "Annotation_NCBI-Acc" as the sixth column in the file attached above. If that is not the ACCN column, then what is it? I have the data already downloaded, so I use readCodelinkSet() instead of GEOquery.
Please find the code which I use to extract information from probeNames() to annotate with h20kcod.db
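The code itself is not reproduced in this thread. As a minimal sketch of the kind of extraction described in the question, assuming the accession is simply the probe name with its `_PROBE<n>` suffix and any trailing version number (such as `.1` or `.10`) removed, something like the following base R would reproduce the example IDs listed above. The helper name `probe_to_accn` is hypothetical and not part of codelink or any annotation package; the probe names are copied from the table in the question.

```r
# Probe names copied from the output table in the question.
probe_names <- c("1691167CB1_PROBE1", "NM_002575.1_PROBE1",
                 "3251556CB1_PROBE1", "NM_001045.1_PROBE1",
                 "201678.10_PROBE1",  "8187031CB1_PROBE1",
                 "1019621.1_PROBE1",  "1023071.1_PROBE1")

# Hypothetical helper (an assumption for illustration only):
# strip the "_PROBE<n>" suffix, then any trailing ".<digits>" version.
probe_to_accn <- function(x) {
  x <- sub("_PROBE[0-9]+$", "", x)  # "NM_002575.1_PROBE1" -> "NM_002575.1"
  sub("\\.[0-9]+$", "", x)          # "NM_002575.1"        -> "NM_002575"
}

accn <- probe_to_accn(probe_names)

# Side-by-side view, with the derived ID next to the original probe name.
print(data.frame(probeName = probe_names, ACCN = accn))
```

Whether the derived IDs actually occur among the `ACCN` keys of the annotation package is a separate question; checking them with `%in%` against those keys would show which probe names fail to match.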
I do not know why you assume that we should read the "ACCN" column as "Annotation_NCBI-Acc"; they may or may not refer to the same thing. Please be specific when referring to data columns (or, in general, to anything you want others to help you with), or define beforehand any alias you wish to use. I am preparing a workaround for your issue and will post it as an answer soon.