I am trying to use the GEOquery package to extract phenotype data, but data for some rows data seem to be missing for some characteristics, causing the data to be shifted as shown below. Has anyone else seen this and know a workaround (that does not involve manually manipulating data) or is this simply an error in the raw data that GEOquery cannot handle?
>GSE44861=getGEO("GSE44861") >x=pData(phenoData(GSE44861[[1]])) >x[20:30,c("characteristics_ch1","characteristics_ch1.2","characteristics_ch1.3")] characteristics_ch1 characteristics_ch1.2 characteristics_ch1.3 GSM1092928 case_id: 11731 rs5995355 genotype: AA tissue: adjacent nontumor GSM1092929 case_id: 11873 rs5995355 genotype: AG tissue: adjacent nontumor GSM1092930 case_id: 11873 rs5995355 genotype: AG tissue: Tumor GSM1092931 case_id: 11918 tissue: adjacent nontumor GSM1092932 case_id: 11918 tissue: Tumor mir34a: -2.2859 GSM1092933 case_id: 12031 rs5995355 genotype: AG tissue: adjacent nontumor GSM1092934 case_id: 12051 rs5995355 genotype: GG tissue: adjacent nontumor GSM1092935 case_id: 12051 rs5995355 genotype: GG tissue: Tumor GSM1092936 case_id: 12076 rs5995355 genotype: GG tissue: adjacent nontumor GSM1092937 case_id: 12124 rs5995355 genotype: AG tissue: adjacent nontumor GSM1092938 case_id: 12124 rs5995355 genotype: AG tissue: Tumor