Hi,
I'm trying to use minfi's read.metharray.exp() / read.metharray() to read in 48 samples from 6 Illumina EPIC arrays. The arrays were all ordered at the same time from Illumina, but appear to come from two different main batches: two arrays have Sentrix_IDs 20111453005X and the other four arrays have IDs 2013888700XX. When trying to read them in, I get the folllowing error:
> rgSet <- read.metharray.exp(targets=targets, force = TRUE)
[read.metharray] Trying to parse IDAT files from different arrays.
Inferred Array sizes and types:
array size
201114530050_R05C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R06C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R07C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R08C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R01C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R02C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R03C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R04C01 "IlluminaHumanMethylationEPIC" "1052641"
201388870032_R05C01 "Unknown" "1051943"
201388870032_R06C01 "Unknown" "1051943"
201388870032_R07C01 "Unknown" "1051943"
201388870032_R08C01 "Unknown" "1051943"
201388870033_R01C01 "Unknown" "1051943"
201388870033_R02C01 "Unknown" "1051943"
201388870033_R03C01 "Unknown" "1051943"
201388870033_R04C01 "Unknown" "1051943"
201388870035_R05C01 "Unknown" "1051943"
201388870035_R06C01 "Unknown" "1051943"
201388870035_R07C01 "Unknown" "1051943"
201388870035_R08C01 "Unknown" "1051943"
201388870055_R01C01 "Unknown" "1051943"
201388870055_R02C01 "Unknown" "1051943"
201388870055_R03C01 "Unknown" "1051943"
201114530050_R01C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R02C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R03C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530050_R04C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R05C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R06C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R07C01 "IlluminaHumanMethylationEPIC" "1052641"
201114530051_R08C01 "IlluminaHumanMethylationEPIC" "1052641"
201388870032_R01C01 "Unknown" "1051943"
201388870032_R02C01 "Unknown" "1051943"
201388870032_R03C01 "Unknown" "1051943"
201388870032_R04C01 "Unknown" "1051943"
201388870033_R05C01 "Unknown" "1051943"
201388870033_R06C01 "Unknown" "1051943"
201388870033_R07C01 "Unknown" "1051943"
201388870033_R08C01 "Unknown" "1051943"
201388870035_R01C01 "Unknown" "1051943"
201388870035_R02C01 "Unknown" "1051943"
201388870035_R03C01 "Unknown" "1051943"
201388870035_R04C01 "Unknown" "1051943"
201388870055_R04C01 "Unknown" "1051943"
201388870055_R05C01 "Unknown" "1051943"
201388870055_R06C01 "Unknown" "1051943"
201388870055_R07C01 "Unknown" "1051943"
201388870055_R08C01 "Unknown" "1051943"
Error in read.metharray(files, extended = extended, verbose = verbose, :
[read.metharray] Trying to parse different IDAT files, of different size and type.
Looking at the help for ?read.metharray I see that I have another case of arrays with different numbers of probes. However, I can't use the force = TRUE argument as one set of my arrays doesn't even get recognized as "IlluminaHumanMethylationEPIC" because it's number of probes, 1051943 is just under the 1052000 threshold in the internal .guessArrayTypes().
I tried overcoming this by calling debugonce(read.metharray), and manually setting arrayTypes[,1] <- "IlluminaHumanMethylationEPIC" and arrayTypes[,2] <- "ilm10b2.hg19" but I'm still missing something because when I try to step through the rest of the function, I get an error:
Error in `sampleNames<-`(`*tmp*`, value = c("1600101", "1600111", "1600115", : number of new names (1052641) should equal number of rows in AnnotatedDataFrame (1051943)
So I'm stuck for now. Help please!
Thanks,
Jenny
> sessionInfo() R version 3.3.3 (2017-03-06) Platform: x86_64-w64-mingw32/x64 (64-bit) Running under: Windows >= 8 x64 (build 9200) locale: [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252 [3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C [5] LC_TIME=English_United States.1252 attached base packages: [1] splines grid stats4 parallel stats graphics grDevices utils datasets [10] methods base other attached packages: [1] stringr_1.2.0 [2] DMRcate_1.10.8 [3] DMRcatedata_1.10.1 [4] DSS_2.14.0 [5] bsseq_1.10.0 [6] Gviz_1.18.2 [7] minfiData_0.20.0 [8] IlluminaHumanMethylation450kanno.ilmn12.hg19_0.6.0 [9] IlluminaHumanMethylation450kmanifest_0.4.0 [10] matrixStats_0.51.0 [11] missMethyl_1.8.0 [12] RColorBrewer_1.1-2 [13] IlluminaHumanMethylationEPICmanifest_0.3.0 [14] IlluminaHumanMethylationEPICanno.ilm10b2.hg19_0.6.0 [15] minfi_1.20.2 [16] bumphunter_1.14.0 [17] locfit_1.5-9.1 [18] iterators_1.0.8 [19] foreach_1.4.3 [20] Biostrings_2.42.1 [21] XVector_0.14.1 [22] SummarizedExperiment_1.4.0 [23] GenomicRanges_1.26.4 [24] GenomeInfoDb_1.10.3 [25] IRanges_2.8.2 [26] S4Vectors_0.12.2 [27] Biobase_2.34.0 [28] BiocGenerics_0.20.0 [29] limma_3.30.13 loaded via a namespace (and not attached): [1] colorspace_1.3-2 siggenes_1.48.0 mclust_5.2.3 [4] biovizBase_1.22.0 htmlTable_1.9 base64enc_0.1-3 [7] dichromat_2.0-0 base64_2.0 interactiveDisplayBase_1.12.0 [10] AnnotationDbi_1.36.2 R.methodsS3_1.7.1 codetools_0.2-15 [13] methylumi_2.20.0 knitr_1.15.1 Formula_1.2-1 [16] Rsamtools_1.26.1 annotate_1.52.1 cluster_2.0.5 [19] GO.db_3.4.0 R.oo_1.21.0 shiny_1.0.0 [22] httr_1.2.1 backports_1.0.5 assertthat_0.1 [25] Matrix_1.2-8 lazyeval_0.2.0 acepack_1.4.1 [28] htmltools_0.3.5 tools_3.3.3 gtable_0.2.0 [31] doRNG_1.6 Rcpp_0.12.10 multtest_2.30.0 [34] preprocessCore_1.36.0 nlme_3.1-131 rtracklayer_1.34.2 [37] mime_0.5 ensembldb_1.6.2 rngtools_1.2.4 [40] gtools_3.5.0 statmod_1.4.29 XML_3.98-1.5 [43] beanplot_1.2 org.Hs.eg.db_3.4.0 AnnotationHub_2.6.5 [46] zlibbioc_1.20.0 MASS_7.3-45 scales_0.4.1 [49] BSgenome_1.42.0 VariantAnnotation_1.20.3 BiocInstaller_1.24.0 [52] GEOquery_2.40.0 yaml_2.1.14 memoise_1.0.0 [55] gridExtra_2.2.1 ggplot2_2.2.1 pkgmaker_0.22 [58] biomaRt_2.30.0 rpart_4.1-10 reshape_0.8.6 [61] latticeExtra_0.6-28 stringi_1.1.3 RSQLite_1.1-2 [64] genefilter_1.56.0 permute_0.9-4 checkmate_1.8.2 [67] GenomicFeatures_1.26.3 BiocParallel_1.8.1 bitops_1.0-6 [70] nor1mix_1.2-2 lattice_0.20-34 ruv_0.9.6 [73] GenomicAlignments_1.10.1 htmlwidgets_0.8 plyr_1.8.4 [76] magrittr_1.5 R6_2.2.0 Hmisc_4.0-2 [79] DBI_0.6 foreign_0.8-67 survival_2.41-2 [82] RCurl_1.95-4.8 nnet_7.3-12 tibble_1.2 [85] data.table_1.10.4 digest_0.6.12 xtable_1.8-2 [88] httpuv_1.3.3 illuminaio_0.16.0 R.utils_2.5.0 [91] openssl_0.9.6 munsell_0.4.3 registry_0.3 [94] BiasedUrn_1.07 quadprog_1.5-5

I'm getting the same error, but mine is probably because I had one swath in one sample that would not scan. Any resolution so far? Thanks, Adrienne
> RGSet = read.metharray.exp(targets = targets)
Error in read.metharray(files, extended = extended, verbose = verbose, :
[read.metharray] Trying to parse IDAT files with different array size but seemingly all of the same type.
You can force this by 'force=TRUE', see the man page ?read.metharray
> RGSet = read.metharray.exp(targets = targets, force = T)
Error in `sampleNames<-`(`*tmp*`, value = c("1600101", "1600111", "1600115", :
number of new names (1051943) should equal number of rows in AnnotatedDataFrame (1051539)
...............................
R version 3.3.1 (2016-06-21)
Platform: x86_64-apple-darwin13.4.0 (64-bit)
Running under: OS X 10.11.6 (El Capitan)
FYI, I updated R and RStudio to get the latest version of minfi. That worked for me.