affyPara - rmaPara fails using cdfname
0
0
Entering edit mode
@tobias-verbeke-4173
Last seen 9.6 years ago
L.S. This issue only appears when using alternative cdfs such as the hgu133plus2hsentrezgcdf from BrainArray http://brainarray.mbni.med.umich.edu/brainarray/Database/CustomCDF/gen omic_curated_CDF.asp The patches below make affyPara work as well with such cdf packages. In the current development version of the package (1.17.0) the cdfname is not passed along to the AffyBatch calls, which results in the error reported below. Best, Tobias diff --git a/affyPara/R/preproPara.R b/affyPara/R/preproPara.R index d78eafb..63d2130 100644 --- a/affyPara/R/preproPara.R +++ b/affyPara/R/preproPara.R @@ -109,7 +109,7 @@ ################################# if (verbose) cat("Initialize AffyBatches at slaves ") t0 <- proc.time(); - check <- clusterApply(cluster, object.list, .initAffyBatchSF, object.type, ...) + check <- clusterApply(cluster, object.list, .initAffyBatchSF, object.type, cdfname = cdfname, ...) t1 <- proc.time(); if (verbose) cat(round(t1[3]-t0[3],3),"sec DONE\n") diff --git a/affyPara/R/readAffybatchPara.R b/affyPara/R/readAffybatchPara.R index 8da4ce0..c9edfcb 100644 --- a/affyPara/R/readAffybatchPara.R +++ b/affyPara/R/readAffybatchPara.R @@ -12,7 +12,7 @@ read.affybatchPara <- function(object, phenoData = new("AnnotatedDataFrame"), description = NULL, notes = "", - cluster, verbose=getOption("verbose")) + cluster, verbose=getOption("verbose"), cdfname) { ######## # Checks @@ -66,7 +66,7 @@ ################################## if (verbose) cat("Create AffyBatches at slaves ") t0 <- proc.time(); - check <- clusterApply(cluster, object.list, .initAffyBatchSF, object.type) + check <- clusterApply(cluster, object.list, .initAffyBatchSF, object.type, cdfname = cdfname) t1 <- proc.time(); if (verbose) cat(round(t1[3]-t0[3],3),"sec DONE\n") diff --git a/affyPara/man/readAffybatchPara.Rd b/affyPara/man/readAffybatchPara.Rd index d84b733..f9036b8 100644 --- a/affyPara/man/readAffybatchPara.Rd +++ b/affyPara/man/readAffybatchPara.Rd @@ -22,6 +22,7 @@ \item{cluster}{ A cluster object obtained from the function \link[snow:snow-startstop]{makeCluster} in the SNOW package. For default \code{.affyParaInternalEnv$cl} will be used. } \item{verbose}{ A logical value. If \code{TRUE} it writes out some messages. default: getOption("verbose") } + \item{cdfname}{cdfname to pass to the AffyBatch call} } \details{ Parallelized creation of an AffyBatch object. Especially useful on multi-core machines On 06/18/2012 03:51 PM, Wetzels, Yves [JRDBE Extern] wrote: > Dear > > I am investigating whether affyPara can be used to analyze a large number of Microarray data. > As a test case I have 20 CEL files. This works just fine running > > ... > expressionSetRma<- rmaPara(files) > ... > > If I want to use the "cdfname" parameter however > > ... > expressionSetRma<- rmaPara(files, cdfname="hgu133plus2hsentrezg", verbose=TRUE) > ... > > I receive following error: > > Error in dimnames(eset_mat) <- list(ids, samples.names) : > length of 'dimnames' [1] not equal to array extent > Calls: rmaPara -> preproPara -> .doSummarizationPara > In addition: Warning messages: > 1: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' > 2: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' > Execution halted > > > I saw a thread http://answerpot.com/showthread.php?1408276-affyPara mentioning a bug in the .initAffyBatchSF function date 21/10/2010. > Might this be the same bug ? > > Many thanks for your help and/or ideas. > > > Kind Regards > > Yves Wetzels > Contractor on behalf of > Janssen > Turnhoutseweg 30 > B-2340-Beerse, Belgium > > > Below you`ll find the logfile/environment settings for both test runs. > > ****************************************************************** > Log for expressionSetRma<- rmaPara(files) => OK > > ****************************************************************** > ubuntu at ip-10-239-95-215:~/test$ cat runAffyPara.R.log.withoutcdf > > R version 2.15.0 (2012-03-30) > Copyright (C) 2012 The R Foundation for Statistical Computing > ISBN 3-900051-07-0 > Platform: x86_64-unknown-linux-gnu (64-bit) > > R is free software and comes with ABSOLUTELY NO WARRANTY. > You are welcome to redistribute it under certain conditions. > Type 'license()' or 'licence()' for distribution details. > > Natural language support but running in an English locale > > R is a collaborative project with many contributors. > Type 'contributors()' for more information and > 'citation()' on how to cite R or R packages in publications. > > Type 'demo()' for some demos, 'help()' for on-line help, or > 'help.start()' for an HTML browser interface to help. > Type 'q()' to quit R. > >> library(affyPara) > Loading required package: affy > Loading required package: BiocGenerics > > Attaching package: ?BiocGenerics? > > The following object(s) are masked from ?package:stats?: > > xtabs > > The following object(s) are masked from ?package:base?: > > anyDuplicated, cbind, colnames, duplicated, eval, Filter, Find, > get, intersect, lapply, Map, mapply, mget, order, paste, pmax, > pmax.int, pmin, pmin.int, Position, rbind, Reduce, rep.int, > rownames, sapply, setdiff, table, tapply, union, unique > > Loading required package: Biobase > Welcome to Bioconductor > > Vignettes contain introductory material; view with > 'browseVignettes()'. To cite Bioconductor, see > 'citation("Biobase")', and for packages 'citation("pkgname")'. > > Loading required package: snow > Loading required package: vsn > Loading required package: aplpack > Loading required package: tcltk > Loading Tcl/Tk interface ... done > > Attaching package: ?affyPara? > > The following object(s) are masked from ?package:snow?: > > makeCluster, stopCluster > > Warning message: > In fun(libname, pkgname) : no DISPLAY variable so Tk is not available >> library(hgu133plus2hsentrezgcdf) >> sessionInfo() > R version 2.15.0 (2012-03-30) > Platform: x86_64-unknown-linux-gnu (64-bit) > > locale: > [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C > [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8 > [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 > [7] LC_PAPER=C LC_NAME=C > [9] LC_ADDRESS=C LC_TELEPHONE=C > [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C > > attached base packages: > [1] tcltk stats graphics grDevices utils datasets methods > [8] base > > other attached packages: > [1] hgu133plus2hsentrezgcdf_12.1.0 affyPara_1.16.0 > [3] aplpack_1.2.6 vsn_3.24.0 > [5] snow_0.3-9 affy_1.34.0 > [7] Biobase_2.16.0 BiocGenerics_0.2.0 > > loaded via a namespace (and not attached): > [1] affyio_1.24.0 BiocInstaller_1.4.6 grid_2.15.0 > [4] lattice_0.20-6 limma_3.12.1 preprocessCore_1.18.0 > [7] tools_2.15.0 zlibbioc_1.2.0 >> path <- "/home/ubuntu/test/cel_files" >> makeCluster(2) >> files <- list.celfiles(path, full.names=TRUE) >> files > [1] "/home/ubuntu/test/cel_files/GSM686287.CEL" > [2] "/home/ubuntu/test/cel_files/GSM686289.CEL" > [3] "/home/ubuntu/test/cel_files/GSM686290.CEL" > [4] "/home/ubuntu/test/cel_files/GSM686291.CEL" > [5] "/home/ubuntu/test/cel_files/GSM686298.CEL" > [6] "/home/ubuntu/test/cel_files/GSM686300.CEL" > [7] "/home/ubuntu/test/cel_files/GSM686301.CEL" > [8] "/home/ubuntu/test/cel_files/GSM686303.CEL" > [9] "/home/ubuntu/test/cel_files/GSM686304.CEL" > [10] "/home/ubuntu/test/cel_files/GSM686305.CEL" > [11] "/home/ubuntu/test/cel_files/GSM686310.CEL" > [12] "/home/ubuntu/test/cel_files/GSM686311.CEL" > [13] "/home/ubuntu/test/cel_files/GSM686314.CEL" > [14] "/home/ubuntu/test/cel_files/GSM686316.CEL" > [15] "/home/ubuntu/test/cel_files/GSM686319.CEL" > [16] "/home/ubuntu/test/cel_files/GSM686320.CEL" > [17] "/home/ubuntu/test/cel_files/GSM686322.CEL" > [18] "/home/ubuntu/test/cel_files/GSM686323.CEL" > [19] "/home/ubuntu/test/cel_files/GSM686324.CEL" > [20] "/home/ubuntu/test/cel_files/GSM686325.CEL" >> expressionSetRma<- rmaPara(files) > Loading required package: AnnotationDbi > > > Attaching package: ?hgu133plus2cdf? > > The following object(s) are masked from ?package:hgu133plus2hsentrezgcdf?: > > i2xy, xy2i > > Warning messages: > 1: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' > 2: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' >> stopCluster() >> write.exprs(expressionSetRma,file="/home/ubuntu/test/expressionSetR ma.Rda") >> > > > > ****************************************************************** > Log for expressionSetRma<- rmaPara(files) => ERROR > > ****************************************************************** > ubuntu at ip-10-239-95-215:~/test$ more runAffyPara.R.log.withcdf > > R version 2.15.0 (2012-03-30) > Copyright (C) 2012 The R Foundation for Statistical Computing > ISBN 3-900051-07-0 > Platform: x86_64-unknown-linux-gnu (64-bit) > > R is free software and comes with ABSOLUTELY NO WARRANTY. > You are welcome to redistribute it under certain conditions. > Type 'license()' or 'licence()' for distribution details. > > Natural language support but running in an English locale > > R is a collaborative project with many contributors. > Type 'contributors()' for more information and > 'citation()' on how to cite R or R packages in publications. > > Type 'demo()' for some demos, 'help()' for on-line help, or > 'help.start()' for an HTML browser interface to help. > Type 'q()' to quit R. > >> library(affyPara) > Loading required package: affy > Loading required package: BiocGenerics > > Attaching package: ?BiocGenerics? > > The following object(s) are masked from ?package:stats?: > > xtabs > > The following object(s) are masked from ?package:base?: > > anyDuplicated, cbind, colnames, duplicated, eval, Filter, Find, > get, intersect, lapply, Map, mapply, mget, order, paste, pmax, > pmax.int, pmin, pmin.int, Position, rbind, Reduce, rep.int, > rownames, sapply, setdiff, table, tapply, union, unique > > Loading required package: Biobase > Welcome to Bioconductor > > Vignettes contain introductory material; view with > 'browseVignettes()'. To cite Bioconductor, see > 'citation("Biobase")', and for packages 'citation("pkgname")'. > > Loading required package: snow > Loading required package: vsn > Loading required package: aplpack > Loading required package: tcltk > Loading Tcl/Tk interface ... done > > Attaching package: ?affyPara? > > The following object(s) are masked from ?package:snow?: > > makeCluster, stopCluster > > Warning message: > In fun(libname, pkgname) : no DISPLAY variable so Tk is not available >> library(hgu133plus2hsentrezgcdf) >> sessionInfo() > R version 2.15.0 (2012-03-30) > Platform: x86_64-unknown-linux-gnu (64-bit) > > locale: > [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C > [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8 > [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 > [7] LC_PAPER=C LC_NAME=C > [9] LC_ADDRESS=C LC_TELEPHONE=C > [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C > > attached base packages: > [1] tcltk stats graphics grDevices utils datasets methods > [8] base > > other attached packages: > [1] hgu133plus2hsentrezgcdf_12.1.0 affyPara_1.16.0 > [3] aplpack_1.2.6 vsn_3.24.0 > [5] snow_0.3-9 affy_1.34.0 > [7] Biobase_2.16.0 BiocGenerics_0.2.0 > > loaded via a namespace (and not attached): > [1] affyio_1.24.0 BiocInstaller_1.4.6 grid_2.15.0 > [4] lattice_0.20-6 limma_3.12.1 preprocessCore_1.18.0 > [7] tools_2.15.0 zlibbioc_1.2.0 >> path <- "/home/ubuntu/test/cel_files" >> makeCluster(2) >> files <- list.celfiles(path, full.names=TRUE) >> files > [1] "/home/ubuntu/test/cel_files/GSM686287.CEL" > [2] "/home/ubuntu/test/cel_files/GSM686289.CEL" > [3] "/home/ubuntu/test/cel_files/GSM686290.CEL" > [4] "/home/ubuntu/test/cel_files/GSM686291.CEL" > [5] "/home/ubuntu/test/cel_files/GSM686298.CEL" > [6] "/home/ubuntu/test/cel_files/GSM686300.CEL" > [7] "/home/ubuntu/test/cel_files/GSM686301.CEL" > [8] "/home/ubuntu/test/cel_files/GSM686303.CEL" > [9] "/home/ubuntu/test/cel_files/GSM686304.CEL" > [10] "/home/ubuntu/test/cel_files/GSM686305.CEL" > [11] "/home/ubuntu/test/cel_files/GSM686310.CEL" > [12] "/home/ubuntu/test/cel_files/GSM686311.CEL" > [13] "/home/ubuntu/test/cel_files/GSM686314.CEL" > [14] "/home/ubuntu/test/cel_files/GSM686316.CEL" > [15] "/home/ubuntu/test/cel_files/GSM686319.CEL" > [16] "/home/ubuntu/test/cel_files/GSM686320.CEL" > [17] "/home/ubuntu/test/cel_files/GSM686322.CEL" > [18] "/home/ubuntu/test/cel_files/GSM686323.CEL" > [19] "/home/ubuntu/test/cel_files/GSM686324.CEL" > [20] "/home/ubuntu/test/cel_files/GSM686325.CEL" >> expressionSetRma<- rmaPara(files, cdfname="hgu133plus2hsentrezg", verbose=TRUE) > Error in dimnames(eset_mat) <- list(ids, samples.names) : > length of 'dimnames' [1] not equal to array extent > Calls: rmaPara -> preproPara -> .doSummarizationPara > In addition: Warning messages: > 1: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' > 2: In is.na(xel) : is.na() applied to non-(list or vector) of type 'S4' > Execution halted > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor >
Microarray cdf affyPara Microarray cdf affyPara • 1.0k views
ADD COMMENT

Login before adding your answer.

Traffic: 554 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6