Running time for make.cdf.package
2
0
Entering edit mode
Jiang, Zhen ▴ 30
@jiang-zhen-3289
Last seen 9.6 years ago
Dear list, I have a CDF file of size 404,514KB. I'm using the following code to build a package from it. library(affy) library(makecdfenv) ## CDF Package make.cdf.package("xxxx.CDF", species="Homo_sapiens", version="0.0.1",unlink=TRUE, verbose=TRUE) It has been running for two and half days. Is it because the CDF is too large for R to handle or something is wrong with my code? I have successfully build R packages for other much smaller CDF files using the same code, only with different CDF name. The sessionInfo() is listed below. > sessionInfo() R version 2.7.0 (2008-04-22) i386-pc-mingw32 locale: LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 attached base packages: [1] tools stats graphics grDevices utils datasets methods [8] base other attached packages: [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 [4] affyio_1.8.1 Biobase_2.0.1 Thank you very much. Zhen Jiang Notice: This e-mail message, together with any attachme...{{dropped:15}}
cdf cdf • 902 views
ADD COMMENT
0
Entering edit mode
@kasper-daniel-hansen-2979
Last seen 9 months ago
United States
Why don't you tell us the size of the chip, ie. number of probes and number of probesets. You might even want to reveal the actual name (!). Since it is human, I guess it is pretty big. Your main problem is probably memory or a faulty file. In general, making a CDF package is pretty quick. I would be shocked if it took more than 1 hour on a suitable computer. Do you get any output, since you are running with verbose = TRUE. Kasper On Feb 16, 2009, at 9:57 , Jiang, Zhen wrote: > Dear list, > > I have a CDF file of size 404,514KB. I'm using the following code to > build a package from it. > > library(affy) > library(makecdfenv) > > ## CDF Package > make.cdf.package("xxxx.CDF", > species="Homo_sapiens", > version="0.0.1",unlink=TRUE, > verbose=TRUE) > > It has been running for two and half days. Is it because the CDF is > too > large for R to handle or something is wrong with my code? > I have successfully build R packages for other much smaller CDF files > using the same code, only with different CDF name. > > The sessionInfo() is listed below. > >> sessionInfo() > R version 2.7.0 (2008-04-22) > i386-pc-mingw32 > > locale: > LC_COLLATE=English_United States.1252;LC_CTYPE=English_United > States.1252;LC_MONETARY=English_United > States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > > attached base packages: > [1] tools stats graphics grDevices utils datasets > methods > > [8] base > > other attached packages: > [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 > [4] affyio_1.8.1 Biobase_2.0.1 > > Thank you very much. > > Zhen Jiang > > > Notice: This e-mail message, together with any attachme...{{dropped: > 15}} > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
ADD COMMENT
0
Entering edit mode
It's a custom Affy array. Because of company confidentiality, I'm not sure if I should reveal the name. The probeset size is only 3. So the number of probesets are huge comparing to normal Affy arrays, which have probeset size about 9 to 11. I have made three CDF packages before. The results almost came instantly for those three. Now this time, it's has been running for more than 3 days and I haven't got any output yet. That's why I'm asking if it is normal or it indicates something is wrong with my code or the file. Thanks, Zhen Jiang -----Original Message----- From: Kasper Daniel Hansen [mailto:khansen@stat.berkeley.edu] Sent: Monday, February 16, 2009 3:24 PM To: Jiang, Zhen Cc: bioconductor at stat.math.ethz.ch Subject: Re: [BioC] Running time for make.cdf.package Why don't you tell us the size of the chip, ie. number of probes and number of probesets. You might even want to reveal the actual name (!). Since it is human, I guess it is pretty big. Your main problem is probably memory or a faulty file. In general, making a CDF package is pretty quick. I would be shocked if it took more than 1 hour on a suitable computer. Do you get any output, since you are running with verbose = TRUE. Kasper On Feb 16, 2009, at 9:57 , Jiang, Zhen wrote: > Dear list, > > I have a CDF file of size 404,514KB. I'm using the following code to > build a package from it. > > library(affy) > library(makecdfenv) > > ## CDF Package > make.cdf.package("xxxx.CDF", > species="Homo_sapiens", > version="0.0.1",unlink=TRUE, > verbose=TRUE) > > It has been running for two and half days. Is it because the CDF is > too > large for R to handle or something is wrong with my code? > I have successfully build R packages for other much smaller CDF files > using the same code, only with different CDF name. > > The sessionInfo() is listed below. > >> sessionInfo() > R version 2.7.0 (2008-04-22) > i386-pc-mingw32 > > locale: > LC_COLLATE=English_United States.1252;LC_CTYPE=English_United > States.1252;LC_MONETARY=English_United > States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > > attached base packages: > [1] tools stats graphics grDevices utils datasets > methods > > [8] base > > other attached packages: > [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 > [4] affyio_1.8.1 Biobase_2.0.1 > > Thank you very much. > > Zhen Jiang > > > Notice: This e-mail message, together with any attachme...{{dropped: > 15}} > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor Notice: This e-mail message, together with any attachme...{{dropped:12}}
ADD REPLY
0
Entering edit mode
It sounds strange. I would check that it has a traditional PM/MM design. I would also check whether there is several groups in each unit or only one. Anyway, since you have tried doing this before, I would suggest using for example readCdf from affxparser and then construct the cdf environment by hand. That should be somewhat easy, if you have experience with cdf environments. A cdf package is essentially a wrapper around a cdf environment. Kasper On Feb 17, 2009, at 9:26 , Jiang, Zhen wrote: > It's a custom Affy array. Because of company confidentiality, I'm not > sure if I should reveal the name. > The probeset size is only 3. So the number of probesets are huge > comparing to normal Affy arrays, which have probeset size about 9 to > 11. > I have made three CDF packages before. The results almost came > instantly > for those three. > Now this time, it's has been running for more than 3 days and I > haven't > got any output yet. That's why I'm asking if it is normal or it > indicates something is wrong with my code or the file. > > Thanks, > > Zhen Jiang > > > -----Original Message----- > From: Kasper Daniel Hansen [mailto:khansen at stat.berkeley.edu] > Sent: Monday, February 16, 2009 3:24 PM > To: Jiang, Zhen > Cc: bioconductor at stat.math.ethz.ch > Subject: Re: [BioC] Running time for make.cdf.package > > Why don't you tell us the size of the chip, ie. number of probes and > number of probesets. You might even want to reveal the actual name > (!). Since it is human, I guess it is pretty big. Your main problem is > probably memory or a faulty file. In general, making a CDF package is > pretty quick. I would be shocked if it took more than 1 hour on a > suitable computer. > > Do you get any output, since you are running with verbose = TRUE. > > Kasper > > On Feb 16, 2009, at 9:57 , Jiang, Zhen wrote: > >> Dear list, >> >> I have a CDF file of size 404,514KB. I'm using the following code to >> build a package from it. >> >> library(affy) >> library(makecdfenv) >> >> ## CDF Package >> make.cdf.package("xxxx.CDF", >> species="Homo_sapiens", >> version="0.0.1",unlink=TRUE, >> verbose=TRUE) >> >> It has been running for two and half days. Is it because the CDF is >> too >> large for R to handle or something is wrong with my code? >> I have successfully build R packages for other much smaller CDF files >> using the same code, only with different CDF name. >> >> The sessionInfo() is listed below. >> >>> sessionInfo() >> R version 2.7.0 (2008-04-22) >> i386-pc-mingw32 >> >> locale: >> LC_COLLATE=English_United States.1252;LC_CTYPE=English_United >> States.1252;LC_MONETARY=English_United >> States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 >> >> attached base packages: >> [1] tools stats graphics grDevices utils datasets >> methods >> >> [8] base >> >> other attached packages: >> [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 >> [4] affyio_1.8.1 Biobase_2.0.1 >> >> Thank you very much. >> >> Zhen Jiang >> >> >> Notice: This e-mail message, together with any attachme...{{dropped: >> 15}} >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at stat.math.ethz.ch >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > > Notice: This e-mail message, together with any attach...{{dropped:13}}
ADD REPLY
0
Entering edit mode
@wolfgang-huber-3550
Last seen 11 days ago
EMBL European Molecular Biology Laborat…
Hi Jiang, other people may have more specific suggestion on what to do with Window, but what does task manager say about the CPU usage and memory footprint of this R process? Also, it is never a mistake to use the most recent release of R/Bioconductor, i.e. R 2.8.1 and Bioc 2.3 Best wishes Wolfgang Jiang, Zhen wrote: > Dear list, > > I have a CDF file of size 404,514KB. I'm using the following code to > build a package from it. > > library(affy) > library(makecdfenv) > > ## CDF Package > make.cdf.package("xxxx.CDF", > species="Homo_sapiens", > version="0.0.1",unlink=TRUE, > verbose=TRUE) > > It has been running for two and half days. Is it because the CDF is too > large for R to handle or something is wrong with my code? > I have successfully build R packages for other much smaller CDF files > using the same code, only with different CDF name. > > The sessionInfo() is listed below. > >> sessionInfo() > R version 2.7.0 (2008-04-22) > i386-pc-mingw32 > > locale: > LC_COLLATE=English_United States.1252;LC_CTYPE=English_United > States.1252;LC_MONETARY=English_United > States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > > attached base packages: > [1] tools stats graphics grDevices utils datasets methods > > [8] base > > other attached packages: > [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 > [4] affyio_1.8.1 Biobase_2.0.1 > > Thank you very much. > > Zhen Jiang > > > Notice: This e-mail message, together with any attachme...{{dropped:15}} > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Best wishes Wolfgang ---------------------------------------------------- Wolfgang Huber, EMBL-EBI, http://www.ebi.ac.uk/huber
ADD COMMENT
0
Entering edit mode
It's a company server. I don't have much freedom to install software. Do you think the version could be the problem? But I have build packages before without any problem. Thanks, Zhen Jiang -----Original Message----- From: Wolfgang Huber [mailto:huber@ebi.ac.uk] Sent: Monday, February 16, 2009 10:13 AM To: Jiang, Zhen Cc: bioconductor at stat.math.ethz.ch Subject: Re: [BioC] Running time for make.cdf.package Hi Jiang, other people may have more specific suggestion on what to do with Window, but what does task manager say about the CPU usage and memory footprint of this R process? Also, it is never a mistake to use the most recent release of R/Bioconductor, i.e. R 2.8.1 and Bioc 2.3 Best wishes Wolfgang Jiang, Zhen wrote: > Dear list, > > I have a CDF file of size 404,514KB. I'm using the following code to > build a package from it. > > library(affy) > library(makecdfenv) > > ## CDF Package > make.cdf.package("xxxx.CDF", > species="Homo_sapiens", > version="0.0.1",unlink=TRUE, > verbose=TRUE) > > It has been running for two and half days. Is it because the CDF is too > large for R to handle or something is wrong with my code? > I have successfully build R packages for other much smaller CDF files > using the same code, only with different CDF name. > > The sessionInfo() is listed below. > >> sessionInfo() > R version 2.7.0 (2008-04-22) > i386-pc-mingw32 > > locale: > LC_COLLATE=English_United States.1252;LC_CTYPE=English_United > States.1252;LC_MONETARY=English_United > States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > > attached base packages: > [1] tools stats graphics grDevices utils datasets methods > > [8] base > > other attached packages: > [1] makecdfenv_1.18.0 affy_1.18.2 preprocessCore_1.2.1 > [4] affyio_1.8.1 Biobase_2.0.1 > > Thank you very much. > > Zhen Jiang > > > Notice: This e-mail message, together with any attachme...{{dropped:15}} > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Best wishes Wolfgang ---------------------------------------------------- Wolfgang Huber, EMBL-EBI, http://www.ebi.ac.uk/huber Notice: This e-mail message, together with any attachme...{{dropped:12}}
ADD REPLY

Login before adding your answer.

Traffic: 723 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6