Fwd: GO terms: Annotation for HumanMethylation450
1
0
Entering edit mode
Tim Triche ★ 4.2k
@tim-triche-3561
Last seen 4.3 years ago
United States
Hi all, Not sure how I managed not to cc: the list on this initially. Here's some GO.db code with a sort of "moral" to it ;-) --t Begin forwarded message: library(IlluminaHumanMethylation450k.db) ## allow both singly- and multiply-mapped probes (e.g. for SYMBOL) IlluminaHumanMethylation450kGOall <-toggleProbes(IlluminaHumanMethylation450kGO, 'all') ## now let's look at the differences that result from toggleProbes() mapped_probes_default <- mappedkeys(IlluminaHumanMethylation450kGO) mapped_probes_toggled <- mappedkeys(IlluminaHumanMethylation450kGOall) multimapped <- setdiff( mapped_probes_toggled, mapped_probes_default ) res0 <- mget(head(multimapped), IlluminaHumanMethylation450kGO, ifnotfound=NA) res <- mget(head(multimapped), IlluminaHumanMethylation450kGOall, ifnotfound=NA) ## fetch the GOIDs from the unencumbered toggled map, to get terms for them library(GO.db) GOids <- lapply(res, function(x) unlist(lapply(x, function(y) y$GOID))) GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) head(GOterms) > I'll add this to the docs (next release) > > thanks, > > --t > > > > On Fri, Mar 29, 2013 at 11:24 AM, Fabrice Tourre <fabrice.ciup@gmail.com> wrote: >> Tim, >> >> Thank you very much for your reply. >> I have a list of probe list. >> Do you a example script for me to get the GO terms, instead of GO ID? >> >> The Documentation is not very clear for this. >> http://www.bioconductor.org/packages/2.11/data/annotation/html/Illu minaHumanMethylation450k.db.html >> >> On Fri, Mar 29, 2013 at 12:29 PM, Tim Triche, Jr. <tim.triche@gmail.com> wrote: >> > Oddly enough, the paper from UCSD with Illumina's folks on it (*) used the >> > IlluminaHumanMethylation450k.db package (which I am currently rebuilding to >> > have a startup message about toggleProbes()) to annotate both CpG islands >> > and GO terms. >> > >> > (*) >> > http://idekerlab.ucsd.edu/publications/Documents/Hannum_MolCell_2 012.pdf >> > >> > >> > On Fri, Mar 29, 2013 at 8:49 AM, Fabrice Tourre <fabrice.ciup@gmail.com> >> > wrote: >> >> >> >> Dear list, >> >> >> >> In the annotation file of Infinium HumanMethylation450 BeadChip, >> >> >> >> http://support.illumina.com/documents/MyIllumina/b78d361a-def5 -4adb-ab38-e8990625f053/HumanMethylation450_15017482_v.1.2.csv >> >> >> >> for each probe set, they do not have annotation for GO terms, pathways. >> >> >> >> As they have done in the annotation file: HG- U133_Plus_2.na32.annot.csv. >> >> >> >> Is there some bioconductor package to annotated the Infinium >> >> HumanMethylation450 probes? Given a probe, feed back the GO terms and >> >> pathways. >> >> >> >> Thank you very much in advance. >> >> >> >> _______________________________________________ >> >> Bioconductor mailing list >> >> Bioconductor@r-project.org >> >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> >> Search the archives: >> >> http://news.gmane.org/gmane.science.biology.informatics.conductor >> > >> > >> > >> > >> > -- >> > A model is a lie that helps you see the truth. >> > >> > Howard Skipper > > > > -- > A model is a lie that helps you see the truth. > > Howard Skipper [[alternative HTML version deleted]]
Annotation Pathways GO probe annotate Annotation Pathways GO probe annotate • 1.1k views
ADD COMMENT
0
Entering edit mode
Jinyan Huang ▴ 110
@jinyan-huang-5828
Last seen 10.3 years ago
When I want to get all GO terms on IlluminaHumanMethylation450k. There is a memory problem. It uses more than 10G memory. GOids <- lapply(res2, function(x) unlist(lapply(x, function(y) y$GOID))) GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) Error: memory exhausted (limit reached?) Execution halted --------------------------------------Get_all_GO.R-------------------- -------------------------- library(IlluminaHumanMethylation450k.db) ## allow both singly- and multiply-mapped probes (e.g. for SYMBOL) IlluminaHumanMethylation450kGOall <-toggleProbes(IlluminaHumanMethylation450kGO,'all') ## now let's look at the differences that result from toggleProbes() mapped_probes_toggled <- mappedkeys(IlluminaHumanMethylation450kGOall) res <- mget(mapped_probes_toggled, IlluminaHumanMethylation450kGOall, ifnotfound=NA) res2 <- lapply(res, function(x) x[sapply(x, function(y) y['Evidence']!='IEA')]) ## fetch the GOIDs from the unencumbered toggled map, to get terms for them library(GO.db) GOids <- lapply(res2, function(x) unlist(lapply(x, function(y) y$GOID))) GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) d<-lapply(GOterms,function(x)do.call(rbind,lapply(x,function(y)data.fr ame(y at Term,y at GOID,y at Ontology)))) df<-do.call(rbind,d) len <- sapply(d,function(x)length(x[,1])) probes <- rep(names(d),len) df.out<-data.frame(probes=probes,df) names(df.out)<-c("probe","GoTerm","GOID","GOCategory") write.table(df.out,"GO_all.txt",quote=F,row.names=F,col.names=T,sep="\ t") ---------------------------------------------------------------------- ------------------------------------------ On Tue, Apr 2, 2013 at 7:29 PM, Tim Triche, Jr. <tim.triche at="" gmail.com=""> wrote: > Hi all, > > Not sure how I managed not to cc: the list on this initially. Here's some GO.db code with a sort of "moral" to it ;-) > > --t > > Begin forwarded message: > > library(IlluminaHumanMethylation450k.db) > > ## allow both singly- and multiply-mapped probes (e.g. for SYMBOL) IlluminaHumanMethylation450kGOall <-toggleProbes(IlluminaHumanMethylation450kGO, 'all') > > ## now let's look at the differences that result from toggleProbes() > mapped_probes_default <- mappedkeys(IlluminaHumanMethylation450kGO) > mapped_probes_toggled <- mappedkeys(IlluminaHumanMethylation450kGOall) > multimapped <- setdiff( mapped_probes_toggled, mapped_probes_default ) > > res0 <- mget(head(multimapped), IlluminaHumanMethylation450kGO, ifnotfound=NA) > res <- mget(head(multimapped), IlluminaHumanMethylation450kGOall, ifnotfound=NA) > > ## fetch the GOIDs from the unencumbered toggled map, to get terms for them > > library(GO.db) > GOids <- lapply(res, function(x) unlist(lapply(x, function(y) y$GOID))) > GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) > head(GOterms) > > >> I'll add this to the docs (next release) >> >> thanks, >> >> --t >> >> >> >> On Fri, Mar 29, 2013 at 11:24 AM, Fabrice Tourre <fabrice.ciup at="" gmail.com=""> wrote: >>> Tim, >>> >>> Thank you very much for your reply. >>> I have a list of probe list. >>> Do you a example script for me to get the GO terms, instead of GO ID? >>> >>> The Documentation is not very clear for this. >>> http://www.bioconductor.org/packages/2.11/data/annotation/html/Ill uminaHumanMethylation450k.db.html >>> >>> On Fri, Mar 29, 2013 at 12:29 PM, Tim Triche, Jr. <tim.triche at="" gmail.com=""> wrote: >>> > Oddly enough, the paper from UCSD with Illumina's folks on it (*) used the >>> > IlluminaHumanMethylation450k.db package (which I am currently rebuilding to >>> > have a startup message about toggleProbes()) to annotate both CpG islands >>> > and GO terms. >>> > >>> > (*) >>> > http://idekerlab.ucsd.edu/publications/Documents/Hannum_MolCell_ 2012.pdf >>> > >>> > >>> > On Fri, Mar 29, 2013 at 8:49 AM, Fabrice Tourre <fabrice.ciup at="" gmail.com=""> >>> > wrote: >>> >> >>> >> Dear list, >>> >> >>> >> In the annotation file of Infinium HumanMethylation450 BeadChip, >>> >> >>> >> http://support.illumina.com/documents/MyIllumina/b78d361a-def5 -4adb-ab38-e8990625f053/HumanMethylation450_15017482_v.1.2.csv >>> >> >>> >> for each probe set, they do not have annotation for GO terms, pathways. >>> >> >>> >> As they have done in the annotation file: HG- U133_Plus_2.na32.annot.csv. >>> >> >>> >> Is there some bioconductor package to annotated the Infinium >>> >> HumanMethylation450 probes? Given a probe, feed back the GO terms and >>> >> pathways. >>> >> >>> >> Thank you very much in advance. >>> >> >>> >> _______________________________________________ >>> >> Bioconductor mailing list >>> >> Bioconductor at r-project.org >>> >> https://stat.ethz.ch/mailman/listinfo/bioconductor >>> >> Search the archives: >>> >> http://news.gmane.org/gmane.science.biology.informatics.conductor >>> > >>> > >>> > >>> > >>> > -- >>> > A model is a lie that helps you see the truth. >>> > >>> > Howard Skipper >> >> >> >> -- >> A model is a lie that helps you see the truth. >> >> Howard Skipper > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Best wishes, Jinyan HUANG
ADD COMMENT
0
Entering edit mode
Buy more RAM :-) --t On Apr 3, 2013, at 6:59 AM, Jinyan Huang <jhuang at="" hsph.harvard.edu=""> wrote: > When I want to get all GO terms on IlluminaHumanMethylation450k. There > is a memory problem. It uses more than 10G memory. > > GOids <- lapply(res2, function(x) unlist(lapply(x, function(y) y$GOID))) > GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) > Error: memory exhausted (limit reached?) > Execution halted > > > --------------------------------------Get_all_GO.R------------------ ---------------------------- > > library(IlluminaHumanMethylation450k.db) > ## allow both singly- and multiply-mapped probes (e.g. for SYMBOL) > IlluminaHumanMethylation450kGOall > <-toggleProbes(IlluminaHumanMethylation450kGO,'all') > ## now let's look at the differences that result from toggleProbes() > mapped_probes_toggled <- mappedkeys(IlluminaHumanMethylation450kGOall) > res <- mget(mapped_probes_toggled, IlluminaHumanMethylation450kGOall, > ifnotfound=NA) > res2 <- lapply(res, function(x) x[sapply(x, function(y) y['Evidence']!='IEA')]) > ## fetch the GOIDs from the unencumbered toggled map, to get terms for them > library(GO.db) > GOids <- lapply(res2, function(x) unlist(lapply(x, function(y) y$GOID))) > GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) > d<-lapply(GOterms,function(x)do.call(rbind,lapply(x,function(y)data. frame(y at Term,y at GOID,y at Ontology)))) > df<-do.call(rbind,d) > len <- sapply(d,function(x)length(x[,1])) > probes <- rep(names(d),len) > df.out<-data.frame(probes=probes,df) > names(df.out)<-c("probe","GoTerm","GOID","GOCategory") > write.table(df.out,"GO_all.txt",quote=F,row.names=F,col.names=T,sep= "\t") > > -------------------------------------------------------------------- -------------------------------------------- > > On Tue, Apr 2, 2013 at 7:29 PM, Tim Triche, Jr. <tim.triche at="" gmail.com=""> wrote: >> Hi all, >> >> Not sure how I managed not to cc: the list on this initially. Here's some GO.db code with a sort of "moral" to it ;-) >> >> --t >> >> Begin forwarded message: >> >> library(IlluminaHumanMethylation450k.db) >> >> ## allow both singly- and multiply-mapped probes (e.g. for SYMBOL) IlluminaHumanMethylation450kGOall <-toggleProbes(IlluminaHumanMethylation450kGO, 'all') >> >> ## now let's look at the differences that result from toggleProbes() >> mapped_probes_default <- mappedkeys(IlluminaHumanMethylation450kGO) >> mapped_probes_toggled <- mappedkeys(IlluminaHumanMethylation450kGOall) >> multimapped <- setdiff( mapped_probes_toggled, mapped_probes_default ) >> >> res0 <- mget(head(multimapped), IlluminaHumanMethylation450kGO, ifnotfound=NA) >> res <- mget(head(multimapped), IlluminaHumanMethylation450kGOall, ifnotfound=NA) >> >> ## fetch the GOIDs from the unencumbered toggled map, to get terms for them >> >> library(GO.db) >> GOids <- lapply(res, function(x) unlist(lapply(x, function(y) y$GOID))) >> GOterms <- lapply(GOids, function(x) mget(x, GOTERM, ifnotfound=NA)) >> head(GOterms) >> >> >>> I'll add this to the docs (next release) >>> >>> thanks, >>> >>> --t >>> >>> >>> >>> On Fri, Mar 29, 2013 at 11:24 AM, Fabrice Tourre <fabrice.ciup at="" gmail.com=""> wrote: >>>> Tim, >>>> >>>> Thank you very much for your reply. >>>> I have a list of probe list. >>>> Do you a example script for me to get the GO terms, instead of GO ID? >>>> >>>> The Documentation is not very clear for this. >>>> http://www.bioconductor.org/packages/2.11/data/annotation/html/Il luminaHumanMethylation450k.db.html >>>> >>>> On Fri, Mar 29, 2013 at 12:29 PM, Tim Triche, Jr. <tim.triche at="" gmail.com=""> wrote: >>>>> Oddly enough, the paper from UCSD with Illumina's folks on it (*) used the >>>>> IlluminaHumanMethylation450k.db package (which I am currently rebuilding to >>>>> have a startup message about toggleProbes()) to annotate both CpG islands >>>>> and GO terms. >>>>> >>>>> (*) >>>>> http://idekerlab.ucsd.edu/publications/Documents/Hannum_MolCell_ 2012.pdf >>>>> >>>>> >>>>> On Fri, Mar 29, 2013 at 8:49 AM, Fabrice Tourre <fabrice.ciup at="" gmail.com=""> >>>>> wrote: >>>>>> >>>>>> Dear list, >>>>>> >>>>>> In the annotation file of Infinium HumanMethylation450 BeadChip, >>>>>> >>>>>> http://support.illumina.com/documents/MyIllumina/b78d361a-def5 -4adb-ab38-e8990625f053/HumanMethylation450_15017482_v.1.2.csv >>>>>> >>>>>> for each probe set, they do not have annotation for GO terms, pathways. >>>>>> >>>>>> As they have done in the annotation file: HG- U133_Plus_2.na32.annot.csv. >>>>>> >>>>>> Is there some bioconductor package to annotated the Infinium >>>>>> HumanMethylation450 probes? Given a probe, feed back the GO terms and >>>>>> pathways. >>>>>> >>>>>> Thank you very much in advance. >>>>>> >>>>>> _______________________________________________ >>>>>> Bioconductor mailing list >>>>>> Bioconductor at r-project.org >>>>>> https://stat.ethz.ch/mailman/listinfo/bioconductor >>>>>> Search the archives: >>>>>> http://news.gmane.org/gmane.science.biology.informatics.conductor >>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> A model is a lie that helps you see the truth. >>>>> >>>>> Howard Skipper >>> >>> >>> >>> -- >>> A model is a lie that helps you see the truth. >>> >>> Howard Skipper >> >> [[alternative HTML version deleted]] >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor at r-project.org >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor > > > > -- > Best wishes, > > Jinyan HUANG
ADD REPLY

Login before adding your answer.

Traffic: 326 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6