Use of pData in assignments
1
0
Entering edit mode
@edwardsstephenepamailepagov-2074
Last seen 10.2 years ago
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/bioconductor/attachments/20070501/ a413d27c/attachment.pl
• 362 views
ADD COMMENT
0
Entering edit mode
@martin-morgan-1513
Last seen 4 months ago
United States
Thanks Stephen for the clear and reproducible example! The problem is that columns of pData need to have associated descriptions in varMetadata, so after your change > validObject(expressionSet, complete=TRUE) Error in validObject(expressionSet, complete = TRUE) : invalid class "ExpressionSet" object: In slot "phenoData" of class "AnnotatedDataFrame": All AnnotatedDataFrame pData column names must be present as rows in varMetadata, and vice versa A safer way to do what you want is, for instance, > expressionSet[["newCol"]] <- sub("rol", "", expressionSet[["type"]]) > phenoData(expressionSet) sampleNames: A, B, ..., Z (26 total) varLabels and varMetadata: sex: Female/Male type: Case/Control score: Testing Score newCol: NA > validObject(expressionSet, complete=TRUE) [1] TRUE (you could also use '$' on expressionSet) To actually provide meaning metadata requires an additional step > varMetadata(expressionSet)["newCol", "labelDescription"] <- + "My new column" though we'd like to make that simpler... It might still be useful in some circumstances to assign using pData, e.g., when not creating a new column, and doing several manipulations in sequence, a strategy might be to extract pData, manipulate the date frame, and reassign. Hope that helps, Martin Edwards.Stephen at epamail.epa.gov writes: > When running the following code to update the phenoData for my eset, the > resulting eset object works fine for downstream analysis. However, when I > try to subset the eset, I get the error noted just above the sessionInfo. > Am I doing something wrong in the assignment, or should I not assign using > the pData method? This usage is still documented, so I assumed it was > safe. I know I'm getting some warning messages with the expressionSet in > the example, but I get the same error when trying to subset a real dataset > created using ReadAffy() and rma() without the corresponding warnings. > >> data(sample.exprSet) >> expressionSet <- as(sample.exprSet,"ExpressionSet") > Warning messages: > 1: The phenoData class is deprecated, use AnnotatedDataFrame (with > ExpressionSet) instead > 2: The phenoData class is deprecated, use AnnotatedDataFrame (with > ExpressionSet) instead >> isCurrent(expressionSet) > R Biobase eSet ExpressionSet > TRUE TRUE TRUE TRUE >> >> expressionSet[1:10,c(2,4,10)] > ExpressionSet (storageMode: lockedEnvironment) > assayData: 10 features, 3 samples > element names: exprs, se.exprs > phenoData > rowNames: B, D, J > varLabels and varMetadata: > sex: Female/Male > type: Case/Control > score: Testing Score > featureData > rowNames: AFFX-MurIL2_at, AFFX-MurIL10_at, ..., AFFX-BioDn-5_at (10 > total) > varLabels and varMetadata: none > experimentData: use 'experimentData(object)' > Annotation [1] "hgu95av2" >> >> p <- cbind (pData(expressionSet), sub("rol", "", > pData(expressionSet)$type)) >> names(p)[4] <- "type2" >> pData(expressionSet) <- p >> expressionSet[1:10,c(2,4,10)] > Error in `row.names<-.data.frame`(`*tmp*`, value = c("sex", "type", > "score", : > invalid 'row.names' length >> >> sessionInfo() > R version 2.5.0 (2007-04-23) > i386-pc-mingw32 > > locale: > LC_COLLATE=English_United States.1252;LC_CTYPE=English_United > States.1252;LC_MONETARY=English_United > States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252 > > attached base packages: > [1] "tools" "stats" "graphics" "grDevices" "utils" "datasets" > > [7] "methods" "base" > > other attached packages: > affy affyio Biobase limma > "1.14.0" "1.4.0" "1.14.0" "2.10.0" >> > > ---------------------------------------- > Stephen W. Edwards, Ph.D. > Systems Biologist, ADHIO, NHEERL, ORD, USEPA > U.S. Environmental Protection Agency (B305-01) > 109 TW Alexander Drive > Research Triangle Park, NC 27711 > Ph#: 919/541-0514 FAX#: 919/685-3221 > > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor -- Martin Morgan Bioconductor / Computational Biology http://bioconductor.org
ADD COMMENT

Login before adding your answer.

Traffic: 564 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6