Question

Convert Expression Set into a Matrix or Data Frame

0

Entering edit mode

hshabbeer.09 • 0

@hshabbeer09-9583

Last seen 4.5 years ago

Hi all,

I have an expression set which contains the following:

ExpressionSet
## assayData: 15905 features, 276 samples
## element names: exprs
## protocolData: none
## phenoData
## sampleNames: MNC5 MNC6 ... P0892 (276 total)
## varLabels: HIPO-ID PID ... IC50batch (114 total)
## varMetadata: labelDescription
## featureData
## featureNames: D_001_1 D_001_2 ... ENSG00000273488 (15905 total)
## fvarLabels: name type id subtype
## fvarMetadata: labelDescription

Now i know how to access various elements i.e, RNA seq or DNA methylation data but its complicated and being a new user it becomes a tad bit difficult to use for further analysis etc. To make it easier i would like to convert it in a matrix or a data frame which can contain all the relevant data. For example, rows would be patient ID and columns can be other features like drugs and their dosages or methylation probe results or RNA seq results etc.

I know exprs gives the main data i.e, some survival data in this case, and pData or fData for phenotypic or other features. But what i want to know is whether there is a method i can combine all of them into one simple matrix or data frame? That would make analysis easier. So, is that possible?

Thanks,

Shabbeer

microarray biobase affy R • 7.9k views

ADD COMMENT • link updated 8.3 years ago by Diego Diez ▴ 760 • written 8.3 years ago by hshabbeer.09 • 0

0

Entering edit mode

I don't fully understand your problem, but you can convert your expression set to a data frame using the function:

as.data.frame("your expression set")--where the rows then would be the samples, and in the columns there would be all your features plus your phenoData variables.

Hope that helps,

Efstathios

ADD REPLY • link 8.3 years ago svlachavas ▴ 830

score 2 · Accepted Answer · 2016-01-26

A possibility is as below. Note that this will create a data.frame with a lot of columns (one for each probe + the additional ones for each column in the phenoData object)

eset # ExpressionSet
m <- exprs(eset) # matrix of intensities
pdata <- pData(eset) # data.frame of phenotypic information.

d <- cbind(pdata, t(m)) # data.frame with row being samples and columns variables (including phenotypic and intensity info for each probe)

Example:

> data(sample.ExpressionSet)
> sample.ExpressionSet
ExpressionSet (storageMode: lockedEnvironment)
assayData: 500 features, 26 samples 
  element names: exprs, se.exprs 
protocolData: none
phenoData
  sampleNames: A B ... Z (26 total)
  varLabels: sex type score
  varMetadata: labelDescription
featureData: none
experimentData: use 'experimentData(object)'
Annotation: hgu95av2 

> m <- exprs(sample.ExpressionSet)
> pdata <- pData(sample.ExpressionSet)
> d <- cbind(pdata, t(m))
> d[1:5,1:5]
     sex    type score AFFX-MurIL2_at AFFX-MurIL10_at
A Female Control  0.75       192.7420         97.1370
B   Male    Case  0.40        85.7533        126.1960
C   Male Control  0.73       176.7570         77.9216
D   Male    Case  0.42       135.5750         93.3713
E Female    Case  0.93        64.4939         24.3986