Data formats
@culhane-aedin-307
Is there a list of the data formats accepted by Bioconductor modules? I have checked the FAQ and can't find this information. I know that .cel and .gpr (GenePix) files are accepted. What about other formats?

Also, I wish to re-analyse published data, for which the raw data files are frequently not available. If I read the data with read.table(filename, row.names = 1, header = TRUE), which Bioconductor modules are and are not available?

Thanks for your help,
Aedin
@vincent-j-carey-jr-4
> Is there a list of the data formats accepted by bioconductor modules. I have
> checked the FAQ and can't find this information. I know .cel, .gpr (genepix)
> are accepted. What about other formats?
>
> Also if I wish to re-analyse published data, the raw data files are
> frequently not available, if I use read.table(filename, row.names =1, header
> =TRUE), what bioconductor modules are/are not available.

read.marrayRaw (in package marrayInput) indicates capacity to handle .xls, .spot, .gpr ... but your second question is more telling. There is no a priori restriction on module use based on data source. Procedures that are based on the exprSet class can be used provided you populate an exprSet with the information from your raw data file. Typically this involves setting the exprs slot to hold the matrix of expression values (rows are genes, columns are samples), and setting the phenoData slot to hold information about the samples/design (that is, information about the columns of the expression matrix).

edd is an example of a module that requires an exprSet. Some of the affy procedures work from exprSets. genefilter and geneplotter do not require an exprSet; you can use tools in genefilter on a matrix of expression values. The annotation modules are usable in many different contexts.

Perhaps a good way to think about this is: if Bioconductor does not explicitly provide a way to handle a certain data resource, you have the full power of R and Omegahat (www.omegahat.org, which deals with many intersystem interfaces) to transform that data resource into something immediately amenable to analysis with tools in Bioconductor.

If there's a specific data resource you are having trouble with, give us the details and a path may be suggested or coded.
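To make the route described above concrete, here is a minimal sketch in R, not a definitive recipe. It reads a plain text table with read.table, turns it into an expression matrix (rows are genes, columns are samples), and builds a matching table of sample information. Everything about the data is an assumption made for illustration: the file contents, the sample names s1 to s4, the `group` column and the cutoff of 0.5 are all hypothetical. The optional last step uses ExpressionSet, the class that replaced exprSet in later Biobase releases; it is not mentioned in the post above and only runs if Biobase is installed. The variability filter is plain base R standing in for a matrix-based filter, not a genefilter function.

```r
# Hypothetical example data written to a temporary file so the sketch is
# self-contained; in practice `filename` would point at the published table.
filename <- tempfile(fileext = ".txt")
writeLines(c(
  "gene\ts1\ts2\ts3\ts4",
  "geneA\t7.1\t6.9\t9.8\t10.2",
  "geneB\t5.0\t5.2\t5.1\t4.9",
  "geneC\t8.3\t8.1\t6.0\t6.2"
), filename)

# Read the table as in the question: first column becomes the row names.
tab <- read.table(filename, row.names = 1, header = TRUE)

# Expression values as a numeric matrix: rows are genes, columns are samples.
exprs_mat <- as.matrix(tab)

# Sample/design information: one row per column of the expression matrix.
# The `group` labels are invented for this example.
pheno <- data.frame(
  group = c("control", "control", "treated", "treated"),
  row.names = colnames(exprs_mat)
)

# The sample annotation must line up with the matrix columns.
stopifnot(identical(rownames(pheno), colnames(exprs_mat)))

# Matrix-only analysis needs no container class at all. Here a base R
# stand-in keeps genes whose standard deviation across samples exceeds an
# arbitrary cutoff.
keep <- apply(exprs_mat, 1, sd) > 0.5
filtered <- exprs_mat[keep, , drop = FALSE]

# Optional: wrap matrix and sample information in a container object.
# Assumes a Biobase release that provides ExpressionSet (successor to exprSet).
if (requireNamespace("Biobase", quietly = TRUE)) {
  eset <- Biobase::ExpressionSet(
    assayData = exprs_mat,
    phenoData = Biobase::AnnotatedDataFrame(pheno)
  )
}
```

The point of the sketch is only that the data source does not matter once the values are in a matrix with matching sample information: tools that work on matrices can be used directly, and tools that need a container can be used after the optional last step.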