Calculations (averaging and normalization) with expressionset
Entering edit mode
VanessaOak • 0
Last seen 9 months ago

I have an expressionset with timepoints on one axis and different samples on the other axis.

I need to average the data for different samples which are currently defined as different "rowNames", the metadata for those samples is defined by varLabels: Cell.Type Cell.Number Compound.Name. I would like to average all the samples with the same Compound name at each time point, this would essentially create a new expression set with fewer samples.

What would be the best way to approach this?

Here is my expression set info:

ExpressionSet (storageMode: lockedEnvironment)
assayData: 2384 features, 39 samples 
  element names: exprs 
protocolData: none
  rowNames: A1 A2 ... NA.3 (39 total)
  varLabels: Cell.Type Cell.Number Compound.Name
  varMetadata: X.labelDescription...c..Cell.Type....Cell.Number....Compound.Name... labelDescription
  featureNames: 0 0.006944444 ... 72.5775 (2384 total)
  fvarLabels: timepoints
  fvarMetadata: labelDescription
experimentData: use 'experimentData(object)'
sessionInfo( )
R version 4.0.2 (2020-06-22)
Platform: x86_64-apple-darwin17.0 (64-bit)
Running under: macOS Catalina 10.15.4

Matrix products: default
BLAS:   /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/4.0/Resources/lib/libRlapack.dylib

[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] parallel  stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] readxl_1.3.1        forcats_0.5.0       stringr_1.4.0       dplyr_1.0.2         purrr_0.3.4        
 [6] readr_1.4.0         tidyr_1.1.2         tibble_3.0.4        ggplot2_3.3.2       tidyverse_1.3.0    
[11] RTCA_1.42.0         gtools_3.8.2        RColorBrewer_1.1-2  Biobase_2.50.0      BiocGenerics_0.36.0

loaded via a namespace (and not attached):
 [1] Rcpp_1.0.5        cellranger_1.1.0  pillar_1.4.7      compiler_4.0.2    dbplyr_2.0.0      tools_4.0.2      
 [7] lubridate_1.7.9.2 jsonlite_1.7.1    lifecycle_0.2.0   gtable_0.3.0      pkgconfig_2.0.3   rlang_0.4.9      
[13] reprex_0.3.0      cli_2.2.0         DBI_1.1.0         rstudioapi_0.13   haven_2.3.1       withr_2.3.0      
[19] xml2_1.3.2        httr_1.4.2        fs_1.5.0          generics_0.1.0    vctrs_0.3.5       hms_0.5.3        
[25] grid_4.0.2        tidyselect_1.1.0  glue_1.4.2        R6_2.5.0          fansi_0.4.1       modelr_0.1.8     
[31] magrittr_2.0.1    backports_1.2.0   scales_1.1.1      ellipsis_0.3.1    rvest_0.3.6       assertthat_0.2.1 
[37] colorspace_2.0-0  utf8_1.1.4        stringi_1.5.3     munsell_0.5.0     broom_0.7.2       crayon_1.3.4
RTCA Biobase Bioconductor expressionset • 188 views
Entering edit mode
Last seen 8 hours ago
United States

The easiest way to do that would be to use the avereps function from the limma package.

Login before adding your answer.

Traffic: 371 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6