I compose big SingleCellExperiments and SummarizedExperiments, cbinding up to 1000s HDF5s. Would it be possible to speed up the saveHDF5SummarizedExperiment by many folds, perhaps parallelising it?
e.g. for this operation
Start writing assay 1/2 to HDF5 file:
/vast/projects/cellxgene_curated/cellNexus/pseudobulk_joined/assays.h5
/ reading and realizing sparse block 1/744 ... ok
\ Writing it ... OK
/ reading and realizing sparse block 2/744 ...
Thanks.
Thansk Herve', I will try.
For reference this is the parallelization implementation of zellkonverter, to save anndata from SCE
https://github.com/theislab/zellkonverter/issues/129#issuecomment-2473607227
saveTileDBSummarizedExperiment would be awesome. I think we need performance for execution time and memory required to save.