I'm interested in normalizing data from a metagenomic experiment. It's similar to a transcriptomic experiment in that I have counts for several thousand genes across several hundred samples. Two key differences that I'm wary of are (a) this data is much sparser than you would encounter in a transcriptomic experiment and (b) much greater inter sample variability than you would expect in a transcriptomic experiment
Would the mean ratio method (i.e. as called by estimateSizeFactors()) be appropriate for normalization in this case? Are there known core housekeeping genes in a metagenome which can be used as controls for size factor estimation?