Hi all,
I'm trying to perform a technical validation of MBD enrichment.
I sequenced a single library (150PE - 1.5 Million reads) as a preliminary test and now I'd like to check if the sequenced reads are on (or close to) CpG islands (i.e. coverage). The target species is Daphnia magna, so the genome is availble, but I guess it is necessary, at first, to "find the CpG islands", and then using sofwares for the coverage assessment. Looking at the most popular software perfroming the methylation analysis (MEDIPS, RaMWAS, QSEA), it seems they do not provide tools for this preliminary assessment. Isn't it?
If someone could give some suggestions, I would be really grateful.
Thank you.
Best
Marianna
If you have a
BSgenome
object then reading your data into MEDIPS/QSEA (although I would recommend QSEA) would let you look where the reads fall in relation to the calculated window based CpG density, which should show you a significant skew towards the windows with a higher CG content. But I can't see aBSgenome
object for your species.Another metric is the relative enrichment of the reads compared to the average for your genome, although again this requires a
BSgenome
in MEDIPS/QSEA. I am working on a package to expand QSEA at the moment. I haven't used RaMWAS so can't comment on that.