edgeR - Correct approach to compare abundances of genomic regions?
@hollandademello-22618
Last seen 4.6 years ago

Hello all,

This is mostly a conceptual question. I want to test the hypothesis that genomic windows within one organism's genome have higher read mapping abundances than the same regions in a different organism. I am wondering whether edgeR, or any other differential expression software, would be applicable for testing this hypothesis. I understand that read mapping and differential gene and transcript expression have different - and likely harder - challenges than mapping to genic regions, and this leads me to wonder whether the approaches used by edgeR, such as TMM normalization and the shrinkage of dispersions using an empirical Bayes approach, are adequate for genomic data. Thank you for your time.

Best,

Pietro

edger differential expression • 857 views
Aaron Lun ★ 28k
@alun
Last seen 5 hours ago
The city by the bay

I imagine that csaw is close to what you want:

The real challenge in your case is finding which regions in one species are homologous to regions in another species, and applying appropriate normalization for uninteresting biases in mappability, sequenceability, etc. This is possibly a rare situation where you could use input controls in an interaction model to cancel out those biases.
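To illustrate why an interaction with input controls can cancel region-specific biases, here is a minimal Python sketch of the arithmetic only. It is not edgeR's GLM fit. It assumes an enrichment-style assay with a matched input library per species, and all counts, library sizes and function names (`log2_enrichment`, `interaction_log2fc`) are hypothetical. A bias such as mappability that scales both the assay and the input counts in a window by the same factor drops out of the ratio of ratios.

```python
import math

def log2_enrichment(chip, control, chip_lib, control_lib, prior=0.5):
    """log2 of library-size-scaled assay coverage over input coverage for one window.

    `prior` is a small pseudocount added to each count to avoid log(0).
    """
    chip_rate = (chip + prior) / chip_lib
    control_rate = (control + prior) / control_lib
    return math.log2(chip_rate / control_rate)

def interaction_log2fc(species_a, species_b, prior=0.5):
    """Difference between the two species' log2 enrichments over input.

    This is the quantity an assay-by-species interaction term targets.
    """
    return (log2_enrichment(*species_a, prior=prior)
            - log2_enrichment(*species_b, prior=prior))

# Hypothetical counts for one homologous window:
# (assay count, input count, assay library size, input library size).
# Species B maps twice as well here, which doubles BOTH of its counts.
window_a = (200, 100, 1_000_000, 1_000_000)
window_b = (400, 200, 1_000_000, 1_000_000)

# Comparing assay counts alone picks up the mappability difference.
naive = math.log2(window_a[0] / window_b[0])

# The ratio of ratios removes it (prior set to 0 to make the cancellation exact).
adjusted = interaction_log2fc(window_a, window_b, prior=0.0)

print(naive, adjusted)
```

In this toy case the naive comparison suggests a two-fold difference between species, while the input-adjusted value is zero up to floating-point error. In a real analysis the same contrast would be estimated from replicated counts with a proper dispersion model rather than from single ratios.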

@gordon-smyth
Last seen 3 hours ago
WEHI, Melbourne, Australia

edgeR is used all the time for analysing DNA read counts from genomic windows using (for example) reads from ChIP-seq, ATAC-seq, BS-seq or Hi-C. Genomic data causes no problems; in fact, it is generally simpler than RNA-seq.

If your genomic windows are preset (for example promoter regions) then you can use edgeR directly. If you want to merge adjacent windows into larger DE regions while maintaining FDR control, then csaw is specifically designed for that purpose.
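As a rough illustration of merging windows while keeping error control at the region level, the Python sketch below clusters adjacent windows, combines each cluster's window p-values with Simes' method (the method csaw's documentation describes for combining window-level tests), and then applies a Benjamini-Hochberg adjustment across regions. This is not csaw's code: the helper names, the half-open coordinate convention, the `max_gap` parameter and the p-values are all assumptions made for the example.

```python
def simes(pvalues):
    """Simes' combined p-value: min over ranks i of n * p_(i) / i."""
    ordered = sorted(pvalues)
    n = len(ordered)
    return min(n * p / rank for rank, p in enumerate(ordered, start=1))

def merge_windows(windows, max_gap=0):
    """Cluster windows on the same chromosome separated by at most max_gap bases.

    windows: iterable of (chrom, start, end, pvalue), with end exclusive.
    Returns a list of [chrom, start, end, [pvalues]] regions.
    """
    regions = []
    for chrom, start, end, p in sorted(windows):
        last = regions[-1] if regions else None
        if last and last[0] == chrom and start - last[2] <= max_gap:
            last[2] = max(last[2], end)   # extend the current region
            last[3].append(p)
        else:
            regions.append([chrom, start, end, [p]])
    return regions

def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    for rank in range(n, 0, -1):          # walk from largest to smallest p-value
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical window-level results, e.g. p-values from a per-window test.
windows = [
    ("chr1", 0, 100, 0.001),
    ("chr1", 100, 200, 0.04),
    ("chr1", 500, 600, 0.5),
    ("chr2", 0, 100, 0.02),
]

regions = merge_windows(windows, max_gap=0)
region_p = [simes(r[3]) for r in regions]
region_fdr = benjamini_hochberg(region_p)

for (chrom, start, end, _), p, q in zip(regions, region_p, region_fdr):
    print(chrom, start, end, p, q)
```

The point of the design is that the multiple-testing correction is applied once per region rather than once per window, so the FDR refers to the regions that are actually reported.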

As already mentioned by Aaron, the more difficult issue is that you seem to be comparing different species with different genomes.
