Hello everybody,
I have analysed an experiment of ribodepleted samples using both DESeq2 and edgeR robust. I read that one can expect a concordance of 70-80% between both tools. Here, this is not the case.
In this experiment, there are 3 conditions : cell lines of patient with one specific mutation (6 samples), cell lines of patient with another specific mutation (6 samples) and cell lines of controls (3 samples).
I attached the comparison of edgeR and DESeq2 for:
1/ cell lines of patient (12 samples) vs cell lines of controls (3 samples): http://i.imgur.com/gYWJ26b.png?1
2/ cell lines of patient with one specific mutation (6 samples) vs cell lines of patient with another specific mutation (6 samples): http://i.imgur.com/7JClh40.png?1
In both comparisons, I compare all the deregulated genes (top panel), genes with padj<=0.001 (middle panel) and genes with at least an average of 100 reads in one of the conditions (bottom panel). I wanted to see if deregulated genes found by only one tool were enriched in low significant or lowly expressed genes, which does not seem to be the case.
In the first comparison, DEseq2 finds more up-regulated genes, whereas edgeR finds more down-regulated genes.
In the second comparison, barely one third of the deregulated genes are found by both methods.
Any idea for these discrepancies? What do you think about these results?
I can provide the code I used for more details.
Thank you in advance for your help,
Jane
Put some code in your post to show how you ran each of the two analyses.
Sure, here is the DESeq2 code for the first analysis:
And the edgeR code for the first analysis:
Here is the DESeq2 code for analysis 2:
Sorry for the length...
Finally, the edgeR code for analysis 2: