[DESeq2] sequin [ANAQUIN] correction, too many factors in model
Entering edit mode
Last seen 10 months ago


I have an RNA seq dataset obtained from mRNA ribosomal pulldown (TRAP) from a mutated and control cell line. I also have input samples for each of these. Due to the nature of TRAP, these samples are strongly enriched in mRNA, and the inputs are similar to bulk RNA sequencing, with a more complex library. Each sample was spiked with sequins (http://sequins.xyz) from mixture A or B, which I want to use to normalise my data. I have additional potential sources of batch effect including lane and differentiation day. How should I decide what to include in my final DESeq GLM?

What is the best way of checking what the greatest batch effect on this kind of data is? So far I have not been able to obtain a nice correlation of the observed LFC with expected LFC for the sequins. I have tried only using the TRAP samples in the DESeq object in case the samples are too different to analyse together, but this does not fix the problem either.

Many thanks, Chaitra

deseq2 ruvseq anaquin sequins • 252 views
Entering edit mode
Last seen 7 hours ago
United States

Going in depth into how to normalize here is beyond the software support I can provide at the moment. If you have spike ins, you can use these to normalize with controlGenes described in ?estimateSizeFactors. And I like to use MultiQC and PCA plots (see vignette and workflow) to examine batch effects.


Login before adding your answer.

Traffic: 391 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6