Question: [DESeq2] sequin [ANAQUIN] correction, too many factors in model
0
27 days ago by
chaitra.sathyaprakash0 wrote:

Hello,

I have an RNA seq dataset obtained from mRNA ribosomal pulldown (TRAP) from a mutated and control cell line. I also have input samples for each of these. Due to the nature of TRAP, these samples are strongly enriched in mRNA, and the inputs are similar to bulk RNA sequencing, with a more complex library. Each sample was spiked with sequins (http://sequins.xyz) from mixture A or B, which I want to use to normalise my data. I have additional potential sources of batch effect including lane and differentiation day. How should I decide what to include in my final DESeq GLM?

What is the best way of checking what the greatest batch effect on this kind of data is? So far I have not been able to obtain a nice correlation of the observed LFC with expected LFC for the sequins. I have tried only using the TRAP samples in the DESeq object in case the samples are too different to analyse together, but this does not fix the problem either.

Many thanks, Chaitra

deseq2 ruvseq anaquin sequins • 41 views
modified 27 days ago by Michael Love24k • written 27 days ago by chaitra.sathyaprakash0
Answer: [DESeq2] sequin [ANAQUIN] correction, too many factors in model
0
27 days ago by
Michael Love24k
United States
Michael Love24k wrote:

Going in depth into how to normalize here is beyond the software support I can provide at the moment. If you have spike ins, you can use these to normalize with controlGenes described in ?estimateSizeFactors. And I like to use MultiQC and PCA plots (see vignette and workflow) to examine batch effects.