Hi recently there has been some publication on the importance of GC and length bias ( Mandelboum et al, 2019, PLOS) . I'm looking into how to do this and came across packages like EDAseq. So it looks pretty straightforward with something like this.
dataOffset <- withinLaneNormalization(data,"gc",
which="full",offset=TRUE)
this provide two slots, one for the normalized counts and the other for the offsets. I'm wondering if I can then use the dataOffset normalize count, say normalized_count as input for TMM normalization follow with voom. Would this work, something like.
y.df <- calcNormFactors( y.normalized_count , method = "TMM" )
voom <- voom(y.ydf )
and then just do the limma DGE?
thanks!
Hey Gordon thanks for your looking over this. One of my main concern is indeed because I use a preranked GSEA approach but in respect to your comment I will take a look at the limma equivalent.
Sorry for the confusion, I pulled out the example directly from the vignette, what I would probably do ( if I do decide to do it ) is normalized for both 'gc' and 'length' bias. As far as input to EdgeR my reading was that it can only be used as an offset so I was not sure if it can be inputed into
calcNormFactors
for TMM.Offsets can't be input to
calcNormFactors
but they would make almost no difference to TMM anyway so IMO ignoring them at the TMM step should be ok.Thanks again @Gordon. Does the limma romer function also protect against length bias? I tried reading the help and I don't understand what the correlation parameter is for?
Or In this case should I just stick with camera? thanks.
Yes,
romer
accounts for inter-gene correlation but I usecamera
myself.camera
will give strongest protection if you setinter.gene.cor=NA
but is then quite conservative.thanks again. I like camera as well since it was able to take the weighted precision from VOOM!
Just in case someone is interested. Here is a link to the primary camera article by Di Wu and Gordon Smyth: https://academic.oup.com/nar/article/40/17/e133/2411151
and a good summary of different pathway tools from limma: http://web.mit.edu/~r/current/arch/i386_linux26/lib/R/library/limma/html/10GeneSetTests.html