Question

GLMs in DESeq (convergence problems)

0

Entering edit mode

Gordon Smyth 50k

@gordon-smyth

Last seen 3 hours ago

WEHI, Melbourne, Australia

Hi Seanna, As Simon says, just increasing the number of iterations may well not solve your problem. DESeq just makes transcript-wise calls to the glm() function, and the glm() function doesn't always converge for RNA-Seq data, as was pointed out in a number of posts on the Bioconductor mailing list late last year. A less important annoyance is that transcript-wise calls to glm() are slow. The more complete solution is to implement new glm code that modifies the glm algorithm to prevent divergence. This has been done in the glm functions in the edgeR package. The edgeR glm code is purpose designed for RNA-Seq data, and has the added advantage of being very much faster. It is still possible to have convergence problems even with the new edgeR code, but we've found that they are dramatically less common. Best wishes Gordon -------------------------------------------- Professor Gordon K Smyth, NHMRC Senior Research Fellow, Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Vic 3052, Australia. smyth at wehi.edu.au http://www.wehi.edu.au http://www.statsci.org/smyth > Date: Wed, 09 Mar 2011 13:39:14 +0100 > From: Simon Anders <anders at="" embl.de=""> > To: Seanna McTaggart <smctagga at="" staffmail.ed.ac.uk="">, > "bioconductor at r-project.org" <bioconductor at="" r-project.org=""> > Subject: Re: [BioC] GLMs in DESeq > > Hi Seanna > > On 03/08/2011 05:13 PM, Seanna McTaggart wrote: >> I am using DESeq to determine DE of an RNA-Seq project that has >> multiple genotypes exposed to different treatments. I would like to >> use the GLM functionality to partition the variance in the count data >> between genotype and treatment. However, when I follow the >> suggestions of Simon (Bioconductor Digest, Vol 96, Issue 9), neither >> model (fit0 or fit1) is reaching convergence, and I was wondering if >> it was possible for me to increase the number of iterations to see if >> this would help out. Otherwise I would appreciate any advice on how >> to proceed. > > I've just added a new argument to the function 'nbinomTestGLM', called > 'glmControl', which is a list of GLM control parameters as described in > the 'glm.control' help page. > > So, you could try something like > > fit0 <- nbinomTestGLM( cds, count ~ whatever, > glmControl = list( maxit=75 ) ) > > This should increase the maximum number of iterations from 25 to 75. > > However, it is well possible that this does not help much. If your model > fails to converge for any gene, it might be a deeper problem, and if it > fails to converge only for a few genes, these might be somehow > "stubborn" and won't converge at all, no matter how many iterations you > wait. If necessary, ask again with more details. > > Simon ______________________________________________________________________ The information in this email is confidential and intend...{{dropped:4}}

edgeR DESeq edgeR DESeq • 838 views

ADD COMMENT • link 13.1 years ago Gordon Smyth 50k