I was following a protocol of RUVseq for a method RUVg. After performing a first pass of edger differential analysis to identify the most non-differential expressed genes I took a look on my table top and found out that I had only 7 genes with FDR < 0.9 and all others genes have an FDR of >0.999. The concept of RUVg is to take the most undifferentially expressed genes to find the factors of unwanted variants but if I have only 7 genes with <0.9, doesn't it mean already that RUVg will not help me to account for the batch effect?
First pass of edgeR:
design <- model.matrix( ̃x, data=pData(set)) y <- DGEList(counts=counts(set), group=x) y <- calcNormFactors(y, method="upperquartile") y <- estimateGLMCommonDisp(y, design) y <- estimateGLMTagwiseDisp(y, design) fit <- glmFit(y, design) lrt <- glmLRT(fit, coef=2) top <- topTags(lrt, n=nrow(set))$table