in my latest analysis with the DESeq2 package, I noticed that I had a few paralog genes (~30) which shared the same statistics and ensembl ID. I can use only the unique genes, but I was wondering whether it would be more appropriate to remove the duplicated genes before the statistical analysis since they contain redundant information which worsens my statistics. Would you recommend doing this or would you still include them in the analysis and maybe discard them later in downstream visualization? And if so, would you remove them before running results() or DESeq()? Hope this makes sense.