To Whom It May Concern,
I have a question regarding the script 'dexseq_count.py' from the DEXSeq package. My sequences were generated from a paired end sequencing project. My SAM files are the result of a merger of three sorted alignment files per sample. There are three files for each sample that are produced after adapter/quality trimming: one large read list of sequences with the interweaved R1 and R2 partners as well as one read list for sequences with only the R1 remaining and one read list for sequences with only the R2 remaining. Using the aligner bowtie2 separately on all three read lists, the merged file utilizes the most information from my samples with both the preserved paired reads as well as orphaned paired read alignments.
Will this cause any issues when using the count script with the argument “-p yes”? Will the program count all the reads equally or account for the orphans from paired in an appropriate way? Otherwise, would you recommend discarding the orphaned reads from the analysis (they are usually less than 3% of total reads for each sample).
Thank you in advance, arom
Thank you, that is exactly what I was worried about.