When featureCounts is used to count miRNAs present in genome aligned bam files, it gives miRNA size outputs, as well as long fragments (ie 8kb). I think this is happening because overlapping miRNAs exist, and thus featureCounts cannot distinguish where an individual miRNAs starts and ends in this situation (when there is an overlap of miRNAs in the same region, as confirmed by IGV visualization). In our protocol, we align the miRNAseq trimmed data to the genome, and we then use the same genome and its gtf format for miRNA counts. We do not use known miRNAs because we are trying to find novel miRNAs. We cannot use mirdeep2 for novel miRNAs because there are very few known miRNAs (and for some genomes there are 0 known miRNAs).
Thus I have the following questions: a. How can we change featureCounts settings to make this tool count individual miRNAs, rather than a cluster of overlapping miRNAs and label them as one fragment? b. If this is not possible, is there another tool that can be used in order to quantitate and discover all miRNAs (including novel miRNAs) from our genome aligned bam files? c. is it possible to tweak featureCounts to achieve this goal?
Note to other Bioc Support moderators: although this question relates to Galaxy rather than to Bioconductor software, the Subread-featureCounts authors have asked users to send questions to this forum (https://subread.sourceforge.net/), so I think it's a legitimate question here.
I added a featureCounts tag and removed metaMSdata as this question does not relate to the metaMS package.
Thank you for doing this. I could not add the featureCounts tab or any other related tab myself. The metaMSdata tab was one of the only tabs I could originally apply for some reason.