Has anyone done an enrichment analysis for non-model plant species such as Wheat (Triticum aestivum). As a first step, I generated a gene-GO-GOdescription mapping as fetched from Ensembl Plants, which looks something like this:
GeneID GO-ID GO-Description TraesCS3A02G154800 GO:0047617 acyl-CoA hydrolase activity TraesCS2B02G516800 GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds TraesCS2B02G516800 GO:0005975 carbohydrate metabolic process TraesCS2B02G516800 GO:0016787 hydrolase activity TraesCS2B02G516800 GO:0016798 hydrolase activity, acting on glycosyl bonds TraesCS2B02G516800 GO:0008152 metabolic process
I'm trying to generate an MSigDB type gmt file from this table, which can then be used as input for the clusterProfiler function enricher.
I get duplicates found error which is why I can't proceed any further with an enrichment analysis.
Any body has any nifty solution, please do share.