ReactomePA / Reactome: GSEA error message
0
0
Entering edit mode
@martinbusch-15897
Last seen 3.9 years ago

Hi everyone,

I am sorry to ask another question, however, there is an error message that keeps me puzzled. When passing over a list of human entrez IDs to reactomePA for GSEA using

result <- gsePathway(anaData, nPerm=10000, pvalueCutoff=0.2, pAdjustMethod="BH", verbose=FALSE)

Rstudio becomes busy and cannot finish computation. When I manually stop it I get the error message:

Warning message:
In fgsea(pathways = geneSets, stats = geneList, nperm = nPerm, minSize = minGSSize,  :
  There are duplicate gene names, fgsea may produce unexpected results

How can I pass over parameters like maxSize=500 and which parameter can I use to avoid duplicate gene names, although the entrez IDs are unique? Seems like the mapping yields duplicate gene names?!

Thank you so much in advance for your help,

Martin

 

P.S: Input look like this

> head(anaData,10)
    1301     3371     4069    57537    11081     5764   114899     2331     1303     7060 
6.198340 4.505550 3.962765 3.753962 3.461323 3.148910 3.075820 3.034261 3.010098 2.880258 
> length(anaData)
[1] 11317
reactomepa reactome gsea fgsea • 1.8k views
ADD COMMENT
1
Entering edit mode

Could you also paste the resul of any(duplicated(names(anaData)))? This is what is checked at fgsea.

ADD REPLY
0
Entering edit mode

Thank you so much for your comment - in fact I was pretty suprised to see that the result was true - something that should not have happened. There was some mapping involved and it seems that multiple ensembl IDs can be mapped to one entrez ID. I thought that I had this sorted out. Anyways, not it works pretty fine! Thanks a lot!

ADD REPLY

Login before adding your answer.

Traffic: 854 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6