Question

How many of my genes from my gene list are in each KEGG pathway?

0

Entering edit mode

julia_mcdonough • 0

@41ccc2f8

Last seen 27 days ago

United States

Hi! I have a gene list, ranked by log2FoldChange for the input of gseKEGG to return a kegg object or data frame that has info on enriched pathways. However, I would like to know how many of my genes from my gene list are in each enriched pathway. How can I do this? Thank you!

Pathways clusterProfiler gseKEGG KEGG • 2.8k views

ADD COMMENT • link written 22 months ago by julia_mcdonough • 0

score 2 · Answer 1 · 2024-04-12

2

Entering edit mode

Guido Hooiveld ★ 4.1k

@guido-hooiveld-2020

Last seen 10 days ago

Wageningen University, Wageningen, the …

You can compare the number in the column setSize versus the number of genes in core_enrichment. The former corresponds to the number of genes in a gene set (pathway), and the latter to the number of core enrichment genes a.k.a leading edge genes. These are the genes that contribute to the enrichment of the gene set. "The leading edge subset of a gene set is the subset of members that contribute most to the ES. For a positive ES (such as the one shown here), the leading edge subset is the set of members that appear in the ranked list prior to the peak score. For a negative ES, it is the set of members that appear subsequent to the peak score". (from: https://www.gsea-msigdb.org/gsea/doc/GSEAUserGuideFrame.html)

Also this info may be helpful: https://yulab-smu.top/biomedical-knowledge-mining-book/faq.html#how-to-extract-genes-of-a-specific-termpathway and https://github.com/YuLab-SMU/clusterProfiler/issues/103#issuecomment-338035194.

ADD COMMENT • link 22 months ago Guido Hooiveld ★ 4.1k

0

Entering edit mode

awesome thank you, that's really helpful!

ADD REPLY • link 22 months ago julia_mcdonough • 0

0

Entering edit mode

You can compare the number in the column setSize versus the number of genes in core_enrichment. The former corresponds to the number of genes in a gene set (pathway), and the latter to the number of core enrichment genes a.k.a leading edge genes. These are the genes that contribute to the enrichment of the gene set. "The leading edge subset of a gene set is the subset of members that contribute most to the ES. For a positive ES (such as the one shown here), the leading edge subset is the set of members that appear in the ranked list prior to the peak score. For a negative ES, it is the set of members that appear subsequent to the peak score". (from: https://www.gsea-msigdb.org/gsea/doc/GSEAUserGuideFrame.html) 101 games

Also this info may be helpful: https://yulab-smu.top/biomedical-knowledge-mining-book/faq.html#how-to-extract-genes-of-a-specific-termpathway and https://github.com/YuLab-SMU/clusterProfiler/issues/103#issuecomment-338035194.

Thanks for sharing!

ADD REPLY • link 15 months ago josephasmith1310 • 0