Question

KEGG pathway analysis on a predefined set of genes (with my own annotation)?

0

Entering edit mode

miyakokodama • 0

@miyakokodama-11409

Last seen 2.9 years ago

Denmark

I have a predefined set of interesting genes, along with their corresponding GO IDs and KEGG terms. According to the tutorial, I used TopGO and obtained significantly enriched GO terms using the Fisher's exact tests.

I am hoping to do something similar to perform KEGG pathway analysis, however I haven't been able to figure out how to do that. Does kegga() function in the limma package deal with a list of predefined genes with my own annotation?

Many thanks in advance,

Miyako

Note: KEGG mapper online to find genes involved in particular pathways, but am hoping to get a p-value for each KEGG term.

limma kegga • 1.7k views

ADD COMMENT • link updated 7.5 years ago by Gordon Smyth 50k • written 7.5 years ago by miyakokodama • 0

score 1 · Answer 1 · 2016-10-19

1

Entering edit mode

Aaron Lun ★ 28k

@alun

Last seen 11 hours ago

The city by the bay

Examining ?kegga may reveal some clues as to the expected input:

      de: a character vector of Entrez Gene IDs, or a list of such
          vectors, or an ‘MArrayLM’ fit object.

ADD COMMENT • link 7.5 years ago Aaron Lun ★ 28k

score 0 · Answer 2 · 2016-10-19

It's pretty straightforward. For example:

> GeneID
 [1] "8985" "7423" "8200" "5439" "3710" "1364" "1386" "8323" "3952" "5879"
> library(limma)
> ke <- kegga(GeneID)
> topKEGG(ke)
                                                           Pathway   N DE    P.DE
path:hsa05205                              Proteoglycans in cancer 205  3 0.00242
path:hsa04918                            Thyroid hormone synthesis  74  2 0.00453
path:hsa04060               Cytokine-cytokine receptor interaction 265  3 0.00502
path:hsa04925                  Aldosterone synthesis and secretion  82  2 0.00554
path:hsa04911                                    Insulin secretion  85  2 0.00594
path:hsa04972                                 Pancreatic secretion  96  2 0.00753
path:hsa04915                           Estrogen signaling pathway 100  2 0.00814
path:hsa04933 AGE-RAGE signaling pathway in diabetic complications 101  2 0.00830
path:hsa04922                           Glucagon signaling pathway 103  2 0.00862
path:hsa04151                           PI3K-Akt signaling pathway 341  3 0.01012
path:hsa04670                 Leukocyte transendothelial migration 118  2 0.01121
path:hsa04728                                 Dopaminergic synapse 130  2 0.01349
path:hsa05200                                   Pathways in cancer 397  3 0.01533
path:hsa04310                                Wnt signaling pathway 143  2 0.01618
path:hsa04932            Non-alcoholic fatty liver disease (NAFLD) 151  2 0.01793
path:hsa04390                              Hippo signaling pathway 154  2 0.01861
path:hsa04022                           cGMP-PKG signaling pathway 168  2 0.02193
path:hsa04510                                       Focal adhesion 203  2 0.03123
path:hsa05169                         Epstein-Barr virus infection 204  2 0.03151
path:hsa05203                                 Viral carcinogenesis 205  2 0.03180

I do wonder however what you mean by "with my own annotation"? There is no way to use your own association of KEGG terms with genes unless you do this for all genes, not just for your set of interesting genes.