Question

Genelist input for ClusterProfiler

t.b.kuipers • 0

@tbkuipers-23409

Last seen 3.1 years ago

Leiden

Hi everyone,

I have a question about an R package, "clusterProfiler". There are two methods I'm using, gseKEGG and enrichKEGG:

I can’t find an answer to what the exact input for these methods should be:

For gseKEGG, I need a gene list with FC values, but does this gene list contain only DE genes? Or does it contain all genes, DE and not DE (after filtering out lowly expressed genes, of course)?
For enrichKEGG, I believe I only need the gene IDs of the DE genes, right?

I hope someone can help me out!

Tom

R ClusterProfiler Enrichment Pathway • 6.5k views

ADD COMMENT • link updated 20 months ago by Guido Hooiveld ★ 4.1k • written 4.8 years ago by t.b.kuipers • 0

score 3 · Accepted Answer · 2020-04-25

Kevin Blighe ★ 4.0k

@kevin

Last seen 8 weeks ago

Republic of Ireland

Hey Tom,

In general, you can regard the following as being true:

For gseKEGG(), the input is a named numeric vector of fold changes, sorted in decreasing order. It can contain statistically significant genes, non-significant genes, or both. Genes that are not statistically significant will almost certainly have smaller fold changes anyway, and this is taken into account [via ranking] when performing the enrichment.
For enrichKEGG() / enrichGO(), yes, these just take a vector of gene identifiers; the assumption is that these are already genes of particular interest, i.e., genes that you have found to be statistically significantly differentially expressed in your study.

There are a few examples in the vignette.
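As a rough illustration of the two input types described above, here is a minimal sketch in base R. The table `de_table`, its column names (`entrez`, `log2FC`, `padj`), the toy values, and the cutoffs are all assumptions made up for this example, not something from your data. The clusterProfiler calls sit behind a `run_enrichment` switch (also hypothetical) because they need the package installed and internet access to KEGG, and a toy table this small will not give meaningful results.

```r
# Hypothetical DE results table: one row per gene that passed expression filtering.
# Entrez IDs are used because gseKEGG()/enrichKEGG() expect them by default for "hsa".
de_table <- data.frame(
  entrez = c("4312", "8318", "10874", "55143", "991", "6280"),
  log2FC = c(2.1, -1.8, 0.2, -0.1, 1.5, 0.05),
  padj   = c(0.001, 0.01, 0.8, 0.9, 0.03, 0.95),
  stringsAsFactors = FALSE
)

# Input for gseKEGG(): ALL tested genes, as a named numeric vector
# sorted by fold change in decreasing order.
gene_list <- de_table$log2FC
names(gene_list) <- de_table$entrez
gene_list <- sort(gene_list, decreasing = TRUE)

# Input for enrichKEGG(): only the IDs of the genes of interest
# (the cutoffs below are illustrative assumptions).
de_genes <- de_table$entrez[de_table$padj < 0.05 & abs(de_table$log2FC) > 1]

# Set to TRUE only with clusterProfiler installed and internet access.
run_enrichment <- FALSE
if (run_enrichment) {
  library(clusterProfiler)
  gsea_res <- gseKEGG(geneList = gene_list, organism = "hsa")
  ora_res  <- enrichKEGG(gene = de_genes, organism = "hsa",
                         universe = names(gene_list))
}
```

Passing all tested genes as `universe` to enrichKEGG() is a common choice of background rather than a requirement; without it, the function falls back to its default background.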

Kevin

ADD COMMENT • link 4.8 years ago Kevin Blighe ★ 4.0k

Hi Kevin, is the input for gseKEGG the same as the input for gseGO?

I appreciate any help

ADD REPLY • link 20 months ago Sofia • 0

Yes, these are the same. See ?gseGO.

ADD REPLY • link 20 months ago Guido Hooiveld ★ 4.1k