Question

WGCNA Hub Gene Selection Method

6

Entering edit mode

brohawndg ▴ 150

@brohawndg-7386

Last seen 8.7 years ago

United States

Hello,

I recently finished a WGCNA analysis with much success!

We found a module whose eigengene is highly associated with disease status and whose GO results indicate the module genes are associated with inflammatory processes that have been recurrently associated with the disease previously.

My question revolves around module hub gene selection methods. I know selecting a hub gene with a high gene significance value is important, but what about intramodular connectivity vs. kME values?

I have seen papers where folks select a hub gene based on it having a high intramodular k (connectivity) value within the module itself, and I have also seen people select a hub gene based on it having a high kME value (its expression values are highly correlated with that module's eigengene values across samples).

Does anyone understand the merits of using intramodular connectivity (k) vs. high modular membership (kME) values to pick a hub gene?

Discussion of these merits or pointing me to a paper would be most appreciated as I am having trouble finding evidence for one being a superior selection method over another.

Thanks!

Dave

WGCNA Hub Genes KmE Intramodular Connectivity • 14k views

ADD COMMENT • link updated 8.8 years ago by Peter Langfelder ★ 3.0k • written 8.8 years ago by brohawndg ▴ 150

score 4 · Answer 1 · 2015-07-13

Dave,

I'm glad you find WGCNA useful. I am not aware of any studies comparing the utility of kME to kIM. Most of the time these two are very strongly related in that the rankings they produce are very similar. We usually use kME because it is simpler to calculate, has an associated p-value, and kMEs can be compared across modules since they all lie between -1 and 1.

Whichever ranking you use, we recommend using the ranking only as a guide to prioritization for further follow-up because it is often the case that multiple genes have very similar, high values of kME and they could all be considered hubs. Other external information should be used as well (for example, a biologist can look at the list of top ranked genes and single out genes based on previous knowledge).

Peter

score 3 · Answer 2 · 2015-07-18

The R commands look good to me. There's also a function called signedKME which essentially does the same as your cor() call, but adds a bunch of checks for correct data formatting and for non-varying genes or genes with too many missing values.

You don't need to specify a signed network in these functions. kME is automatically signed while kIM is not.

Last comment regarding the "similar but not identical"... WGCNA produces a ranking of genes but the user needs to keep in mind that the ranking is always influenced by noise, experimental limitations, violations of assumptions about all aspects of data collection and processing, and other nightmares. The ranking should be taken as a rough suggestion; as such, very similar rankings should be considered equivalent. The top ranked genes should be further filtered using prior knowledge or practical considerations (some genes are easier to study than others); if you see an interesting gene, it doesn't matter whether it's no. 3 or no. 7.

score 0 · Answer 3 · 2015-07-13

Hello Dr. Langfelder,

Thanks for your response. We do have a number of interesting candidate genes with high kME values and high gene significance values within our module of interest. However, the ranking of these genes based on IM connectivity vs kME is similar, but not identical.

To ensure I am calculating MM and kIM correctly, can you confirm the commands below are appropriate?

For modular membership:

MM = as.data.frame(cor(datExpr, MEs, use = "p"));

For intramodular connectivity:

kIM = intramodularConnectivity(adjacency, moduleColors, scaleByMax = TRUE)

To add a little more information, our adjacency matrix was generated as a signed network using bicor as we have a smaller # of samples. Should I indicate our network is signed when running the above commands?

Thanks!

Dave Brohawn