Hi,
I am trying to analyze a set of particular genes that are all within a module in addition to the HUB genes of that module. I am finding that when I extract the list of genes with the highest module membership at varying lengths, the genes that populate that list slightly change in their ranking.
For example - when I extract the top 10 HUB genes:
yellow
[1,] "LOC106529418"
[2,] "LOC106513277"
[3,] "LOC106523395"
[4,] "LOC106522037"
[5,] "LOC106524105"
[6,] "LOC106516167"
[7,] "LOC106525246"
[8,] "LOC106516366"
[9,] "LOC106531344"
[10,] "LOC106522808"
And when I extract the top 20, I don't only see new genes populate the bottom of the list, but rather, I see some new genes making up the top 10 positions, too.
yellow [1,] "LOC106529418" [2,] "LOC106516441" [3,] "LOC106528572" [4,] "LOC106513277" [5,] "LOC106523804" [6,] "LOC106523395" [7,] "LOC106522037" [8,] "LOC106524105" [9,] "LOC106516167" [10,] "LOC106525555" [11,] "LOC106525246" [12,] "LOC106523471" [13,] "LOC106516366" [14,] "LOC106515286" [15,] "LOC106517146" [16,] "LOC106531344" [17,] "LOC106522808" [18,] "LOC106534245" [19,] "LOC106527578" [20,] "LOC106520168"
Can anyone explain why the ranking of gene module memberships is being adjusted with with each of my attempts?
Here is the code chunk I am following from the tutorial with the value '10' in the 6th line down as what I change between the results I see above:
topGenesKME = NULL for (c in 1:length(colors20)){ kMErank1 = rank(-geneModuleMembership1[,c]) kMErank2 = rank(-geneModuleMembership2[,c]) maxKMErank = rank(apply(cbind(kMErank1,kMErank2+.00001),1,max)) topGenesKME = cbind(topGenesKME,Gene[maxKMErank<=10]) }; colnames(topGenesKME) = colors20 topGenesKME
Thank you in advance for any insight on this!
Great, thank you!!
I am not sure I completely understand the difference, but as long as this outputs representative HUB genes of the modules, then I am satisfied.
Hi Lluis R., I actually still have a bit of confusion about this.
I was hoping to get a list of the genes that have the highest module membership (HUB genes) within each module. From the code that you helped me with earlier, I found some genes were repeated in the lists from multiple modules. I found this to happen more often in the modules that share similar profiles (MEs). Nonetheless, how would a gene have high module membership in multiple modules. Let me know if I am not making myself clear or need to read the paper more clearly, but I anticipated that there would be unique module HUB genes from the code you helped me with earlier. I can explore merging modules, but I would like to understand them independently, first.
Thank you for your time.
Module Membership can be calculated for several modules. If several modules are similar for a single gene the module membership may be high for several modules.
That's why I suggested ordering by the position of several module memberships. But if you want the HUB genes of each module just subset the genes based on the module membership of its module.