GSVA results interpretation
1
0
Entering edit mode
@6951f99c
Last seen 14 months ago
United Kingdom

Hi there, I have done gsva using the code below. But, when plotting the results. I found something that couldnt explain, where gene lists that I used are more enriched in the Control group which cant be true. Any help would be appriciated. Many thanks

enter image description here


gsva <- gsva(
  mapped_vst,
  Glists,
  method = "gsva",kcdf = "Poisson",
  mx.diff = TRUE,
  verbose = FALSE)
GSVA • 1.0k views
ADD COMMENT
0
Entering edit mode
Robert Castelo ★ 3.3k
@rcastelo
Last seen 2 days ago
Barcelona/Universitat Pompeu Fabra

hi,

There are many reasons why a downstream result or a graphical diagnostic, such as a hierarchical clustering, may depart from what you expect. Assuming that it has something to do with the GSVA algorithm, then I'd suggest the following. From the name of your input expression object, mapped_vst, it looks like the values in that object correspond to VST transformed counts, i.e., after calling the function vst() from the DESeq2 package. If that is the case, they are in fact continuous values and you should remove the argument kcdf="Poisson" from your call to the gsva() function.

robert.

ADD COMMENT
0
Entering edit mode

Hi, Thank you for your reply, but removing kcdf="Poisson" didnt effect the results too much. Still, have the same pattern. Many thanks

ADD REPLY
0
Entering edit mode

hi, as I said, there may be many reasons why a downstream result or a graphical diagnostic, such as a hierarchical clustering, may depart from what you expect, and the GSVA algorithm may or may not be one of those reasons. If the GSVA algorithm was the reason, then maybe other methods implementing different algorithms may give you the result you expect. The GSVA package implements three other methods, check out the help page of the gsva() function, doing help(gsva), and try them. If you do not get the expected result with any of the four methods, you may still try other approaches implemented in a different package/software such as, for instance, the singscore or qusage packages. If none of those give you the expected result, probably the problem is not on the single sample gene set enrichment score algorithm.

I'd suggest then to pick a few of the genes for which you know how do they should change across sample groups and plot their expression values by sample group using, for instance, box plots. If they the behave as you expect, maybe you have an error in the code that builds and displays the hierarchical clustering. If they do not behave as you expect, then the data you have probably does not match your hypothesis.

ADD REPLY

Login before adding your answer.

Traffic: 572 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6