Question

GSVA results interpretation

0

Entering edit mode

munta.jihad • 0

@6951f99c

Last seen 14 months ago

United Kingdom

Hi there, I have done gsva using the code below. But, when plotting the results. I found something that couldnt explain, where gene lists that I used are more enriched in the Control group which cant be true. Any help would be appriciated. Many thanks

enter image description here


gsva <- gsva(
  mapped_vst,
  Glists,
  method = "gsva",kcdf = "Poisson",
  mx.diff = TRUE,
  verbose = FALSE)

GSVA • 1.0k views

ADD COMMENT • link updated 15 months ago by Robert Castelo ★ 3.3k • written 15 months ago by munta.jihad • 0

score 0 · Answer 1 · 2023-01-12

0

Entering edit mode

Robert Castelo ★ 3.3k

@rcastelo

Last seen 2 days ago

Barcelona/Universitat Pompeu Fabra

hi,

There are many reasons why a downstream result or a graphical diagnostic, such as a hierarchical clustering, may depart from what you expect. Assuming that it has something to do with the GSVA algorithm, then I'd suggest the following. From the name of your input expression object, mapped_vst, it looks like the values in that object correspond to VST transformed counts, i.e., after calling the function vst() from the DESeq2 package. If that is the case, they are in fact continuous values and you should remove the argument kcdf="Poisson" from your call to the gsva() function.

robert.

ADD COMMENT • link 15 months ago Robert Castelo ★ 3.3k

0

Entering edit mode

Hi, Thank you for your reply, but removing kcdf="Poisson" didnt effect the results too much. Still, have the same pattern. Many thanks

ADD REPLY • link 15 months ago munta.jihad • 0

0

Entering edit mode

hi, as I said, there may be many reasons why a downstream result or a graphical diagnostic, such as a hierarchical clustering, may depart from what you expect, and the GSVA algorithm may or may not be one of those reasons. If the GSVA algorithm was the reason, then maybe other methods implementing different algorithms may give you the result you expect. The GSVA package implements three other methods, check out the help page of the gsva() function, doing help(gsva), and try them. If you do not get the expected result with any of the four methods, you may still try other approaches implemented in a different package/software such as, for instance, the singscore or qusage packages. If none of those give you the expected result, probably the problem is not on the single sample gene set enrichment score algorithm.

I'd suggest then to pick a few of the genes for which you know how do they should change across sample groups and plot their expression values by sample group using, for instance, box plots. If they the behave as you expect, maybe you have an error in the code that builds and displays the hierarchical clustering. If they do not behave as you expect, then the data you have probably does not match your hypothesis.

ADD REPLY • link 15 months ago Robert Castelo ★ 3.3k