GSEA test by gene permutation instead of phenotype permutation

1

Entering edit mode

Yuan Hao ▴ 90

@yuan-hao-3303

Last seen 9.6 years ago

Dear list, I know there is a gseattperm() function available in the Category package used to perform GSEA test on two group of samples. The permutation is based on phenotype labels of the samples. I am wondering is there a similar function can be used to do the same test, but based on the permutation of genes because I have a very small number of samples ( 6 samples in total for two phenotypes ( 4 versus 2) ). If not, may I get around this problem by transposing my data set to use the gseattper() by permuting on genes? Thank you very much in advance! Kind regards, Yuan

• 2.3k views

ADD COMMENT • link updated 14.5 years ago by James W. MacDonald 65k • written 14.5 years ago by Yuan Hao ▴ 90

0

Entering edit mode

James W. MacDonald 65k

@james-w-macdonald-5106

Last seen 4 hours ago

United States

Hi Yuan, Yuan Hao wrote: > Dear list, > > I know there is a gseattperm() function available in the Category > package used to perform GSEA test on two group of samples. The > permutation is based on phenotype labels of the samples. I am wondering > is there a similar function can be used to do the same test, but based > on the permutation of genes because I have a very small number of > samples ( 6 samples in total for two phenotypes ( 4 versus 2) ). If not, > may I get around this problem by transposing my data set to use the > gseattper() by permuting on genes? Thank you very much in advance! Not really. If you transpose, then you will be computing t-tests between genes within samples rather than between samples within genes. In other words, the function won't know that you have transposed the data, so won't know to permute genes but still compute the t-statistics on a by-gene basis. Anyway, I am not sure this is a reasonable thing to do. Under the null distribution you can argue that the samples are exchangeable because under the null there aren't any differences between samples. However, under the null distribution of no difference between samples, there are still expected to be differences between genes, so they are not exchangeable. Therefore, the distribution you would create by permuting genes would not necessarily correspond to the expectation of no differences between samples. You are probably better off using the t-distribution as your null rather than permuting genes. One way you could do this is to use the 'J-G' statistic proposed in Extensions to gene set enrichment. Jiang Z, Gentleman R. Bioinformatics. 2007 Feb 1;23(3):306-13. Epub 2006 Nov 24. and explained in a more accessible manner in Gene set enrichment analysis using linear models and diagnostics. Oron AP, Jiang Z, Gentleman R Bioinformatics. 2008 Nov 15;24(22):2586-91. Epub 2008 Sep 11. Best, Jim > > Kind regards, > Yuan > > _______________________________________________ > Bioconductor mailing list > Bioconductor at stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor -- James W. MacDonald, M.S. Biostatistician Douglas Lab University of Michigan Department of Human Genetics 5912 Buhl 1241 E. Catherine St. Ann Arbor MI 48109-5618 734-615-7826

ADD COMMENT • link 14.5 years ago James W. MacDonald 65k

Login before adding your answer.