Machine learning
1
0
Entering edit mode
Weiyin Zhou ▴ 220
@weiyin-zhou-1970
Last seen 9.6 years ago
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/bioconductor/attachments/20070301/ bcef05a5/attachment.pl
• 521 views
ADD COMMENT
0
Entering edit mode
@wolfgang-huber-3550
Last seen 11 days ago
EMBL European Molecular Biology Laborat…
Dear Weiyin, The code you mention is just to filter out genes with little variation, since they are unlikely to be helpful for the classification task but tend to aggravate the 'curse of high-dimensionality' problem. You might also consider a more recent lab, from the 2006 course in Brixen, with nicer explanations, code etc.: http://www.economia.unimi.it/projects/marray/2006/material/Lab3/Machin eLearning/ (progress happens!) More answers below. > I am trying to do classification on 92 Affymetrix data-set using Random > Forest. I followed the examples on "Machine Learning Lab" June 11, > 2004. I have problem to understand some of the codes for the > non-specific filtering step. > > > > Here are the codes in the paper: > > > > * library(genefilter) > > * f1 <- pOverA(0.25, log2(200)) > > * f2 <- function(x) (IQR(x) > 0.5) > > * ff <- filterfun(f1,f2) > > * selected <- genefilter(eset, ff) > > * sum(selected) > > > > > > For "pOverA(0.25, log2(200))", is this means if 25% of samples' > expression values from same gene > log2(200), then it return true? Indeed. > For "f2 <- function(x) (IQR(x) > 0.5)", I tried the help page, but still > don't understand. I assume it have something to do with filter genes > show little variation across samples. > f2 returns TRUE if the interquartile range of x (the difference between 75% and 25% percentile) is > 0.5 > > For "ff <- filterfun(f1,f2)", the help page said "the function returns > FALSE when the first filter function returns FALSE otherwise it return > TRUE". So why we need f2 if is decided by first function, which is f1 > here? I think the man page needs some help here. Both f1 and f2 need to return TRUE if the gene is to be selected. I will poke its maintainer. > Could someone explain this for me? Best wishes Wolfgang ------------------------------------------------------------------ Wolfgang Huber EBI/EMBL Cambridge UK http://www.ebi.ac.uk/huber
ADD COMMENT

Login before adding your answer.

Traffic: 493 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6