P calls
@stephen-henderson-71
I'd like to use the Present (P) and Absent (A) calls for some rudimentary filtering of data prior to analysis. Is there an appropriate slot for inserting P and A calls within the exprSet object?

I'd like to garner opinions: does anyone else use these, think them worthwhile, or perhaps use some other surrogate?
@rafael-a-irizarry-205
In my opinion, the main reason Affy uses these is that MAS 5.0 has so much noise at "the bottom". If they didn't, all their large fold changes would be for genes with low expression. With RMA, which is what I use, you don't have this problem, so I don't see the need to throw away information. There are other ways to get rid of the "noise" at the bottom: dChip (PM-only) and vsn are two examples.

There is no designated place to stick them into exprSet. You could create another exprSet just for these or, since MAS doesn't give SEs, you could stick them in the se.exprs slot. A better solution (though you need to write some code) is to extend the exprSet class to a new class that includes a slot for these calls.

On Mon, 17 Mar 2003, Stephen Henderson wrote:
> I'd like to use the Present (P) and Absent (A) calls for some rudimentary
> filtering of data prior to analysis. Is there an appropriate slot for
> inserting P and A calls within the exprSet object?
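The idea of a container that carries the calls alongside the expression values can be sketched independently of the exprSet class. The following is only an illustration in plain Python, not Bioconductor code: the class name `ExprWithCalls`, its fields and the `keep_present` method are all hypothetical, and the 50% cut-off is an arbitrary choice.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ExprWithCalls:
    """Hypothetical container: expression values plus a parallel table of calls."""
    probesets: List[str]
    samples: List[str]
    exprs: List[List[float]]  # exprs[i][j]: probe set i on sample j
    calls: List[List[str]]    # calls[i][j]: "P" or "A", same shape as exprs

    def __post_init__(self) -> None:
        if not self.samples:
            raise ValueError("at least one sample is required")
        # Both tables must line up with the probe set and sample labels.
        for name, table in (("exprs", self.exprs), ("calls", self.calls)):
            if len(table) != len(self.probesets):
                raise ValueError(f"{name} needs one row per probe set")
            if any(len(row) != len(self.samples) for row in table):
                raise ValueError(f"{name} needs one column per sample")

    def keep_present(self, min_fraction: float = 0.5) -> "ExprWithCalls":
        """Keep probe sets called "P" on at least min_fraction of the samples."""
        keep = [
            i for i, row in enumerate(self.calls)
            if row.count("P") / len(row) >= min_fraction
        ]
        return ExprWithCalls(
            probesets=[self.probesets[i] for i in keep],
            samples=list(self.samples),
            exprs=[list(self.exprs[i]) for i in keep],
            calls=[list(self.calls[i]) for i in keep],
        )


# Made-up values, purely for illustration.
data = ExprWithCalls(
    probesets=["ps1", "ps2", "ps3"],
    samples=["chipA", "chipB"],
    exprs=[[7.1, 7.4], [3.2, 3.0], [5.5, 2.9]],
    calls=[["P", "P"], ["A", "A"], ["P", "A"]],
)
filtered = data.keep_present(min_fraction=0.5)
print(filtered.probesets)
```

Keeping the calls in their own field, rather than overloading a slot meant for standard errors, means the filter cannot silently be confused with real SE values.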
Well, some biologists put great value in Affymetrix's presence calls, so I looked for papers but could only find one, by Lemon et al. 2002: http://bioinformatics.oupjournals.org/cgi/reprint/18/11/1470?maxtoshow=&HITS=10&hits=10&RESULTFORMAT=&searchid=1047461778635_631&stored_search=&FIRSTINDEX=0&volume=18&firstpage=1470&journalcode=bioinfo. They deal with the MAS4 predecessors of the current (MAS5) presence calls and conclude via spike-in experiments that these statistics have an error rate around 80%.

In my opinion an "absent" call just means high values of the MM probes compared to their corresponding PM probes; other than that I could not detect any difference between genes called "absent" or "present".

Regards
Reinhold Koch
Biozentrum Universitaet Basel
reinhold.koch@unibas.ch
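To make the "high MM relative to PM" reading concrete, here is a rough sketch in plain Python. It computes a per-pair discrimination score (PM - MM) / (PM + MM) and makes a call from a simple vote over the probe pairs. This is a deliberate simplification: MAS 5.0 is documented as applying a signed-rank test to such scores, which is not reproduced here. The function names, the `tau` default and the 50% vote are assumptions for illustration only.

```python
from typing import List, Sequence


def discrimination_scores(pm: Sequence[float], mm: Sequence[float]) -> List[float]:
    """Per-pair score (PM - MM) / (PM + MM); near zero or negative when MM rivals PM."""
    if len(pm) != len(mm) or not pm:
        raise ValueError("pm and mm must be non-empty and of equal length")
    if any(p + m <= 0 for p, m in zip(pm, mm)):
        raise ValueError("intensities must be positive")
    return [(p - m) / (p + m) for p, m in zip(pm, mm)]


def simple_call(pm: Sequence[float], mm: Sequence[float],
                tau: float = 0.015, min_fraction: float = 0.5) -> str:
    """Hypothetical vote: "P" if more than min_fraction of pairs score above tau."""
    scores = discrimination_scores(pm, mm)
    above = sum(1 for r in scores if r > tau)
    return "P" if above / len(scores) > min_fraction else "A"


# Made-up intensities: PM clearly above MM in every pair.
strong = simple_call([900, 1200, 800, 1000], [300, 400, 350, 500])

# Made-up intensities: MM as bright as or brighter than PM, e.g. when the
# "mismatch" probe happens to hybridise to some other transcript.
cross = simple_call([400, 380, 420, 390], [410, 600, 415, 700])

print(strong, cross)
```

The second example shows how a probe set can be called absent purely because its MM probes are bright, whatever the PM signal looks like.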
Hi,

> In my opinion an "absent" call just means high values of the MM probes
> compared to their corresponding PM probes; other than that I could not
> detect any difference between genes called "absent" or "present".

And in many cases that could just mean that the "MM" is in fact a perfect match for something else (especially in complex transcriptomes such as human). The findings in the paper below seem to confirm that notion.

Irizarry, RA, Hobbs, B, Collin, F, Beazer-Barclay, YD, Antonellis, KJ, Scherf, U, Speed, TP (2002) Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data. Accepted for publication in Biostatistics.
http://biosun01.biostat.jhsph.edu/~ririzarr/papers/index.html

Best regards
Wolfgang Huber
@stephen-henderson-71
I agree. I was really trying to elicit advice on an alternative. The funny thing about the MM probes is that if they just removed them, Affy could produce a single chip with the U133A and B genes, and we could all use a PM-only algorithm. BUT then they wouldn't get double the money off some labs.

For practical purposes, however, I still need to filter some data for computer-intensive clustering or learning. So, as I said to R. Irizarry below, I'm not sure whether to try to do this at the probe-set level or at the expression-summary level.

For instance: if not a test of difference from the mean of the specific MM (robust or whatever), why not a test of difference from all local MM values in that sector, or a lower quantile thereof?

Stephen

-----Original Message-----
From: Rafael A. Irizarry [mailto:ririzarr@jhsph.edu]
Sent: Monday, March 17, 2003 3:30 PM
To: Stephen Henderson
Cc: Ben Rubinstein
Subject: RE: [BioC] P calls

I agree that the notion of P and A might make sense. As you suggest, I wouldn't trust Affy's way of doing this. I know of one other attempt at getting to this. I'm cc-ing Ben Rubinstein, who has been working on this problem.

-rafael

On Mon, 17 Mar 2003, Stephen Henderson wrote:
> I think it's probably true that vsn and dChip better assign meaningful
> values to data that is essentially instrument noise. I'm still left
> wondering how best to choose a filter that excludes such data. Whilst for
> modeling purposes it may be nice to keep the noise, as it is information, I
> think for many computer-intensive tasks it is practically best to remove it.
>
> I wonder whether the notion of P and A calls is OK but not best implemented
> under MAS5? Surely it is better to exclude data based upon the probe-level
> data rather than the summary value? No?
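Stephen's suggestion of comparing a probe set against a lower quantile of the local MM values could look roughly like the sketch below, in plain Python. It is one possible reading, not an established method: the nearest-rank quantile, the use of the median PM and the `fold` margin are all arbitrary illustrative choices, and the function names are hypothetical.

```python
import math
from statistics import median
from typing import Sequence


def lower_quantile(values: Sequence[float], q: float) -> float:
    """Nearest-rank quantile: smallest value with at least a fraction q at or below it."""
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def call_by_background(probeset_pm: Sequence[float], sector_mm: Sequence[float],
                       q: float = 0.25, fold: float = 2.0) -> str:
    """Hypothetical rule: "P" if the median PM exceeds fold times the MM quantile."""
    threshold = fold * lower_quantile(sector_mm, q)
    return "P" if median(probeset_pm) > threshold else "A"


# Made-up MM intensities from one sector of the chip, including one bright outlier.
sector_mm = [120, 95, 110, 400, 130, 105, 90, 150]

expressed = call_by_background([600, 750, 580, 820], sector_mm)
silent = call_by_background([100, 140, 90, 160], sector_mm)
print(lower_quantile(sector_mm, 0.25), expressed, silent)
```

Using a lower quantile rather than the mean keeps the threshold from being pulled up by MM probes that cross-hybridise, such as the bright outlier in the example.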
On Tue, Mar 18, 2003 at 11:17:05AM -0000, Stephen Henderson wrote:
> For practical purposes however I still need to filter some data for
> computer intensive clustering or learning. So as I said to R. Irizarry
> below, I'm not sure whether to try and do this looking at the probe-set
> level or at the expression summary level.

It probably makes sense to look at things at the probe-set level, or at the chip probe level as you suggest.

Using the released version of the package, you can implement the ideas you suggest. You will have to write your own 'summary'(*) function (refer to the corresponding vignette for details). This 'summary' could be inspired by one included in the package. In your function, you would store the 'A' and 'P' (or whatever) calls in a sort of "global" vector (you'll most probably need to use 'get' and 'environment')...

The next release (or affy-devel > 1.2.2) would let you do it more easily...

Hopin' it helps,

Laurent

(*) or a "pmcorrect" function if the use of MMs is wished

--
currently at the National Yang-Ming University in Taipei, Taiwan
Laurent Gautier, PhD student
CBS, Building 208, DTU, DK-2800 Lyngby, Denmark
tel: +45 45 25 24 89
http://www.cbs.dtu.dk/laurent
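Laurent's pattern of a custom summary function that records the calls in a shared store as a side effect can be illustrated outside R. The sketch below uses a Python closure, where an R version would presumably use an environment. It does not reproduce the affy package's interface: `make_summary`, the median summary and the fixed `background` threshold are hypothetical stand-ins.

```python
from statistics import median
from typing import Callable, Dict, Sequence, Tuple

SummaryFn = Callable[[str, Sequence[float]], float]


def make_summary(background: float) -> Tuple[SummaryFn, Dict[str, str]]:
    """Build a summary function plus the shared store it writes its calls into."""
    calls: Dict[str, str] = {}  # plays the role of the "global" vector

    def summary(probeset_id: str, pm: Sequence[float]) -> float:
        value = median(pm)  # stand-in for a real summary method
        # Side effect: record the call while the summary is being computed.
        calls[probeset_id] = "P" if value > background else "A"
        return value

    return summary, calls


# Made-up PM intensities for two probe sets on one chip.
probe_data = {
    "ps1": [600.0, 750.0, 580.0],
    "ps2": [100.0, 140.0, 90.0],
}

summary, calls = make_summary(background=200.0)
exprs = {ps: summary(ps, pm) for ps, pm in probe_data.items()}
print(exprs, calls)
```

The caller gets the expression values through the normal return path and can read the calls from the store afterwards, so the summary function keeps the signature its caller expects.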
@stephen-henderson-71
I think that sounds like a nice project. What is the difference with the coming version? Will there be a blank slot in exprSet for such flags?
On Tue, Mar 18, 2003 at 01:29:09PM -0000, Stephen Henderson wrote:
> I think that sounds like a nice project. What is the difference with the
> coming version? Will there be a blank slot in exprSet for such flags?

Almost: there *should* be something to mark summary expression values in the MIAME slot (if there is enough time to do things properly). Currently (affy-devel > 1.2.2) there should be an attribute attached to the returned exprSet saying whether a warning was issued when computing the summary expression value.

L.