Analysis of many Flagged spots
1
0
Entering edit mode
@jdelasherasedacuk-1189
Last seen 8.6 years ago
United Kingdom
Quoting Davide Valentini <davide.valentini at="" ki.se="">: > > > Hi to all, > > I've to deal with a dataset that has a huge amount of flagged as "bad" > spots. My data are from peptide microarrays and the proportion of flags > is around the 90% in each slide. Luckily I have a good set of samples > (35 cases and 35 controls), but I'm not an expert with this kind of > problem. Should I treat the flagged data as missing values and so try to > impute new values instead the flagged "bad" spots ? I know the KNN > imputation or the SVD imputation. > What is normally done with the spots flagged as "bad" (-100, following > GenePix criteria + additional criteria) in cDNA experiments ? > > Sorry if the question looks banal, but as I said I'm not an expert on > this field. Any help is useful, also links regarding flagged data > analysis... > > Thanks in advance, > > Davide Hi Davide, when looking at the actual images, are the high number of "bad" spots indicating artifacts on the slides (dirt, scratches, high and uneven background...) or just a reflection of your probes lighting up just a small proportion of the spots? Genepix flags as "Not Found" spots where there's no signal, but if the background is a little high and uneven, it sometimes will try hard to find a spot, and find a group of pixels to call a spot, which then often fails other quality criteria (shape-related) and teh spot is flagged as "Bad" rather than "Not Found". I think you should look at the scanned images to get a very good idea of what's going on. Regarding what to do with flagged spots... I tend to disregard GenePix flags. Ocassionally there'll be some dust or a bubble affecting the signal on a few spots in a given array. I just proceed, relying on the fact that I have replicates and that I hope I won't get two specs of dust on two arrays on teh exact same place. If a given array concerns me, and I need to use it in my analysis, I may then create some weights (like the Genepix flags) to exclude particular spots from a given slide. I do my analyses using the Limma package, which allows you to use weights at different levels. I find it useful to remove all spots that have negligible signal in both channels (2-colour data) on ALL arrays. In my latest experiments that amounts to up to 30% of the spots. I remove them completely. If you have a large % of spots in your arrays that won't light up with your samples, it may be a good idea to identify them and remove them from the analysis... If you end up having only a relatively small number of spots you will have to be extra careful about your normalisation procedure... but that's another matter. From what you describe, I would want to know first of all why I am getting so many "bad" spots: is there a problem with the hybridisation/slides, or do my samples only light up a small % of spots? Jose -- Dr. Jose I. de las Heras Email: J.delasHeras at ed.ac.uk The Wellcome Trust Centre for Cell Biology Phone: +44 (0)131 6513374 Institute for Cell & Molecular Biology Fax: +44 (0)131 6507360 Swann Building, Mayfield Road University of Edinburgh Edinburgh EH9 3JR UK -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
limma impute limma impute • 1.0k views
ADD COMMENT
0
Entering edit mode
@davide-valentini-2218
Last seen 9.6 years ago
Sorry for the delay in my reply, But I haven't noticed you message before ! Thanks a lot for your suggestions. I submitted your indications to the bio lab, Me too, I was wondering why so many flags, we are discussing the situation with the group. Thanks a lot Davide J.delasHeras at ed.ac.uk wrote: > Quoting Davide Valentini <davide.valentini at="" ki.se="">: > > >> Hi to all, >> >> I've to deal with a dataset that has a huge amount of flagged as "bad" >> spots. My data are from peptide microarrays and the proportion of flags >> is around the 90% in each slide. Luckily I have a good set of samples >> (35 cases and 35 controls), but I'm not an expert with this kind of >> problem. Should I treat the flagged data as missing values and so try to >> impute new values instead the flagged "bad" spots ? I know the KNN >> imputation or the SVD imputation. >> What is normally done with the spots flagged as "bad" (-100, following >> GenePix criteria + additional criteria) in cDNA experiments ? >> >> Sorry if the question looks banal, but as I said I'm not an expert on >> this field. Any help is useful, also links regarding flagged data >> analysis... >> >> Thanks in advance, >> >> Davide >> > > > Hi Davide, > > when looking at the actual images, are the high number of "bad" spots > indicating artifacts on the slides (dirt, scratches, high and uneven > background...) or just a reflection of your probes lighting up just a > small proportion of the spots? > Genepix flags as "Not Found" spots where there's no signal, but if the > background is a little high and uneven, it sometimes will try hard to > find a spot, and find a group of pixels to call a spot, which then > often fails other quality criteria (shape-related) and teh spot is > flagged as "Bad" rather than "Not Found". I think you should look at > the scanned images to get a very good idea of what's going on. > > Regarding what to do with flagged spots... I tend to disregard GenePix > flags. Ocassionally there'll be some dust or a bubble affecting the > signal on a few spots in a given array. I just proceed, relying on the > fact that I have replicates and that I hope I won't get two specs of > dust on two arrays on teh exact same place. If a given array concerns > me, and I need to use it in my analysis, I may then create some > weights (like the Genepix flags) to exclude particular spots from a > given slide. I do my analyses using the Limma package, which allows > you to use weights at different levels. > I find it useful to remove all spots that have negligible signal in > both channels (2-colour data) on ALL arrays. In my latest experiments > that amounts to up to 30% of the spots. I remove them completely. If > you have a large % of spots in your arrays that won't light up with > your samples, it may be a good idea to identify them and remove them > from the analysis... If you end up having only a relatively small > number of spots you will have to be extra careful about your > normalisation procedure... but that's another matter. > > From what you describe, I would want to know first of all why I am > getting so many "bad" spots: is there a problem with the > hybridisation/slides, or do my samples only light up a small % of spots? > > Jose > > -- Davide Valentini PhD - Biostatistician Department of Medical Epidemiology and Biostatistic Karolinska Institute Box 281 SE-171 77 Stockholm SWEDEN Tel: +46-8-524 82294 Fax: +46-8-31 4975
ADD COMMENT

Login before adding your answer.

Traffic: 810 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6