Question: Bioinformatics researching Schizophrenia
gravatar for Guest User
3.2 years ago by
Guest User12k
Guest User12k wrote:
I am an undergraduate student and I have been given a project that involves bioinformatics. My supervisor researches chromosome abnormalities and gene expression and he has been given a list of GWAS generated SNPs, that have been linked to Schizophrenia and it has been assigned to me to generate a list of potential genes/regions that we could CROSS REFERENCE with the other list to see if any of these SNPs occur at sites such as miRNA sites, methyltransferase genes, Acetylation and Deacetylation genes etc. that could be implicated in Schizophrenia. I was hoping you would be able to recommend the necessary R-Packages for A. generating a list of potentially implicated regions and B. to cross validate this list with the one I have received. Thank you -- output of sessionInfo(): s -- Sent via the guest posting facility at
ADD COMMENTlink modified 3.2 years ago by Valerie Obenchain ♦♦ 6.4k • written 3.2 years ago by Guest User12k
gravatar for Valerie Obenchain
3.2 years ago by
Valerie Obenchain ♦♦ 6.4k
United States
Valerie Obenchain ♦♦ 6.4k wrote:


To generate a list of gene (or other) regions for your SNPS you could do the following.

1. get SNP location from SNPloc package

(Assuming you only have SNP id's and not locations.)

All pre-built Bioconductor annotations are listed here:

Search for 'SNPloc' and choose the package that was aligned to the same genome as your SNPs. See the man page for examples of how to extract snps into a GRanges using the rsid.

2. get (or make) TxDb package for regions of interest

Search the annotation site for 'TxDb'. These packages contain gene models from various resources and genome builds (apparent in titles). If you don't see a compatible TxDb you can create you own with a function from GenomicFeatures.


Once you have the TxDb you can extract gene, exons, UTRs or other regions. I'll use the known gene table from UCSC:

txdb <- TxDb.Hsapiens.UCSC.hg19.knownGene

There are several extractors, see ?transcriptsBy:

genes <- transcriptsBy(txdb, "gene")

3. overlap SNP locations with regions from TxDb

There are several options for overlaps. You could use findOverlaps() from IRanges or locateVariants() from VariantAnnotation. Alternatively you could use the biomaRt package to extract metadata based on the SNPs.

Hopefully this is enough to get you going. It would be helpful to know what information you have in the 'list' of GWAS generated SNPs. If you run into problems please show an example of what you've tried so we can give a more specific answer.


Valerie Obenchain
Program in Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N, Seattle, WA 98109

Phone: (206) 667-3158

ADD COMMENTlink modified 3.2 years ago by Martin Morgan ♦♦ 20k • written 3.2 years ago by Valerie Obenchain ♦♦ 6.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.2.0
Traffic: 148 users visited in the last hour