Dear all,
for a list of Structural Variants (including deletions, duplications, inversions, translocations), either in VCF or BEDPE format, we would like to have the gene annotations, and the lists of the following sets of genes :
-- fusions (if both breakpoints are in exons, introns, utrs)
-- truncations (if only one breakpoint is in exon, intron, utr; and the other breakpoint is in intergenic area)
-- the genes in the areas that are deleted, duplicated, inverted
could we get all these annotations with a package in R/BioC (possibly with VariantAnnotation) ?
I wrote some scripts in perl based on Annovar, although it would be helpful to implement this in R as well.
thanks a lot,
-- bogdan
VariantAnnotation will give you the calls but I don't think we're annotating stuff at this level.
locateVariants()
would be a good place to start.Thanks, Mike, yes, I am looking into locateVariants(). I have the scripts in perl, although the other parts of the pipelines (especially for SNV/INDELs) is in R, and it would run smoother, shall I have the SV annotation part in R as well.
Hi Mike,
talking about the annotation of the SV, Daniel Cameron mentioned that he started developing an R/BioC package :
https://github.com/PapenfussLab/StructuralVariantAnnotation