I have not encountered this problem previously and do not have an
immediate explanation.
You state that 16 out of 20 bam files can be processed without a
problem using
mset <- MEDIPS.createSet("0139202.fq.sam.noDUP.bam.qf.bam", BSgenome =
"Hsapiens", sample_name = "0139202").
This surprises me a little bit, because you have to state the whole
BSgenome name, for example BSgenome ="BSgenome.Hsapiens.UCSC.hg19".
[Hervé pointed out a few days ago that I can make the package much more
flexible by allowing other data types. Hervé, thank you for the hint, we will
definitely consider this.] However, this does not seem to be your problem,
as you encounter problems for only 4 out of 20 bam files.
Therefore, I assume that there is something strange with these four
files. Do they contain unmapped reads? MEDIPS 1.10.0 reports an error if
there are unmapped reads in the bam files, but the latest version of
MEDIPS available in the development branch can deal with this. However,
based on the example read you've sent, this might not be the cause of the
error. The last regular status report you get is 'Creating GRange Object...'.
This happens in the getGRange() function, called by MEDIPS.createSet() after
the bam file has been imported. The command that probably causes the
error is:
regions_GRange = GRanges(seqnames=regions$chr,
but I do not understand why you have missing values. In order to check
this, you might want to try to import your bam file yourself using:
# this ignores unmapped reads in the bam file and is implemented in the
# latest dev version
scanFlag = scanBamFlag(isUnmappedQuery = FALSE)
scanParam=ScanBamParam(flag=scanFlag, what = c("rname", "pos",
regions = scanBam(file="0139202.fq.sam.noDUP.bam.qf.bam",
regions = do.call(rbind,lapply(regions, as.data.frame,
regions = data.frame(chr=as.character(as.vector(regions$rname)),
Afterwards, it will be necessary to investigate where your regions
object has NA values.
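
A minimal base R sketch of that check is given here. It assumes regions is a data frame with one row per read; the column names other than chr, and the toy values, are made up for illustration and should be replaced by your own imported data.

```r
# Toy stand-in for the imported reads; replace with your own 'regions'.
# Column names other than 'chr' are assumed for illustration only.
regions <- data.frame(
  chr    = c("chr17", "chr17", NA),
  start  = c(38161416, NA, 100),
  stop   = c(38161464, NA, 148),
  strand = c("-", "+", "+"),
  stringsAsFactors = FALSE
)

# Indices of rows that contain at least one NA
na_rows <- which(!complete.cases(regions))

# Number of NAs per column, to see which field is affected
na_per_column <- colSums(is.na(regions))

print(regions[na_rows, ])
print(na_per_column)
```

With the real data, looking at regions[11345799, ] (the row named in the error message) would be a natural first step.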
Another thing: do you have bam index files in the same directory where you
store the bam file? MEDIPS will make use of these if they are present
(using slightly different code from that stated above). In case there is a
discrepancy between the index and bam files (having the same prefix
name), this might cause a problem as well.
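
One rough way to spot such a discrepancy is sketched below in base R. check_bam_index is a hypothetical helper, not part of MEDIPS or Rsamtools, and the two index file names it tries are an assumption about how the index was named. It only compares modification times, so it is a heuristic and does not verify that the index contents match the bam file.

```r
# Hypothetical helper (not part of MEDIPS): report whether an index file
# sits next to a bam file and whether it is older than the bam file.
check_bam_index <- function(bam_file) {
  # Two common naming conventions for bam index files (assumed here)
  candidates <- c(paste0(bam_file, ".bai"),
                  sub("\\.bam$", ".bai", bam_file))
  index_file <- candidates[file.exists(candidates)][1]
  if (is.na(index_file)) {
    # No index found under either name
    return(list(index = NA_character_, stale = NA))
  }
  # An index older than its bam file may have been built from an
  # earlier version of that file
  stale <- file.info(index_file)$mtime < file.info(bam_file)$mtime
  list(index = index_file, stale = stale)
}

print(check_bam_index("0139202.fq.sam.noDUP.bam.qf.bam"))
```

If stale is TRUE, or if you are unsure where the index came from, moving the index file out of the directory and retrying is a cheap test.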
Finally, you can convert your bam files into bed files yourself and
try importing simple text-based bed files into MEDIPS.
I hope we will identify the root of your error.
On Thu, Aug 15, 2013 at 9:28 PM, Jonathan Ellis
> Dear Lukas and list,
> I'm trying to process a set of BAM files using the latest version of
> MEDIPS (1.10.0), but have run into problems creating MEDIPSset objects
> for some BAM files. The following is an example, but I'm getting the
> same error for 4 out of 20 BAM files.
> > mset <- MEDIPS.createSet("0139202.fq.sam.noDUP.bam.qf.bam",
> + BSgenome = "Hsapiens", sample_name = "0139202")
> Reading bam alignment 0139202.fq.sam.noDUP.bam.qf.bam
> Total number of imported short reads: 13813686
> Creating GRange Object...
> Error in .Call2("solve_user_SEW0", start, end, width, PACKAGE =
> solving row 11345799: range cannot be determined from the supplied
> (too many NAs)
> The data are single-end reads that were aligned with bwa version
> 0.5.9-r16 (the alignments were done some time ago hence the older
> version of bwa), and the corresponding line from the BAM file is:
> 19441177 16 chr17 38161416 18 49M *
> 0 TAGAGTCCGGCGTTCAGGGGCAGGAAGCATCCAGCACGGGAGAAAGATG
> BBBBBBBBBBBBBBBBBd_IIX[ffgb[[YJb^d^[[ggee^^cc^^^_ X0:i:1
> MD:Z:8A7A32 XG:i:0 NM:i:2 XM:i:2 XO:i:0XT:A:U
> I'm unsure whether this is a problem with the MEDIPS package or
> something from GRanges/IRanges. As far as I understand the .Call2
> function is part of IRanges, but I assume it's failing due to
> passed by MEDIPS. Any advice or pointers would be greatly appreciated.
> > sessionInfo()
> R version 3.0.1 (2013-05-16)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>  LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
>  LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
>  LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
>  LC_PAPER=C LC_NAME=C
>  LC_ADDRESS=C LC_TELEPHONE=C
>  LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
> attached base packages:
>  parallel stats graphics grDevices utils datasets
>  base
> other attached packages:
>  BSgenome.Hsapiens.UCSC.hg19_1.3.19 MEDIPS_1.10.0
>  DNAcopy_1.34.0 BSgenome_1.28.0
>  FDb.InfiniumMethylation.hg19_1.0.1 rtracklayer_1.20.4
>  Biostrings_2.28.0 GenomicFeatures_1.12.3
>  AnnotationDbi_1.22.6 Biobase_2.20.1
>  GenomicRanges_1.12.4 IRanges_1.18.2
>  BiocGenerics_0.6.0 BiocInstaller_1.10.3
> loaded via a namespace (and not attached):
>  biomaRt_2.16.0 bitops_1.0-5 DBI_0.2-7 gtools_3.0.0
>  RCurl_1.95-4.1 Rsamtools_1.12.3 RSQLite_0.11.4 stats4_3.0.1
>  tools_3.0.1 XML_3.98-1.1 zlibbioc_1.6.0