Hi! I am trying to run Diffbind for differential enrichment analysis of my ChIP-seq datasets.
To start with, .broadPeak bed files (both ChIP & Input) and .bam files were supplied to Diffbind. However, it failed to generate the greylist. It is something about my bam files? The bam files are having ensembl chromosome names. Or if there is anything related to the header of the bam files?
Thanks in advance!
test <- dba.blacklist(test, blacklist=DBA_BLACKLIST_MM10, greylist=TRUE)
No genome detected.
test.greylist <- dba.blacklist(test, Retrieve=DBA_GREYLIST)
Error: No greylist
sessionInfo( )
R version 4.3.0 (2023-04-21)
Platform: aarch64-apple-darwin20 (64-bit)
Running under: macOS Ventura 13.2.1
Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/4.3-arm64/Resources/lib/libRlapack.dylib; LAPACK version 3.11.0
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
time zone: Europe/Stockholm
tzcode source: internal
attached base packages:
[1] stats4 stats graphics grDevices utils datasets methods base
other attached packages:
[1] DiffBind_3.9.6 SummarizedExperiment_1.30.1 Biobase_2.60.0 MatrixGenerics_1.12.0
[5] matrixStats_0.63.0 GenomicRanges_1.52.0 GenomeInfoDb_1.36.0 IRanges_2.34.0
[9] S4Vectors_0.38.1 BiocGenerics_0.46.0 lubridate_1.9.2 forcats_1.0.0
[13] stringr_1.5.0 purrr_1.0.1 readr_2.1.4 tidyr_1.3.0
[17] tibble_3.2.1 ggplot2_3.4.2 tidyverse_2.0.0 dplyr_1.1.2
[21] magrittr_2.0.3
loaded via a namespace (and not attached):
[1] bitops_1.0-7 deldir_1.0-9 rlang_1.1.1 compiler_4.3.0
[5] png_0.1-8 vctrs_0.6.2 pkgconfig_2.0.3 crayon_1.5.2
[9] fastmap_1.1.1 XVector_0.40.0 caTools_1.18.2 utf8_1.2.3
[13] Rsamtools_2.16.0 tzdb_0.4.0 zlibbioc_1.46.0 DelayedArray_0.26.2
[17] BiocParallel_1.34.1 jpeg_0.1-10 irlba_2.3.5.1 parallel_4.3.0
[21] R6_2.5.1 stringi_1.7.12 RColorBrewer_1.1-3 SQUAREM_2021.1
[25] limma_3.56.1 rtracklayer_1.60.0 numDeriv_2016.8-1.1 Rcpp_1.0.10
[29] Matrix_1.5-4.1 timechange_0.2.0 tidyselect_1.2.0 rstudioapi_0.14
[33] yaml_2.3.7 gplots_3.1.3 codetools_0.2-19 hwriter_1.3.2.1
[37] lattice_0.21-8 plyr_1.8.8 withr_2.5.0 ShortRead_1.58.0
[41] coda_0.19-4 Biostrings_2.68.1 pillar_1.9.0 KernSmooth_2.23-21
[45] generics_0.1.3 invgamma_1.1 RCurl_1.98-1.12 truncnorm_1.0-9
[49] emdbook_1.3.12 hms_1.1.3 munsell_0.5.0 scales_1.2.1
[53] ashr_2.2-54 gtools_3.9.4 glue_1.6.2 tools_4.3.0
[57] apeglm_1.22.1 interp_1.1-4 BiocIO_1.10.0 BSgenome_1.68.0
[61] locfit_1.5-9.7 GenomicAlignments_1.36.0 systemPipeR_2.6.0 XML_3.99-0.14
[65] mvtnorm_1.1-3 grid_4.3.0 bbmle_1.0.25 amap_0.8-19
[69] bdsmatrix_1.3-6 latticeExtra_0.6-30 colorspace_2.1-0 GenomeInfoDbData_1.2.10
[73] restfulr_0.0.15 cli_3.6.1 GreyListChIP_1.32.0 fansi_1.0.4
[77] mixsqp_0.3-48 S4Arrays_1.0.4 gtable_0.3.3 digest_0.6.31
[81] ggrepel_0.9.3 rjson_0.2.21 htmlwidgets_1.6.2 htmltools_0.5.5
[85] lifecycle_1.0.3 MASS_7.3-60
```
Hi James,
Thanks for your reply!
For the blacklist, I just used the one provided by DiffBind, which is "DBA_BLACKLIST_MM10". For the greylist, I did not specify the greylist, as I want Diffbind generates the greylist using the input bam I provided. But, I got the error mentioned.
Wanna know why this happens... and how to solve it.
The error you see comes from an internal function called
pv.genome
that inspects the header of your BAM file and tries to infer the genome based on that. It is unable to do so and returns the error you see. I would imagine that it's because you aligned vs Ensembl (no chr in the chromosome name), but then you are using a UCSC based BSgenome package for the blacklist. I don't know if there is a simple fix, so hopefully Rory Stark will be along in a while to provide advice.