Closed:DiffBind: Plotting dba.Heatmap produces different clustering depending on whether unrelated ChIPs are included
2
0
Entering edit mode
timslittle • 0
@timslittle-12734
Last seen 7.1 years ago

Hello,

I'm trying to distinguish whether the overall binding pattern of a protein (Protein A) differs between two conditions. When I use a sampleSheet.csv that only includes the data for Protein A and plot the Heatmap (after counting, analyzing etc...) the time points are nested among each other in the clustering dendrogram. I conclude that there are little differences in binding between the two timepoints.

However when I include the data for another protein (Protein B) being ChIPed for, the Heatmap produced separates the binding data for the two different proteins (as expected), however the dendrogram for Protein A has now changed and does in fact separate the time points quite clearly.

Why does the combination of Protein A + B in the counting/analysis process create a different clustering relationship for Protein A than if Protein A is included in the analysis on its own?

DiffBind code:

samp <- read.csv('samplesheet.csv', header=T)
dbsamp <- dba(sampleSheet = samp)
dbcount <- dba.count(dbsamp,
                      bUseSummarizeOverlaps = TRUE,
                      bParallel = T)
dbcontrast <- dba.contrast(dbcount,
minMembers = 3,
                            categories = c(DBA_TREATMENT, DBA_FACTOR))
dbanalyze <- dba.analyze(dbcontrast, method = DBA_EDGER)
dba.plotHeatmap(dbanalyze, correlations = T,
                ColAttributes = DBA_GROUP,
                RowAttributes = DBA_GROUP,
                method = DBA_EDGER)

Session Info:

> sessionInfo()
R version 3.3.3 (2017-03-06)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 16.04.2 LTS

locale:
[1] LC_CTYPE=en_GB.UTF-8       LC_NUMERIC=C               LC_TIME=en_GB.UTF-8        LC_COLLATE=en_GB.UTF-8  
[5] LC_MONETARY=en_GB.UTF-8    LC_MESSAGES=en_GB.UTF-8    LC_PAPER=en_GB.UTF-8       LC_NAME=C               
[9] LC_ADDRESS=C               LC_TELEPHONE=C             LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] parallel  stats4    stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] RColorBrewer_1.1-2         pheatmap_1.0.8             DiffBind_2.2.9             SummarizedExperiment_1.4.0
[5] Biobase_2.34.0             GenomicRanges_1.26.4       GenomeInfoDb_1.10.3        IRanges_2.8.2           
[9] S4Vectors_0.12.2           BiocGenerics_0.20.0       

loaded via a namespace (and not attached):
[1] Category_2.40.0          bitops_1.0-6             tools_3.3.3              backports_1.0.5          R6_2.2.0              
[6] rpart_4.1-10             KernSmooth_2.23-15       Hmisc_4.0-2              DBI_0.6                  lazyeval_0.2.0        
[11] colorspace_1.3-2         nnet_7.3-12              gridExtra_2.2.1          DESeq2_1.14.1            sendmailR_1.2-1       
[16] graph_1.52.0             htmlTable_1.9            rtracklayer_1.34.2       caTools_1.17.1           scales_0.4.1          
[21] checkmate_1.8.2          BatchJobs_1.6            genefilter_1.56.0        RBGL_1.50.0              stringr_1.2.0         
[26] digest_0.6.12            Rsamtools_1.26.1         foreign_0.8-67           AnnotationForge_1.16.1   XVector_0.14.1        
[31] base64enc_0.1-3          htmltools_0.3.5          limma_3.30.13            htmlwidgets_0.8          RSQLite_1.1-2         
[36] BBmisc_1.11              GOstats_2.40.0           hwriter_1.3.2            BiocParallel_1.8.1       gtools_3.5.0          
[41] acepack_1.4.1            dplyr_0.5.0              RCurl_1.95-4.8           magrittr_1.5             GO.db_3.4.0           
[46] Formula_1.2-1            Matrix_1.2-8             Rcpp_0.12.10             munsell_0.4.3            stringi_1.1.3         
[51] edgeR_3.16.5             zlibbioc_1.20.0          gplots_3.0.1             fail_1.3                 plyr_1.8.4            
[56] grid_3.3.3               gdata_2.17.0             lattice_0.20-34          Biostrings_2.42.1        splines_3.3.3         
[61] GenomicFeatures_1.26.3   annotate_1.52.1          locfit_1.5-9.1           knitr_1.15.1             rjson_0.2.15          
[66] systemPipeR_1.8.1        geneplotter_1.52.0       biomaRt_2.30.0           XML_3.98-1.5             ShortRead_1.32.1      
[71] latticeExtra_0.6-28      data.table_1.10.4        gtable_0.2.0             amap_0.8-14              assertthat_0.1        
[76] ggplot2_2.2.1            xtable_1.8-2             survival_2.41-2          tibble_1.2               GenomicAlignments_1.10.1
[81] AnnotationDbi_1.36.2     memoise_1.0.0            cluster_2.0.6            brew_1.0-6               GSEABase_1.36.0

Many thanks,

Tim.

diffbind heatmap • 89 views
ADD COMMENT
This thread is not open. No new answers may be added
Traffic: 1133 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6