Entering edit mode
André Faure
▴
30
@andre-faure-6389
Last seen 10.2 years ago
Dear all,
I'm using the bioconductor package "Rcade" to determine likely direct
targets of a transcription factor. I have corresponding ChIP-seq data
(one ChIP and one Input sample, no replicates unfortunately) and
microarray gene expression data before/after forced expression of the
TF.
I'm considering all genes in the mouse genome (mm9) and ChIP-seq bins
centred on their canonical TSSs (+/-2.5kb). Independent peak-calling
analysis shows the TF does indeed occupy active promoters.
However the Rcade results show very low posterior probabilities for
"active" binding (maximum p.ChIP = 0.16). The B-values for the
combined hypothesis (i.e. active binding and differential expression:
B.ChIP.DE) are therefore also low:
> x <- getRcade(Rcade)
> x[1:5,]
geneID logfc.DE B.DE id.DE symbol.DE
M.ChIP
19937 ENSMUSG00000069008 -0.5714092 6.701309 10601390 Gm5537
-2.892738
20175 ENSMUSG00000070141 0.6631557 4.626032 10398396 Mir494
-3.115882
22435 ENSMUSG00000076145 0.6631557 4.626032 10398396 Mir679
-3.115882
23445 ENSMUSG00000080411 0.6631557 4.626032 10398396 Mir1193
-3.115882
24261 ENSMUSG00000083380 -0.5012910 3.177634 10462193 Gm3244
-2.892738
A.ChIP log.p.ChIP B.ChIP p.DE p.ChIP B.nothing
B.DE.only
19937 -23.98005 -1.819439 -1.642562 0.9987722 0.1621167 -6.878385
1.635008
20175 -23.75691 -1.845462 -1.673543 0.9903014 0.1579523 -4.799496
1.613386
22435 -23.75691 -1.845462 -1.673543 0.9903014 0.1579523 -4.799496
1.613386
23445 -23.75691 -1.845462 -1.673543 0.9903014 0.1579523 -4.799496
1.613386
24261 -23.98005 -1.819439 -1.642562 0.9599839 0.1621167 -3.361246
1.413735
B.ChIP.only B.ChIP.DE
19937 -8.521777 -1.644028
20175 -6.479707 -1.685107
22435 -6.479707 -1.685107
23445 -6.479707 -1.685107
24261 -5.031403 -1.691113
Am I doing something wrong / misunderstanding the output or is this
simply related to the absence of replicates for the ChIP-seq data?
Any help or ideas would be very much appreciated.
Many thanks.
André Faure
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom
> sessionInfo()
R version 2.15.3 (2013-03-01)
Platform: x86_64-unknown-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_GB.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_GB.UTF-8 LC_COLLATE=en_GB.UTF-8
[5] LC_MONETARY=en_GB.UTF-8 LC_MESSAGES=en_GB.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] Rcade_1.0.0 Rsamtools_1.10.2 Biostrings_2.26.3
[4] baySeq_1.12.0 GenomicRanges_1.10.7 IRanges_1.16.6
[7] BiocGenerics_0.4.0
loaded via a namespace (and not attached):
[1] bitops_1.0-6 parallel_2.15.3 stats4_2.15.3 tools_2.15.3
[5] zlibbioc_1.4.0
[[alternative HTML version deleted]]