Question: RefNet: source/data preparation used for Gerstein et al (2012) human TF data?
0
gravatar for Keith Hughitt
4.2 years ago by
Keith Hughitt120
United States
Keith Hughitt120 wrote:

Hello,

Does anyone happen to know what source / processing was used to construct the "gerstein-2012" annotations in the RefNet package?


It appears that the network includes 6896 edges:

    library('RefNet')
    refnet = RefNet()
    ixns = interactions(refnet, species="9606", provider=c("gerstein-2012"))
    nrow(ixns) # 6895

I would have suspected that the data used to construct the annotations would have come from http://encodenets.gersteinlab.org/, but none of the correpsonding files there are of a similar size:

wc -l *

   26070 enets2.Proximal_filtered.txt
   19258 enets3.Distal.txt

Is there another source that I've missed? Or was some additional processing down on the dataset resulting in a subset of the original edges?

Keith

 

annotationhub refnet • 566 views
ADD COMMENTlink modified 4.2 years ago by pshannon90 • written 4.2 years ago by Keith Hughitt120
Answer: RefNet: source/data preparation used for Gerstein et al (2012) human TF data?
0
gravatar for pshannon
4.2 years ago by
pshannon90
United States
pshannon90 wrote:

Hi Keith,

The RefNet gerstein-2012 data interactions are from 

 http://archive.gersteinlab.org/proj/Hierarchy_Rewiring/PNAS_hier/Hs_Tr.txt

If other (and possibly more recent) interaction data sets are of compelling interest, let us know.  The 4 which are built in to RefNet now ('native'; they live in the AnnotationHub) rather than from PSICQUIC were chosen for their contrasting nature and origin -- not for their comprehensiveness.

R> show(refnet)
RefNet object with 25 providers in 2 classes
| provider class 'native':
|     gerstein-2012
|     hypoxiaSignaling-2006
|     stamlabTFs-2012
|     recon202
| provider class 'PSICQUIC':
|     BioGrid
|     bhf-ucl
| ...


 - Paul
ADD COMMENTlink written 4.2 years ago by pshannon90

Hi Paul,

Thanks for the response and clarification. I think that the dataset you are using is actually from an earlier paper out of the Gerstein lab -- "Rewiring of Transcriptional Regulatory Networks: Hierarchy, Rather Than Connectivity, Better Reflects the Importance of Regulators" (2010).

The datasets associated with the ENCODE paper are on http://encodenets.gersteinlab.org/.

I could see both being useful for some people, so it probably wouldn't hurt to include both the datasets. It might also be worth it to include the source of each of the datasets in the documentation. To be even more explicit, you could even include the scripts you used to generate each of the external RData-based providers.

Thanks for your work putting this useful package together!

All the best,

Keith

ADD REPLYlink written 4.2 years ago by Keith Hughitt120
Answer: RefNet: source/data preparation used for Gerstein et al (2012) human TF data?
0
gravatar for pshannon
4.2 years ago by
pshannon90
United States
pshannon90 wrote:

Thanks, Keith: good catch.  I'll update the pmid in RefNet.

  - Paul

ADD COMMENTlink written 4.2 years ago by pshannon90
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 215 users visited in the last hour