HOCOMOCO: A comprehensive collection of human transcription factor binding sites models
AuthorsKulakovskiy, Ivan V.
Medvedeva, Yulia A.
Kasianov, Artem S.
Vorontsov, Ilya E.
Bajic, Vladimir B.
Makeev, Vsevolod J.
KAUST DepartmentComputational Bioscience Research Center (CBRC)
MetadataShow full item record
AbstractTranscription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/ hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source. The Author(s) 2012.
CitationKulakovskiy IV, Medvedeva YA, Schaefer U, Kasianov AS, Vorontsov IE, et al. (2012) HOCOMOCO: a comprehensive collection of human transcription factor binding sites models. Nucleic Acids Research 41: D195-D202. doi:10.1093/nar/gks1089.
PublisherOxford University Press (OUP)
JournalNucleic Acids Research
PubMed Central IDPMC3531053
The following license files are associated with this item:
Except where otherwise noted, this item's license is described as This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact firstname.lastname@example.org.
- HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models.
- Authors: Kulakovskiy IV, Vorontsov IE, Yevshin IS, Soboleva AV, Kasianov AS, Ashoor H, Ba-Alawi W, Bajic VB, Medvedeva YA, Kolpakov FA, Makeev VJ
- Issue date: 2016 Jan 4
- HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.
- Authors: Kulakovskiy IV, Vorontsov IE, Yevshin IS, Sharipov RN, Fedorova AD, Rumynskiy EI, Medvedeva YA, Magana-Mora A, Bajic VB, Papatsenko DA, Kolpakov FA, Makeev VJ
- Issue date: 2018 Jan 4
- Transcription Factor Information System (TFIS): A Tool for Detection of Transcription Factor Binding Sites.
- Authors: Narad P, Kumar A, Chakraborty A, Patni P, Sengupta A, Wadhwa G, Upadhyaya KC
- Issue date: 2017 Sep
- A novel method for improved accuracy of transcription factor binding site prediction.
- Authors: Khamis AM, Motwalli O, Oliva R, Jankovic BR, Medvedeva YA, Ashoor H, Essack M, Gao X, Bajic VB
- Issue date: 2018 Jul 6
- AthaMap-assisted transcription factor target gene identification in Arabidopsis thaliana.
- Authors: Bülow L, Brill Y, Hehl R
- Issue date: 2010
Showing items related by title, author, creator and subject.
Harnessing CRISPR/Cas systems for programmable transcriptional and post-transcriptional regulationMahas, Ahmed; Neal Stewart, C.; Mahfouz, Magdy M. (Elsevier BV, 2017-11-29)Genome editing has enabled broad advances and novel approaches in studies of gene function and structure; now, emerging methods aim to precisely engineer post-transcriptional processes. Developing precise, efficient molecular tools to alter the transcriptome holds great promise for biotechnology and synthetic biology applications. Different approaches have been employed for targeted degradation of RNA species in eukaryotes, but they lack programmability and versatility, thereby limiting their utility for diverse applications. The CRISPR/Cas9 system has been harnessed for genome editing in many eukaryotic species and, using a catalytically inactive Cas9 variant, the CRISPR/dCas9 system has been repurposed for transcriptional regulation. Recent studies have used other CRISPR/Cas systems for targeted RNA degradation and RNA-based manipulations. For example, Cas13a, a Type VI-A endonuclease, has been identified as an RNA-guided RNA ribonuclease and used for manipulation of RNA. Here, we discuss different modalities for targeted RNA interference with an emphasis on the potential applications of CRISPR/Cas systems as programmable transcriptional regulators for broad uses, including functional biology, biotechnology, and synthetic biology applications.
TcoF-DB v2: update of the database of human and mouse transcription co-factors and transcription factor interactionsSchmeier, Sebastian; Alam, Tanvir; Essack, Magbubah; Bajic, Vladimir B. (Oxford University Press (OUP), 2016-10-17)Transcription factors (TFs) play a pivotal role in transcriptional regulation, making them crucial for cell survival and important biological functions. For the regulation of transcription, interactions of different regulatory proteins known as transcription co-factors (TcoFs) and TFs are essential in forming necessary protein complexes. Although TcoFs themselves do not bind DNA directly, their influence on transcriptional regulation and initiation, although indirect, has been shown to be significant, with the functionality of TFs strongly influenced by the presence of TcoFs. In the TcoF-DB v2 database, we collect information on TcoFs. In this article, we describe updates and improvements implemented in TcoF-DB v2. TcoF-DB v2 provides several new features that enables exploration of the roles of TcoFs. The content of the database has significantly expanded, and is enriched with information from Gene Ontology, biological pathways, diseases and molecular signatures. TcoF-DB v2 now includes many more TFs; has substantially increased the number of human TcoFs to 958, and now includes information on mouse (418 new TcoFs). TcoF-DB v2 enables the exploration of information on TcoFs and allows investigations into their influence on transcriptional regulation in humans and mice. TcoF-DB v2 can be accessed at http://tcofdb.org/.
TcoF-DB: dragon database for human transcription co-factors and transcription factor interacting proteinsSchaefer, Ulf; Schmeier, Sebastian; Bajic, Vladimir B. (Oxford University Press (OUP), 2010-10-21)The initiation and regulation of transcription in eukaryotes is complex and involves a large number of transcription factors (TFs), which are known to bind to the regulatory regions of eukaryotic DNA. Apart from TF-DNA binding, protein-protein interaction involving TFs is an essential component of the machinery facilitating transcriptional regulation. Proteins that interact with TFs in the context of transcription regulation but do not bind to the DNA themselves, we consider transcription co-factors (TcoFs). The influence of TcoFs on transcriptional regulation and initiation, although indirect, has been shown to be significant with the functionality of TFs strongly influenced by the presence of TcoFs. While the role of TFs and their interaction with regulatory DNA regions has been well-studied, the association between TFs and TcoFs has so far been given less attention. Here, we present a resource that is comprised of a collection of human TFs and the TcoFs with which they interact. Other proteins that have a proven interaction with a TF, but are not considered TcoFs are also included. Our database contains 157 high-confidence TcoFs and additionally 379 hypothetical TcoFs. These have been identified and classified according to the type of available evidence for their involvement in transcriptional regulation and their presence in the cell nucleus. We have divided TcoFs into four groups, one of which contains high-confidence TcoFs and three others contain TcoFs which are hypothetical to different extents. We have developed the Dragon Database for Human Transcription Co-Factors and Transcription Factor Interacting Proteins (TcoF-DB). A web-based interface for this resource can be freely accessed at http://cbrc.kaust.edu.sa/tcof/ and http://apps.sanbi.ac.za/tcof/. © The Author(s) 2010.