HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis
Type
Article
Authors
Kulakovskiy, Ivan V.
Vorontsov, Ilya E.
Yevshin, Ivan S.
Sharipov, Ruslan N.
Fedorova, Alla D.
Rumynskiy, Eugene I.
Medvedeva, Yulia A.
Magana-Mora, Arturo
Bajic, Vladimir B.
Papatsenko, Dmitry A.
Kolpakov, Fedor A.
Makeev, Vsevolod J.
KAUST Department
Computational Bioscience Research Center (CBRC)
Computer Science Program
Applied Mathematics and Computational Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
KAUST Grant Number
BAS/1/1606-01-01
Date
2017-11-11
Online Publication Date
2017-11-11
Print Publication Date
2018-01-04
Permanent link to this record
http://hdl.handle.net/10754/626157
Abstract
We present a major update of the HOCOMOCO collection that consists of patterns describing DNA binding specificities for human and mouse transcription factors. In this release, we profited from a nearly doubled volume of published in vivo experiments on transcription factor (TF) binding to expand the repertoire of binding models, replace low-quality models previously based on in vitro data only and cover more than a hundred TFs with previously unknown binding specificities. This was achieved by systematic motif discovery from more than five thousand ChIP-Seq experiments uniformly processed within the BioUML framework with several ChIP-Seq peak calling tools and aggregated in the GTRD database. HOCOMOCO v11 contains binding models for 453 mouse and 680 human transcription factors and includes 1302 mononucleotide and 576 dinucleotide position weight matrices, which describe primary binding preferences of each transcription factor and reliable alternative binding specificities. An interactive interface and bulk downloads are available on the web: http://hocomoco.autosome.ru and http://www.cbrc.kaust.edu.sa/hocomoco11. In this release, we complement HOCOMOCO by MoLoTool (Motif Location Toolbox, http://molotool.autosome.ru) that applies HOCOMOCO models for visualization of binding sites in short DNA sequences.
Citation
Kulakovskiy IV, Vorontsov IE, Yevshin IS, Sharipov RN, Fedorova AD, et al. (2017) HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis. Nucleic Acids Research. Available: http://dx.doi.org/10.1093/nar/gkx1106.
Sponsors
The project was primarily supported by Russian Science Foundation [17-74-10188 to I.V.K.]; A.M.M. and V.B.B. were supported by King Abdullah University of Science and Technology (KAUST) [baseline fund BAS/1/1606-01-01 of V.B.B.]; I.E.V. was personally supported by the Skoltech Systems Biology Fellowship. Funding for open access charge: Russian Science Foundation [17-74-10188 to I.V.K.].
Publisher
Oxford University Press (OUP)
Journal
Nucleic Acids Research
PubMed ID
29140464
Additional Links
https://academic.oup.com/nar/article/doi/10.1093/nar/gkx1106/4616875
DOI
10.1093/nar/gkx1106
Except where otherwise noted, this item's license is described as: "This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com"
Related articles
- HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models.
- Authors: Kulakovskiy IV, Vorontsov IE, Yevshin IS, Soboleva AV, Kasianov AS, Ashoor H, Ba-Alawi W, Bajic VB, Medvedeva YA, Kolpakov FA, Makeev VJ
- Issue date: 2016 Jan 4
- GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments.
- Authors: Yevshin I, Sharipov R, Valeev T, Kel A, Kolpakov F
- Issue date: 2017 Jan 4
- HOCOMOCO: a comprehensive collection of human transcription factor binding sites models.
- Authors: Kulakovskiy IV, Medvedeva YA, Schaefer U, Kasianov AS, Vorontsov IE, Bajic VB, Makeev VJ
- Issue date: 2013 Jan
- GTRD: a database on gene transcription regulation-2019 update.
- Authors: Yevshin I, Sharipov R, Kolmykov S, Kondrakhin Y, Kolpakov F
- Issue date: 2019 Jan 8
- A novel method for improved accuracy of transcription factor binding site prediction.
- Authors: Khamis AM, Motwalli O, Oliva R, Jankovic BR, Medvedeva YA, Ashoor H, Essack M, Gao X, Bajic VB
- Issue date: 2018 Jul 6