Deep learning identifies and quantifies recombination hotspot determinants
dc.contributor.author | Li, Yu | |
dc.contributor.author | Chen, Siyuan | |
dc.contributor.author | Rapakoulia, Trisevgeni | |
dc.contributor.author | Kuwahara, Hiroyuki | |
dc.contributor.author | Yip, Kevin Y | |
dc.contributor.author | Gao, Xin | |
dc.date.accessioned | 2022-04-13T09:10:53Z | |
dc.date.available | 2022-04-13T09:10:53Z | |
dc.date.issued | 2022-04-12 | |
dc.identifier.citation | Li, Y., Chen, S., Rapakoulia, T., Kuwahara, H., Yip, K. Y., & Gao, X. (2022). Deep learning identifies and quantifies recombination hotspot determinants. Bioinformatics. https://doi.org/10.1093/bioinformatics/btac234 | |
dc.identifier.issn | 1367-4803 | |
dc.identifier.issn | 1460-2059 | |
dc.identifier.pmid | 35412589 | |
dc.identifier.doi | 10.1093/bioinformatics/btac234 | |
dc.identifier.uri | http://hdl.handle.net/10754/676242 | |
dc.description.abstract | Motivation Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes, and species. Results RHSNet can significantly outperforms other sequence-based methods on multiple datasets across different species, sexes, and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification, and GC content. Further cross-sex, cross-population, and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs. | |
dc.description.sponsorship | Figure 1 is created by Heno Hwang, a scientific illustrator at King Abdullah University of Science and Technology (KAUST). | |
dc.description.sponsorship | Supported by the KAUST Office of Sponsored Research(OSR) | |
dc.publisher | Oxford University Press (OUP) | |
dc.relation.url | https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btac234/6567357 | |
dc.rights | Archived with thanks to Bioinformatics under a Creative Commons license, details at: https://creativecommons.org/licenses/by-nc/4.0/ | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc/4.0/ | |
dc.title | Deep learning identifies and quantifies recombination hotspot determinants | |
dc.type | Article | |
dc.contributor.department | Computer Science | |
dc.contributor.department | Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division | |
dc.contributor.department | Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia | |
dc.contributor.department | KAUST Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology | |
dc.contributor.department | Computational Bioscience Research Center (CBRC) | |
dc.contributor.department | Computer Science Program | |
dc.identifier.journal | Bioinformatics | |
dc.eprint.version | Publisher's Version/PDF | |
dc.contributor.institution | Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China | |
dc.contributor.institution | The CUHK Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China | |
dc.contributor.institution | Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany | |
kaust.person | Li, Yu | |
kaust.person | Chen, Siyuan | |
kaust.person | Kuwahara, Hiroyuki | |
kaust.person | Gao, Xin | |
kaust.grant.number | BAS/1/1624-01 | |
kaust.grant.number | FCC/1/1976-23-01 | |
kaust.grant.number | FCC/1/1976-26-01 | |
kaust.grant.number | REI/1/0018-01-01 | |
kaust.grant.number | REI/1/4216-01-01 | |
kaust.grant.number | REI/1/4437-01-01 | |
kaust.grant.number | REI/1/4473-01-01 | |
kaust.grant.number | URF/1/4098-01-01 | |
kaust.grant.number | REI/1/4742-01-01 | |
dc.relation.issupplementedby | github:frankchen121212/RHSNet | |
refterms.dateFOA | 2022-04-13T09:19:28Z | |
display.relations | <b>Is Supplemented By:</b><br/> <ul><li><i>[Software]</i> <br/> Title: frankchen121212/RHSNet: Tensorflow and Keras implementation of RHSNet for recombination hotspot identification and quantification. Publication Date: 2021-05-14. github: <a href="https://github.com/frankchen121212/RHSNet" >frankchen121212/RHSNet</a> Handle: <a href="http://hdl.handle.net/10754/676686" >10754/676686</a></a></li></ul> | |
kaust.acknowledged.supportUnit | Office of Sponsored Research(OSR) |
Files in this item
This item appears in the following Collection(s)
-
Articles
-
Computer Science Program
For more information visit: https://cemse.kaust.edu.sa/cs -
Computational Bioscience Research Center (CBRC)
-
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/