Handle URI:
http://hdl.handle.net/10754/601400
Title:
Deep Learning and Applications in Computational Biology
Authors:
Zeng, Jianyang
Abstract:
RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this work, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.
Conference/Event name:
KAUST Research Conference on Computational and Experimental Interfaces of Big Data and Biotechnology
Issue Date:
26-Jan-2016
Type:
Presentation
Appears in Collections:
KAUST Research Conference on Computational and Experimental Interfaces of Big Data and Biotechnology, January 2016

Full metadata record

DC FieldValue Language
dc.contributor.authorZeng, Jianyangen
dc.date.accessioned2016-03-16T12:53:23Zen
dc.date.available2016-03-16T12:53:23Zen
dc.date.issued2016-01-26en
dc.identifier.urihttp://hdl.handle.net/10754/601400en
dc.description.abstractRNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this work, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.en
dc.titleDeep Learning and Applications in Computational Biologyen
dc.typePresentationen
dc.conference.dateJanuary 25-27, 2016en
dc.conference.nameKAUST Research Conference on Computational and Experimental Interfaces of Big Data and Biotechnologyen
dc.conference.locationKAUST, Thuwal, Saudi Arabiaen
dc.contributor.institutionInstitute for Interdisciplinary Information Sciences (IIIS), Tsinghua Universityen
All Items in KAUST are protected by copyright, with all rights reserved, unless otherwise indicated.