Isoform-Disease Association Prediction by Data Fusion

Abstract
Alternative splicing enables a gene to be spliced into different isoforms, which are closely related to diverse developmental abnormalities. Identifying isoform-disease associations helps to uncover the underlying pathology of various complex diseases and to develop precise treatments and drugs for them. Although many approaches have been proposed for predicting gene-disease associations and isoform functions, few efforts have been made toward predicting isoform-disease associations at large scale; the main bottleneck is the lack of ground-truth isoform-disease associations. To bridge this gap, we propose IDAPred, a computational approach inspired by multi-instance learning that fuses genomics and transcriptomics data for isoform-disease association prediction. Given the bag-instance relationship between a gene and its spliced isoforms, IDAPred introduces a dispatch and aggregation term to dispatch gene-disease associations to individual isoforms and, in reverse, to aggregate these dispatched associations to the affiliated genes. Next, it fuses different genomics and transcriptomics data to replenish gene-disease associations and to induce a linear classifier for predicting isoform-disease associations in a coherent way. In addition, to alleviate the bias toward observed gene-disease associations, it adds a regularization term to differentiate the currently observed associations from the unobserved (potential) ones. Experimental results show that IDAPred significantly outperforms the related state-of-the-art methods.
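
The bag-instance relationship described in the abstract can be illustrated with a small sketch. This is not the IDAPred objective, which the abstract does not spell out; it only shows one common multi-instance convention, assuming that a gene's labels are copied to all of its isoforms as candidates (dispatch) and that a gene's score is the maximum over its isoforms' scores (aggregation). The function names `dispatch` and `aggregate`, the max-pooling rule, and the toy matrices are all hypothetical.

```python
import numpy as np

def dispatch(gene_disease, membership):
    """Copy each gene's disease labels to its isoforms as candidate labels.

    gene_disease: (n_genes, n_diseases) binary matrix (assumed input).
    membership:   (n_genes, n_isoforms) binary matrix, 1 if the isoform
                  is spliced from the gene (assumed input).
    """
    return membership.T @ gene_disease  # (n_isoforms, n_diseases)

def aggregate(isoform_scores, membership):
    """Gene-level score = max over the scores of its isoforms.

    Max pooling is an assumption taken from standard multi-instance
    learning, not a rule stated in the abstract.
    """
    n_genes = membership.shape[0]
    out = np.zeros((n_genes, isoform_scores.shape[1]))
    for g in range(n_genes):
        idx = membership[g] > 0
        if idx.any():
            out[g] = isoform_scores[idx].max(axis=0)
    return out

# Toy data: gene 0 has isoforms 0 and 1, gene 1 has isoform 2.
membership = np.array([[1, 1, 0],
                       [0, 0, 1]])
gene_disease = np.array([[1, 0],
                         [0, 1]])
candidates = dispatch(gene_disease, membership)

# Hypothetical isoform-level scores, e.g. from some linear classifier.
isoform_scores = np.array([[0.9, 0.1],
                           [0.2, 0.3],
                           [0.4, 0.8]])
gene_scores = aggregate(isoform_scores, membership)
```

In this toy setting, both isoforms of gene 0 inherit its candidate label for disease 0, and the aggregated gene-level scores can be compared against the observed gene-disease matrix, which is the only ground truth available at large scale.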

Citation
Huang, Q., Wang, J., Zhang, X., & Yu, G. (2020). Isoform-Disease Association Prediction by Data Fusion. Lecture Notes in Computer Science, 44–55. doi:10.1007/978-3-030-57821-3_5

Acknowledgements
This research is supported by NSFC (61872300), the Fundamental Research Funds for the Central Universities (XDJK2019B024 and XDJK2020B028), and the Natural Science Foundation of CQ CSTC (cstc2018jcyjAX0228).

Publisher
Springer Nature

Conference/Event Name
16th International Symposium on Bioinformatics Research and Applications, ISBRA 2020

DOI
10.1007/978-3-030-57821-3_5

Additional Links
http://link.springer.com/10.1007/978-3-030-57821-3_5
