Predicting Candidate Genes From Phenotypes, Functions, And Anatomical Site Of Expression.
Type
ArticleKAUST Department
Bio-Ontology Research Group (BORG)Computational Bioscience Research Center (CBRC)
Computer Science
Computer Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
KAUST Grant Number
FCC/1/1976-08-01URF/1/3454-01-01
URF/1/3790-01-01
Date
2020-10-14Online Publication Date
2020-10-14Print Publication Date
2021-05-05Submitted Date
2020-03-30Permanent link to this record
http://hdl.handle.net/10754/662446
Metadata
Show full item recordAbstract
MOTIVATION:Over the past years, many computational methods have been developed to incorporate information about phenotypes for disease gene prioritization task. These methods generally compute the similarity between a patient's phenotypes and a database of gene-phenotype to find the most phenotypically similar match. The main limitation in these methods is their reliance on knowledge about phenotypes associated with particular genes, which is not complete in humans as well as in many model organisms such as the mouse and fish. Information about functions of gene products and anatomical site of gene expression is available for more genes and can also be related to phenotypes through ontologies and machine learning models. RESULTS:We developed a novel graph-based machine learning method for biomedical ontologies which is able to exploit axioms in ontologies and other graph-structured data. Using our machine learning method, we embed genes based on their associated phenotypes, functions of the gene products, and anatomical location of gene expression. We then develop a machine learning model to predict gene-disease associations based on the associations between genes and multiple biomedical ontologies, and this model significantly improves over state of the art methods. Furthermore, we extend phenotype-based gene prioritization methods significantly to all genes which are associated with phenotypes, functions, or site of expression. AVAILABILITY:Software and data are available at https://github.com/bio-ontology-research-group/DL2Vec.Citation
Chen, J., Althagafi, A., & Hoehndorf, R. (2020). Predicting Candidate Genes From Phenotypes, Functions, And Anatomical Site Of Expression. Bioinformatics. doi:10.1093/bioinformatics/btaa879Sponsors
This work was supported by funding from King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) under Award No. URF/1/3454-01-01, URF/1/3790-01-01, FCC/1/1976-08-01, and FCC/1/1976-08-08.Publisher
Oxford University Press (OUP)Journal
BioinformaticsPubMed ID
33051643Additional Links
https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btaa879/5922810ae974a485f413a2113503eed53cd6c53
10.1093/bioinformatics/btaa879
Scopus Count
Except where otherwise noted, this item's license is described as This is an Open Access article distributed under the terms of the Creative Commons Attribution License which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Related articles
- mOWL: Python library for machine learning with biomedical ontologies.
- Authors: Zhapa-Camacho F, Kulmanov M, Hoehndorf R
- Issue date: 2023 Jan 1
- Formal axioms in biomedical ontologies improve analysis and interpretation of associated data.
- Authors: Smaili FZ, Gao X, Hoehndorf R
- Issue date: 2020 Apr 1
- OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction.
- Authors: Smaili FZ, Gao X, Hoehndorf R
- Issue date: 2019 Jun 1
- Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes.
- Authors: Alshahrani M, Hoehndorf R
- Issue date: 2018 Sep 1
- Semantic similarity and machine learning with ontologies.
- Authors: Kulmanov M, Smaili FZ, Gao X, Hoehndorf R
- Issue date: 2021 Jul 20