Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs.
Naser, Rayan Mohammad Mahmoud
Momin, Afaque Ahmad Imtiyaz
Walkiewicz, Katarzyna Wiktoria
Canlas, Christian G
Ali, Amal J.
Bajic, Vladimir B.
Arold, Stefan T.
KAUST DepartmentApplied Mathematics and Computational Science Program
Biological and Environmental Sciences and Engineering (BESE) Division
Computational Bioscience Research Center (CBRC)
Computer Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Structural Biology and Engineering
Structural and Functional Bioinformatics Group
KAUST Grant NumberURF/1/1976-04
Preprint Posting Date2018-03-11
Permanent link to this recordhttp://hdl.handle.net/10754/658612
MetadataShow full item record
AbstractMOTIVATION:Leucine-aspartic acid (LD) motifs are short linear interaction motifs (SLiMs) that link paxillin family proteins to factors controlling cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. RESULTS:To enable a proteome-wide assessment of LD motifs, we developed an active-learning based framework (LDmotif finder; LDMF) that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome revealed a dozen new proteins containing LD motifs. We found that LD motif signalling evolved in unicellular eukaryotes more than 800 Myr ago, with paxillin and vinculin as core constituents, and nuclear export signal (NES) as a likely source of de novo LD motifs. We show that LD motif proteins form a functionally homogenous group, all being involved in cell morphogenesis and adhesion. This functional focus is recapitulated in cells by GFP-fused LD motifs, suggesting that it is intrinsic to the LD motif sequence, possibly through their effect on binding partners. Our approach elucidated the origin and dynamic adaptations of an ancestral SLiM, and can serve as a guide for the identification of other SLiMs for which only few representatives are known. AVAILABILITY:LDMF is freely available online at www.cbrc.kaust.edu.sa/ldmf; Source code is available at https://github.com/tanviralambd/LD/. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.
CitationAlam, T., Alazmi, M., Naser, R., Huser, F., Momin, A. A., Astro, V., … Arold, S. T. (2019). Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs. Bioinformatics. doi:10.1093/bioinformatics/btz703
SponsorsWe acknowledge SOLEIL for provision of synchrotron radiation facilities for testing of FAT:LD motif peptide crystals. We thank M. Savko, W. Shepard, S. Sirigu, L. Chavas and P. Legrand for assistance in using beamlines PX1 and PX2A. We thank R. Höhndorf for advice with the GO analysis, J. Hanks, C. Kapfer and A. Hungler for help with computing at KAUST. We acknowledge support from the KAUST Imaging and Characterization Core Lab, the Bioscience Core Lab and Research Computing Core lab.
This research used the resources of the Supercomputing Laboratory at KAUST, and was supported by KAUST through the baseline fund and the Award No. URF/1/1976-04, URF/1/1976-06, URF/1/3007-01, URF/1/1976-02, BAS/1/1606-01-01 and #OSR-2015- CRG4-2602 from the Office of Sponsored Research (OSR).
PublisherOxford University Press (OUP)
RelationsIs Supplemented By:
Title: tanviralambd/LD: Source code for LD motif detection. Publication Date: 2019-06-28. github: tanviralambd/LD Handle: 10754/666977
CollectionsArticles; Biological and Environmental Science and Engineering (BESE) Division; Bioscience Program; Applied Mathematics and Computational Science Program; Structural and Functional Bioinformatics Group; Computer Science Program; Computational Bioscience Research Center (CBRC); Statistics Program; Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
Except where otherwise noted, this item's license is described as This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.