Few-shot learning for classification of novel macromolecular structures in cryo-electron tomograms
dc.contributor.author | Li, Ran | |
dc.contributor.author | Yu, Liangyong | |
dc.contributor.author | Zhou, Bo | |
dc.contributor.author | Zeng, Xiangrui | |
dc.contributor.author | Wang, Zhenyu | |
dc.contributor.author | Yang, Xiaoyan | |
dc.contributor.author | Zhang, Jing | |
dc.contributor.author | Gao, Xin | |
dc.contributor.author | Jiang, Rui | |
dc.contributor.author | Xu, Min | |
dc.date.accessioned | 2020-11-12T05:47:45Z | |
dc.date.available | 2020-11-12T05:47:45Z | |
dc.date.issued | 2020-11-11 | |
dc.date.submitted | 2020-04-13 | |
dc.identifier.citation | Li, R., Yu, L., Zhou, B., Zeng, X., Wang, Z., Yang, X., … Xu, M. (2020). Few-shot learning for classification of novel macromolecular structures in cryo-electron tomograms. PLOS Computational Biology, 16(11), e1008227. doi:10.1371/journal.pcbi.1008227 | |
dc.identifier.issn | 1553-7358 | |
dc.identifier.doi | 10.1371/journal.pcbi.1008227 | |
dc.identifier.uri | http://hdl.handle.net/10754/665918 | |
dc.description.abstract | Cryo-electron tomography (cryo-ET) provides 3D visualization of subcellular components in the near-native state and at sub-molecular resolutions in single cells, demonstrating an increasingly important role in structural biology in situ. However, systematic recognition and recovery of macromolecular structures in cryo-ET data remain challenging as a result of low signal-to-noise ratio (SNR), small sizes of macromolecules, and high complexity of the cellular environment. Subtomogram structural classification is an essential step for such a task. Although acquiring large numbers of subtomograms is no longer an obstacle due to advances in automation of data collection, obtaining the same number of structural labels is both computationally and labor intensive. On the other hand, existing deep-learning-based supervised classification approaches are highly demanding of labeled data and have limited ability to learn new structures rapidly from data containing very few labels of such structures. In this work, we propose a novel approach for subtomogram classification based on few-shot learning. With our approach, structures unseen in the training data can be classified through instance embedding, given only a few labeled samples in the test data. Experiments were performed on both simulated and real datasets. Our experimental results show that we can make inferences on new structures given only five labeled samples for each class with a competitive accuracy (> 0.86 on the simulated dataset with SNR = 0.1), or even one sample with an accuracy of 0.7644. The results on real datasets are also promising, with accuracy > 0.9 under both conditions and even up to 1 on one of the real datasets. Our approach achieves significant improvement compared with the baseline method and has strong capabilities of generalizing to other cellular components. | |
dc.description.sponsorship | This work was supported in part by U.S. National Institutes of Health (NIH) grants P41GM103712 and R01GM134020, U.S. National Science Foundation (NSF) grants DBI-1949629 and IIS-2007595, and Mark Foundation 19-044-ASP. XZ was supported by a fellowship from Carnegie | |
dc.publisher | Public Library of Science (PLoS) | |
dc.relation.url | https://dx.plos.org/10.1371/journal.pcbi.1008227 | |
dc.rights | This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.title | Few-shot learning for classification of novel macromolecular structures in cryo-electron tomograms | |
dc.type | Article | |
dc.contributor.department | Computational Bioscience Research Center (CBRC) | |
dc.contributor.department | Computer Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.contributor.department | Structural and Functional Bioinformatics Group | |
dc.identifier.journal | PLOS Computational Biology | |
dc.eprint.version | Publisher's Version/PDF | |
dc.contributor.institution | Department of Automation, Tsinghua University, Beijing, China. | |
dc.contributor.institution | Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA. | |
dc.contributor.institution | Department of Biomedical Engineering, Yale University, New Haven, CT, USA. | |
dc.contributor.institution | Department of Computer Science, University of California Irvine, Irvine, CA, USA. | |
dc.identifier.volume | 16 | |
dc.identifier.issue | 11 | |
dc.identifier.pages | e1008227 | |
kaust.person | Gao, Xin | |
dc.date.accepted | 2020-08-08 | |
dc.relation.issupplementedby | github:xulabs/aitom | |
refterms.dateFOA | 2020-11-12T05:49:14Z | |
display.relations | Is Supplemented By: [Software] Title: xulabs/aitom: AI for tomography. Publication Date: 2019-06-13. GitHub: https://github.com/xulabs/aitom. Handle: http://hdl.handle.net/10754/667784 |
This item appears in the following Collection(s)
- Articles
- Structural and Functional Bioinformatics Group
  For more information visit: https://sfb.kaust.edu.sa/Pages/Home.aspx
- Computer Science Program
  For more information visit: https://cemse.kaust.edu.sa/cs
- Computational Bioscience Research Center (CBRC)
- Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
  For more information visit: https://cemse.kaust.edu.sa/