SupportNet: a novel incremental learning framework through deep learning and support data
Type
PreprintKAUST Department
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) DivisionComputer Science Program
Computational Bioscience Research Center (CBRC)
Date
2018-05-08Permanent link to this record
http://hdl.handle.net/10754/627906
Metadata
Show full item recordAbstract
Motivation: In most biological data sets, the amount of data is regularly growing and the number of classes is continuously increasing. To deal with the new data from the new classes, one approach is to train a classification model, e.g., a deep learning model, from scratch based on both old and new data. This approach is highly computationally costly and the extracted features are likely very different from the ones extracted by the model trained on the old data alone, which leads to poor model robustness. Another approach is to fine tune the trained model from the old data on the new data. However, this approach often does not have the ability to learn new knowledge without forgetting the previously learned knowledge, which is known as the catastrophic forgetting problem. To our knowledge, this problem has not been studied in the field of bioinformatics despite its existence in many bioinformatic problems. Results: Here we propose a novel method, SupportNet, to solve the catastrophic forgetting problem efficiently and effectively. SupportNet combines the strength of deep learning and support vector machine (SVM), where SVM is used to identify the support data from the old data, which are fed to the deep learning model together with the new data for further training so that the model can review the essential information of the old data when learning the new information. Two powerful consolidation regularizers are applied to ensure the robustness of the learned model. Comprehensive experiments on various tasks, including enzyme function prediction, subcellular structure classification and breast tumor classification, show that SupportNet drastically outperforms the state-of-the-art incremental learning methods and reaches similar performance as the deep learning model trained from scratch on both old and new data. Availability: Our program is accessible at: \url{https://github.com/lykaust15/SupportNet}.Citation
Li Y, Li Z, Ding L, Hu Y, Chen W, et al. (2018) SupportNet: a novel incremental learning framework through deep learning and support data. Available: http://dx.doi.org/10.1101/317578.Sponsors
This work was supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) under Awards No. FCC/1/1976-04, URF/1/2602-01, URF/1/3007- 01, URF/1/3412-01, URF/1/3450-01 and URF/1/3454-01. W.C. was supported by Basic Research Grant from Science and Technology Innovation Commission of Shenzhen Municipal Government [JCYJ20170 307105752508]. Y.H. was supported by the International Cooperation Research Grant (No. GJHZ20170310161947503) from Science and Technology Innovation Commission of Shenzhen Municipal Government.Publisher
Cold Spring Harbor LaboratoryDOI
10.1101/317578Additional Links
https://www.biorxiv.org/content/early/2018/05/08/317578ae974a485f413a2113503eed53cd6c53
10.1101/317578