Taxon and trait recognition from digitized herbarium specimens using deep convolutional neural networks
Type
ArticleAuthors
Younis, Sohaib
Weiland, Claus

Hoehndorf, Robert

Dressler, Stefan
Hickler, Thomas

Seeger, Bernhard
Schmidt, Marco

KAUST Department
Bio-Ontology Research Group (BORG)Computational Bioscience Research Center (CBRC)
Computer Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Date
2018-03-13Online Publication Date
2018-03-13Print Publication Date
2018-10-02Permanent link to this record
http://hdl.handle.net/10754/627471
Metadata
Show full item recordAbstract
Herbaria worldwide are housing a treasure of hundreds of millions of herbarium specimens, which are increasingly being digitized and thereby more accessible to the scientific community. At the same time, deep-learning algorithms are rapidly improving pattern recognition from images and these techniques are more and more being applied to biological objects. In this study, we are using digital images of herbarium specimens in order to identify taxa and traits of these collection objects by applying convolutional neural networks (CNN). Images of the 1000 species most frequently documented by herbarium specimens on GBIF have been downloaded and combined with morphological trait data, preprocessed and divided into training and test datasets for species and trait recognition. Good performance in both domains suggests substantial potential of this approach for supporting taxonomy and natural history collection management. Trait recognition is also promising for applications in functional ecology.Citation
Younis S, Weiland C, Hoehndorf R, Dressler S, Hickler T, et al. (2018) Taxon and trait recognition from digitized herbarium specimens using deep convolutional neural networks. Botany Letters: 1–7. Available: http://dx.doi.org/10.1080/23818107.2018.1446357.Sponsors
SY, MS and SD received funding from the DFG Project Mobilization of trait data from digital image files by deep learning approaches (grant 316452578). Parts of RH’s & CW’s work were funded by the National Bioscience Database Center (NBDC) and the Database Center for Life Science (DBCLS) Biohackathon 2017 grants. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the TITAN Xp GPU to CW used for this research.Publisher
Informa UK LimitedJournal
Botany LettersarXiv
1803.07892Additional Links
https://www.tandfonline.com/doi/full/10.1080/23818107.2018.1446357http://arxiv.org/pdf/1803.07892
ae974a485f413a2113503eed53cd6c53
10.1080/23818107.2018.1446357