Does Head Label Help for Long-Tailed Multi-Label Text Classification
dc.contributor.author | Xiao, Lin | |
dc.contributor.author | Zhang, Xiangliang | |
dc.contributor.author | Jing, Liping | |
dc.contributor.author | Huang, Chi | |
dc.contributor.author | Song, Mingyang | |
dc.date.accessioned | 2021-09-20T07:04:58Z | |
dc.date.available | 2021-02-04T11:39:08Z | |
dc.date.available | 2021-09-20T07:04:58Z | |
dc.date.issued | 2021 | |
dc.identifier.issn | 2374-3468 | |
dc.identifier.issn | 2159-5399 | |
dc.identifier.uri | http://hdl.handle.net/10754/667224 | |
dc.description.abstract | Multi-label text classification (MLTC) aims to annotate documents with the most relevant labels from a number of candidate labels. In real applications, the distribution of label frequency often exhibits a long tail, i.e., a few labels are associated with a large number of documents (a.k.a. head labels), while a large fraction of labels are associated with a small number of documents (a.k.a. tail labels). To address the challenge of insufficient training data on tail label classification, we propose a Head-to-Tail Network (HTTN) to transfer the meta-knowledge from the data-rich head labels to data-poor tail labels. The meta-knowledge is the mapping from few-shot network parameters to many-shot network parameters, which aims to promote the generalizability of tail classifiers. Extensive experimental results on three benchmark datasets demonstrate that HTTN consistently outperforms the state-of-the-art methods. The code and hyper-parameter settings are released for reproducibility(1). | |
dc.description.sponsorship | This work was supported in part by the National Natural Science Foundation of China under Grant 61822601, 61773050, 61632004 and 61828302; The Beijing Natural Science Foundation under Grant Z180006; The National Key Research and Development Program of China under Grant 2020AAA0106800 and 2017YFC1703506; The Fundamental Research Funds for the Central Universities (2019JBZ110); And King Abdullah University of Science & Technology, under award number FCC/1/1976-19-01. | en_US |
dc.publisher | arXiv | en_US |
dc.relation.url | https://arxiv.org/pdf/2101.09704 | en_US |
dc.rights | Archived with thanks to arXiv | |
dc.title | Does Head Label Help for Long-Tailed Multi-Label Text Classification | |
dc.type | Conference Paper | |
dc.contributor.department | Computer Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division | |
dc.conference.date | FEB 02-09, 2021 | |
dc.conference.name | 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence | |
dc.conference.location | ELECTR NETWORK | |
dc.identifier.wosut | WOS:000681269805088 | |
dc.eprint.version | Post-print | |
dc.contributor.institution | Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing, Peoples R China. | |
dc.identifier.volume | 35 | |
dc.identifier.pages | 14103-14111 | |
dc.identifier.arxivid | 2101.09704 | |
kaust.person | Zhang, Xiangliang | |
kaust.grant.number | FCC/1/1976-19-01 | |
refterms.dateFOA | 2021-02-04T11:39:47Z |
Files in this item
This item appears in the following Collection(s)
-
Conference Papers
-
Computer Science Program
For more information visit: https://cemse.kaust.edu.sa/cs -
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/