Abstract
Many works have used knowledge bases that contain taxonomy of hierarchically structured categories for large-scale text classification. These works have utilized hierarchical taxonomies based on the explicit representation model. They demonstrated that the explicit representation model provides a stable performance for large-scale text classification. However, this performance is limited to the knowledge base. In this paper, we integrate the implicit representation model, which has the ability to use external knowledge indirectly, with previous large-scale text classification. To this end, we first propose Hierarchical Category embedding (HC embedding) to generate distributed representations of hierarchical categories based on the implicit representation model. Second, we develop a new semantic similarity method to integrate HC embedding with the large-scale text classification. To demonstrate efficacy, we apply the proposed methodology to Open Directory Project (ODP)-based text classification, which has a hierarchical taxonomy. The evaluation results demonstrate that the proposed method outperforms the current state-of-the-art method by 7.4 %, 7.0 %, and 18 % in terms of micro-averaging F1-score, macro-averaging F1-score, and precision at k, respectively.
Original language | English |
---|---|
Title of host publication | Proceedings of 2018 IEEE 17th International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2018 |
Editors | Newton Howard, Sam Kwong, Yingxu Wang, Jerome Feldman, Bernard Widrow, Phillip Sheu |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 246-253 |
Number of pages | 8 |
ISBN (Electronic) | 9781538633601 |
DOIs | |
State | Published - 4 Oct 2018 |
Event | 17th IEEE International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2018 - Berkeley, United States Duration: 16 Jul 2018 → 18 Jul 2018 |
Publication series
Name | Proceedings of 2018 IEEE 17th International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2018 |
---|
Conference
Conference | 17th IEEE International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2018 |
---|---|
Country/Territory | United States |
City | Berkeley |
Period | 16/07/18 → 18/07/18 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Artificial neural networks
- Embedding
- Knowledge manipulations
- Knowledge representation