Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/17143
Title: | Evaluation of Different Data-Derived Label Hierarchies in Multi-label Classification | Authors: | Madjarov, Gjorgji Dimitrovski, Ivica GJorgjevikj, Dejan Džeroski, Sašo |
Issue Date: | 2015 | Publisher: | Springer, Cham | Source: | Madjarov, G., Dimitrovski, I., Gjorgjevikj, D., Džeroski, S. (2015). Evaluation of Different Data-Derived Label Hierarchies in Multi-label Classification. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds) New Frontiers in Mining Complex Patterns. NFMCP 2014. Lecture Notes in Computer Science(), vol 8983. Springer, Cham. https://doi.org/10.1007/978-3-319-17876-9_2 | Series/Report no.: | Lecture Notes in Computer Science; | Conference: | New Frontiers in Mining Complex Patterns. NFMCP 2014 | Abstract: | Motivated by an increasing number of new applications, the research community is devoting an increasing amount of attention to the task of multi-label classification (MLC). Many different approaches to solving multi-label classification problems have been recently developed. Recent empirical studies have comprehensively evaluated many of these approaches on many datasets using different evaluation measures. The studies have indicated that the predictive performance and efficiency of the approaches could be improved by using data derived (artificial) hierarchies, in the learning and prediction phases. In this paper, we compare different clustering algorithms for constructing the label hierarchies (in a data-driven manner), in multi-label classification. We consider flat label sets and construct the label hierarchies from the label sets that appear in the annotations of the training data by using four different clustering algorithms (balanced $$k$$-means, agglomerative clustering with single and complete linkage and predictive clustering trees). The hierarchies are then used in conjunction with global hierarchical multi-label classification (HMC) approaches. The results from the statistical and experimental evaluation reveal that the data-derived label hierarchies used in conjunction with global HMC methods greatly improve the performance of MLC methods. Additionally, multi-branch hierarchies appear much more suitable for the global HMC approaches as compared to the binary hierarchies. | URI: | http://hdl.handle.net/20.500.12188/17143 | ISSN: | 978-3-319-17876-9 | DOI: | 10.1007/978-3-319-17876-9_2 |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
9783319178752-c2.pdf | 2.42 MB | Adobe PDF | View/Open |
Page view(s)
39
checked on Jul 24, 2024
Download(s)
13
checked on Jul 24, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.