LPIS Home Page
Google Search

Title: Random k-Labelsets for Multi-Label Classification
Author(s): G. Tsoumakas, I. Katakis, I. Vlahavas.
Availability: Click here to download the PDF (Acrobat Reader) file (12 pages).
Appeared in: IEEE Transactions on Knowledge and Data Engineering, IEEE, 23(7), pp. 1079-1089, 2011.
Abstract: A simple yet effective multi-label learning method, called label powerset (LP), considers each distinct combination of labels that exist in the training set as a different class value of a single-label classification task. The computational efficiency and predictive performance of LP is challenged by application domains with large number of labels and training examples. In these cases the number of classes may become very large and at the same time many classes are associated with very few training examples. To deal with these problems, this paper proposes breaking the initial set of labels into a number of small random subsets, called {\em labelsets} and employing LP to train a corresponding classifier. The labelsets can be either disjoint or overlapping depending on which of two strategies is used to construct them. The proposed method is called RA$k$EL (RAndom $k$ labELsets), where $k$ is a parameter that specifies the size of the subsets. Empirical evidence indicate that RA$k$EL manages to improve substantially over LP, especially in domains with large number of labels and exhibits competitive performance against other high-performing multi-label learning methods.
See also :

        This paper has been cited by the following:

1 Sanden,C. (2010) An Empirical Evaluation of Computational and Perceptual Multi-Label Genre Classification on Music, Master Thesis, Department of Mathematics and Computer Science, University of Lethbridge, Canada.
2 Nasierding, G, Kouzani, A. (2010) Image to Text Translation by Multi-Label Classification, Proc. 6th International Conference on Intelligent Computing, ICIC 2010, Changsha, China, August 18-21, 2010, pp. 247-254.
3 Nasierding, G., Kouzani, A.Z. (2010) "Empirical Study of Multi-label Classification Methods for Image Annotation and Retrieval," dicta, pp.617-622, 2010 International Conference on Digital Image Computing: Techniques and Applications.
4 Santos, A.M., Canuto, A.M.P., Neto, A.F. (2010) Evaluating classification methods applied to multi-label tasks in different domains, 10th International Conference on Hybrid Intelligent Systems, HIS 2010, art. no. 5600014, pp. 61-66.
5 Juan Bekios-Calfa, J., Buenaposada, J.M., Baumela, L. (2011) On the Importance of Multi-dimensional Information in Gender Estimation from Face Images, Proc. 16th Iberoamerican Congress on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, CIARP 2011, Pucón, Chile, November 15-18, 2011, pp. 264-271
6 Bielza, C., Li, G., Larrañaga, P. Multi-dimensional classification with Bayesian networks (2011) International Journal of Approximate Reasoning, 52 (6), pp. 705-727.
7 Choi, H., Zhu, B.B., Lee, H. (2011) Detecting Malicious Web Links and Identifying Their Attack Types, Proc. WebApps '11: 2nd USENIX Conference on Web Application Development.
8 X. Han, J. Liu, Z. Shen, C. Miao (2011) An Optimized K-Nearest Neighbor Algorithm for Large Scale Hierarchical Text Classification, Proc. Joint ECML/PKDD PASCAL Workshop on Large-Scale Hierarchical Classification, pp. 2-12
9 Xiao-lin Wang, Hai Zhao, Bao-liang Lu (2011) Enhanced K-Nearest Neighbour Algorithm for Large-scale Hierarchical Multi-label Classification, Proc. Joint ECML/PKDD PASCAL Workshop on Large-Scale Hierarchical Classification, pp. 58-67.
10 Xu, J. (2011) An extended one-versus-rest support vector machine for multi-label classification, Neurocomputing, 74 (17), pp. 3114-3124.
11 Araken M Santos, Anne M P Canuto and Antonino Feitosa Neto (2011) "A Comparative Analysis of Classification Methods to Multi-label Tasks in Different Application Domains", International Journal of Computer Information Systems and Industrial Management Applications, ISSN 2150-7988 Volume 3, pp. 218-227
12 Ulanov, A., Sapozhnikov, G., Lyubomishchenko, N., Shevlyakov, G. (2011) Enhancing Accuracy of Multilabel Classification by Extracting Hierarchies from Flat Clusterings, Proc. 8th International Workshop on Text-based Information Retrieval.
13 Wang, Hua; Huang, Heng; Ding, Chris; (2011) Image annotation using bi-relational graph of images and semantic labels, Proc. 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.793-800, 20-25 June 2011
14 Costa, N., Coelho, A.L.V. (2011) Genetic and ranking-based selection of components for multilabel classifier ensembles, Hybrid Intelligent Systems (HIS), 2011 11th International Conference on , vol., no., pp.311-317, 5-8 Dec. 2011
15 France, S.L., Abbasi, A. (2011) Boosting unsupervised additive clustering using cluster-wise optimization and multi-label learning, Proceedings - IEEE International Conference on Data Mining, ICDM, art. no. 6137385, pp. 236-243.
16 Han,X.;Li,S.;Shen,Z (2012) A k-NN Method for Large Scale Hierarchical Text Classification at LSHTC3, Proceedings ECML-PKDD 2012 PASCAL Workshop on Large-Scale Hierarchical Classification, Bristol, UK.
17 Ramón Quevedo, J., Luaces, O., Bahamonde, A. (2012) Multilabel classifiers with a probabilistic thresholding strategy, Pattern Recognition, 45 (2), pp. 876-883.
18 Xu, J. (2012) An efficient multi-label support vector machine with a zero label, Expert Systems with Applications, 39 (5), pp. 4796-4804.
19 Fu, B., Wang, Z., Pan, R., Xu, G., Dolog, P. (2012) Learning tree structure of label dependency for multi-label learning, Proc 16th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2012, 7301 LNAI (PART 1), pp. 159-170.
20 Pacharawongsakda, E., Theeramunkong, T. Towards more efficient multi-label classification using dependent and independent dual space reduction (2012), Proc. 16th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2012, 7301 LNAI (PART 2), pp. 383-394.
21 Tahir, M.A., Kittler, J., Yan, F. Inverse random under sampling for class imbalance problem and its application to multi-label classification (2012) Pattern Recognition, 45 (10), pp. 3738-3750.
22 You, M., Liu, J., Li, G.-Z., Chen, Y. (2012) Embedded Feature Selection for Multi-label Classification of Music Emotions, International Journal of Computational Intelligence Systems, 5 (4), pp. 668-678.
23 Santos, A.M., Canuto, A.M.P. (2012) Using semi-supervised learning in multi-label classification problems, Proceedings of the International Joint Conference on Neural Networks, art. no. 625280.
24 Cesa-Bianchi, N., Re, M., Valentini, G. (2012) Synergy of multi-label hierarchical ensembles, data fusion, and cost-sensitive methods for gene functional inference, Machine Learning, 88 (1-2), pp. 209-241.
25 Sobol-Shikler, T. (2012) Inference of Co-occurring Classes: Multi-class and Multi-label Classification, In: Computational Intelligence Paradigms in Advanced Pattern Classification, M.R Ogiela, L.C. Jain (Eds.), pp. 171-197.