LPIS Home Page
Google Search

Title: Protein Classification with Multiple Algorithms
Author(s): S. Diplaris, G. Tsoumakas, P. Mitkas, I. Vlahavas.
Availability: Click here to download the PDF (Acrobat Reader) file (10 pages).
Appeared in: 10th Panhellenic Conference on Informatics (PCI 2005), P. Bozanis and E.N. Houstis (Eds.), Springer-Verlag, LNCS 3746, pp. 448-456, Volos, Greece, 11-13 November, 2005.
Abstract: Nowadays, the number of protein sequences being stored in central protein databases from labs all over the world is constantly increasing. From these proteins only a fraction has been experimentally analyzed in order to detect their structure and hence their function in the corresponding organism. The reason is that experimental determination of structure is labor-intensive and quite time-consuming. Therefore there is the need for automated tools that can classify new proteins to structural families. This paper presents a comparative evaluation of several algorithms that learn such classification models from data concerning patterns of proteins with known structure. In addition, several approaches that combine multiple learning algorithms to increase the accuracy of predictions are evaluated. The results of the experiments provide insights that can help biologists and computer scientists design high-performance protein classification systems of high quality
See also :

        This paper has been cited by the following:

1 Bian, S., Wang, W. (2006) Investigation on diversity in homogeneous and heterogeneous ensembles, Proceedings 2006 IEEE International Conference on Neural Networks, pp. 3078-3085
2 C. Katar, "Combining Multiple Techniques for Intrusion Detection", JCSNS International Journal of Computer Science and Network Security, 6(2B), pp. 208-218, February 2006.
3 Kedarisetti, K.D., Kurgan, L., Dick, S. Classifier ensembles for protein structural class prediction with varying homology, (2006), Biochemical and Biophysical Research Communications, 348 (3), pp. 981-988.
4 Bian, S., Wang, W. (2007) On diversity and accuracy of homogeneous and heterogeneous ensembles, International Journal of Hybrid Intelligent Systems 4, pp. 103–128.
5 Shu-Peng Wan, Jian-Hua Xu, “A multi-label classification algorithm based on triple class support vector machine”, Proc. Int. Conf. on Wavelet Analysis and Pattern Recognition 2007 (ICWAPR’07), pp. 1447-1452.
6 Sarinnapakorn, K. (2007) Induction of Classifiers from Multi-labeled Examples: An Information-Retrieval Point of View, PhD Dissertation, University of Miami, 2007.
7 P. Liewlom, T. Rakthanmanon, K. Waiyamai, "Prediction of Enzyme Class by using Reactive Motifs Generated from Binding and Catalytic Sites", Proc. 3rd International Conference on Advanced Data Mining and Applications (ADMA 2007), pp. 442-453, August 2007.
8 Mohamed, S., Rubin, D., Marwala, T. “Multi-class protein sequence classification using fuzzy ARTMAP”, (2007) Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics, 2, pp. 1676-1681.
9 T. Lingner, P. Meinicke, (2008) “Fast Target Set Reduction for Large-Scale Protein Function Prediction: A Multi-class Multi-label Machine Learning Approach”, Proc. 8th International Workshop on Algorithms in Bioinformatics, WABI 2008, pp 198-209.
10 Sang-Hyeun Park, Johannes Fürnkranz, Multi-Label Classification with Label Constraints, Proc. of the ECML/PKDD 2008 Workshop on Preference Learning, 2008.
11 Chen, B., Ma, L., Hu, J. (2008) A New SVM Based Method for Solving Multi-Label Classification Problem, Proc. of the 3rd International Symposium on Computational Intelligence and Industrial Applications (Dali), 11, 325-334, 2008
12 K. Waiyamai, P. Liewlom, T. Kangkachit, T. Rakthanmanon “Concept Lattice–Based Mutation Control for Reactive Motifs Discovery”, Proc. 12th Pacific-Asia Conference, PAKDD 2008 Osaka, Japan, May 20-23, 2008, pp. 767-776
13 Tahir, M.A., Kittler, J., Yan, F., Mikolajczyk, K. (2009) Kernel Discriminant Analysis Using Triangular Kernel For Semantic Scene Classification, Proceedings of the 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 3-5 June 2009, Chania, Greece, IEEE, 2009.
14 Cerri, R., da Silva, R.R.O, de Carvalho A.C.P.L.F. (2009) Comparing Methods for Multilabel Classification of Proteins Using Machine Learning Techniques, Proc. of the 4th Brazilian Symposium on Bioinformatics, BSB 2009, Porto Alegre, Brazil, July 29-31, 2009.
15 Wu, Y., Ren, F. (2009) Simple linguistic processing effect on multi-label emotion classification, Proc. 2009 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2009.
16 Li, J., Xu, J. (2009) A fast multi-label classification algorithm based on double label support vector machine (2009) Proc. 2009 International Conference on Computational Intelligence and Security (CIS 2009), pp. 30-35.
17 Rahman, H., Application of Data Mining Algorithms for Measuring Performance Impact of Social Development Activities, Chapter VIII, in Data Mining Applications for Empowering Knowledge Societies (Ed. Rahman, H), IGI Global, pp. 136-160, 2009.
18 Tsai, S.-C., Jiang, J.-Y., Wu, C., Lee, S.-J. (2009) A Fuzzy Similarity-Based Approach for Multi-label Document Classification, Proc. 2nd International Workshop on Computer Science and Engineering, pp. 59-63
19 Chen, B., Gu, W., Hu, J. (2010) An improved multi-label classification method and its application to functional genomics, International Journal of Computational Biology and Drug Design, 3 (2), pp. 133-145.
20 Rokach, L, Itachm E. (2010) An Ensemble Method for Multi-label Classification using an Approximation Algorithm for the Set Covering Problem, Proc. 2nd International Workshop on Multi-Label Learning.
21 Correa, D.C., Saito J.H., da Costa, L.F. (2010) Musical genres: beating to the rhythms of different drums. New Journal of Physics 12.
22 Wei, Z (2010) The research on Chinese text multi-label classification, PhD Thesis, Université Lumičre Lyon 2.
23 Chen, B., Sun, F., Hu, J. (2010) "Local linear multi-SVM method for gene function classification", Proc. 2010 Second World Congress on Nature and Biologically Inspired Computing (NaBIC), pp.183-188, 15-17 Dec. 2010.
24 Shuo Xiang, Songcan Chen and Lishan Qiao (2010) Sparse Representation: Extract Adaptive Neighborhood for Multilabel Classification, PRICAI 2010: Trends in Artificial Intelligence, Lecture Notes in Computer Science, 2010, Volume 6230/2010, 304-314
25 Bindoff, I. (2010) Multiple classification ripple round rules: classifications as conditions. PhD Thesis, University of Tasmania.
26 Zhang, S., Li, B., and Xue, X. 2010. Semi-automatic dynamic auxiliary-tag-aided image annotation. Pattern Recogn. 43, 2 (Feb. 2010), 470-477.
27 Ávila-Jiménez, J.L., Gibaja, E., Ventura, S. (2010) Evolving multi-label classification rules with gene expression programming: A preliminary study, 5th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2010, San Sebastian, 23-25 June 2010, Proceedings, Part II, pp. 9-16.
28 Doquire, G., Verleysen, M. (2011) Feature Selection for Multi-label Classification Problems, Proceedings, Part I, 11th International Work-Conference on Artificial Neural Networks, IWANN 2011, Torremolinos-Málaga, Spain, June 8-10, 2011, pp. 9-16.
29 Vivian F. López Batista, Fernando Prieta Pintado, Ana Belén Gil, Sara Rodríguez and María N. Moreno (2011) A System for Multi-label Classification of Learning Objects, Proc. 6th International Conference on Soft Computing Models in Industrial and Environmental Applications, SOCO 2011, pp. 523-531.
30 Ávila, J.L., Gibaja, E.L., Zafra, A., Ventura, S. (2011) A gene expression programming algorithm for multi-label classification, Journal of Multiple-Valued Logic and Soft Computing, 17 (2-3), pp. 183-206.
31 Madjarov, G., Gjorgjevikj, D., Džeroski, S. (2011) Dual layer voting method for efficient multi-label classification, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6669 LNCS, pp. 232-239.
32 He, J., Gu, H., Wang, Z. (2012) Bayesian multi-instance multi-label learning using gaussian process prior, Machine Learning, 88 (1-2), pp. 273-295.
33 Zhou, T., Tao, D., Wu, X. (2012) Compressed labeling on distilled labelsets for multi-label learning, Machine Learning, 88 (1-2), pp. 69-126.
34 Chen, B., Duan, L., Hu, J. (2012) Composite kernel based SVM for hierarchical multi-label gene function classification, Proceedings of the International Joint Conference on Neural Networks, art. no. 6252555.
35 Santos, A.M., Canuto, A.M.P. (2012) Using semi-supervised learning in multi-label classification problems, Proceedings of the International Joint Conference on Neural Networks, art. no. 625280.
36 Zhang, T., Dai, H., Liu, L.A., Lewis, D.F.V., Wei, D. (2012) Classification models for predicting cytochrome p450 enzyme-substrate selectivity, Molecular Informatics, 31 (1), pp. 53-62.
37 López, V.F., De La Prieta, F., Ogihara, M., Wong, D.D. (2012) A model for multi-label classification and ranking of learning objects, Expert Systems with Applications, 39 (10), pp. 8878-8884.
38 Nasierding, G.; Kouzani, A.Z.; (2012) "Comparative evaluation of multi-label classification methods, Proceedings 9th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), pp.679-683, 29-31 May 2012.
39 Zhu, Y., Luo, W., Chen, G., Ou, J. (2012) A multi-label classification method based on associative rules, Journal of Computational Information Systems, 8 (2), pp. 791-799.