LPIS Home Page
Google Search

Title: An Adaptive Personalized News Dissemination System
Author(s): I. Katakis, G. Tsoumakas, E. Banos, N. Bassiliades, I. Vlahavas.
Availability: Click here to download the PDF (Acrobat Reader) file (21 pages).
Keywords: Web, Text Classification, RSS, Personalization, News, Naive Bayes, Machine Learning, Dynamic Feature Space, Data Streams, Concept Drfit.
Appeared in: Journal of Intelligent Information Systems, Springer, 32 (2), pp. 191-201, 2009.
Abstract: With the explosive growth of the Word Wide Web, information overload became a crucial concern. In a data-rich information-poor environment like the Web, the discrimination of useful or desirable information out of tons of mostly worthless data became a tedious task. The role of Machine Learning in tackling this problem is thoroughly discussed in the literature, but few systems are available for public use. In this work, we bridge theory to practice, by implementing a web-based news reader enhanced with a specifically designed machine learning framework for dynamic content personalization. This way, we get the chance to examine applicability and implementation issues and discuss the effectiveness of machine learning methods for the classification of real-world text streams. The main features of our system named PersoNews are: a) the aggregation of many different news sources that offer an RSS version of their content, b) incremental filtering, offering dynamic personalization of the content not only per user but also per each feed a user is subscribed to, and c) the ability for every user to watch a more abstracted topic of interest by filtering through a taxonomy of topics. PersoNews is freely available for public use on the WWW (http://news.csd.auth.gr).
See also :

        This paper has been cited by the following:

1 Zeina Chedrawy and Syed Sibte Raza Abidi, "A Web Recommender System for Recommending, Predicting and Personalizing Music Playlists", 10th International Conference on Web Information Systems Engineering - WISE 2009, Poznan, Poland, October 5-7, 2009. Proceedings 2009, 335-342, 2009.
2 Giannikopoulos, P., Varlamis, I., Eirinaki, M. (2009) Mining Frequent Generalized Patterns for Web Personalization in the presence of Taxonomies, in International Journal of Data Warehousing and Mining, Vol. 6, No.1, pp. 4-15, August 2009.
3 Nikolaos Nanas, Manolis Vavalis, Elias Houstis, "Personalised news and scientific literature aggregation", Information Processing and Management, In Press.
4 Dariusz Brzezinski, Mining Data Streams with Concept Drift, MSc Thesis, Poznan University of Technology, Faculty of Computing Science and Management, Institute of Computing Science, Poznan, 2010
5 Liu, J., Dolan, P., and Pedersen, E. R. 2010. Personalized news recommendation based on click behavior. In Proceeding of the 14th international Conference on intelligent User interfaces (Hong Kong, China, February 07 - 10, 2010). IUI '10. ACM, New York, NY, 31-40.
6 S. Decherchi, P.Gastaldo, F. Sangiacomo, A. Leoncini, R. Zunino "Operative Assessment of Predicted Generalization Errors on Non-Stationary Distributions in Data-Intensive Applications ", Intelligent Data Analysis, IOS Press, 2010, in press
7 Riccardo Di Massa, Maurizio Montagnuolo, and Alberto Messina. 2010. Implicit news recommendation based on user interest models and multimodal content analysis. In Proceedings of the 3rd international workshop on Automated information extraction in media production (AIEMPro '10). ACM, New York, NY, USA, 33-38.
8 , , , , , , , , , and , 2010.
9 Sarah N. Kohail, Learning Concept Drift Using Adaptive Training Set Formation Strategy, MSc Thesis, Faculty of Information Technology, The Islamic University of Gaza, October 2011.
10 Sergio Decherchi, On the Structure of the Hypothesis Space, Model Selection, and Applications of Statistical Learning Theory, PhD Thesis, Engineering Faculty, University Of Genoa, Italy, April 2011.
11 Krishna Kamath and James Caverlee (2011) Expert-Driven Topical Classification of Short Message Streams, To apper in: Proceedings of 3rd IEEE Conference on Social Computing (SocialCom 2011), MIT, Boston, USA.
12 Zhang, Ming; Kianmehr, Keivan; Alhajj, Reda; "Effective monitoring by efficient fingerprint matching using a forest of NAQ-trees", Journal of Intelligent Information Systems, Volume 37, Issue 2, October 2011, Pages 267-290.
13 Kyosuke Nishida, Takahide Hoshide, and Ko Fujimura. 2012. Improving tweet stream classification by detecting changes in word probability. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '12). ACM, New York, NY, USA, 971-980.
14 Silic, A., Dalbelo Basic, B., Exploring classification concept drift on a large news text corpus, Computational Linguistics and Intelligent Text Processing, Springer, LNCS 7181 (PART 1), pp. 428-437, 2012.
15 Lai, Y., Xu, X., Yang, Z., Liu, Z., User interest prediction based on behaviors analysis, (2012) International Journal of Digital Content Technology and its Applications, 6 (13), pp. 192-204.
16 Holton, Avery. Negating nodes and liquid fragmentation: Applications of traditional communication models and theories in the midst of diffusing communication innovations. Communication Theory, 2012, 22 (3) pp. 279-298.
17 Agarwal, S., Singhal, A., Bedi, P., Classification of RSS feed news items using ontology, (2012) International Conference on Intelligent Systems Design and Applications, ISDA, art. no. 6416587, pp. 491-496.
18 Kyosuke NISHIDA, Takashide HOSHIDE, and Ko FUJIMURA, Tweet Stream Classi?cation with Word Non-stationarity: Learning with Suf?x Arrays, The 4th Forum on Data Engineering and Information Management (Deim2012), March 3-5 2012.
19 Kavasoǧlu, Z., Öǧüdücü, S.G., Personalized summarization of customer reviews based on user's browsing history, (2013) Proceedings of the IADIS International Conference Intelligent Systems and Agents 2013, ISA 2013, Proceedings of the IADIS European Conference on Data Mining 2013, ECDM 2013, pp. 21-28.
20 Krishna Yeshwanth Kamath, Mining, Modeling, and Analyzing Real-Time Social Trails, PhD Thesis, Texas A&M University, August 2013.
21 Petr Kosina, João Gama, Very fast decision rules for classification in data streams, Data Mining and Knowledge Discovery, DOI: 10.1007/s10618-013-0340-z, December 2013.
22 Yi-Ren Yeh, Yu-Chiang Frank Wang, A rank-one update method for least squares linear discriminant analysis with concept drift, Pattern Recognition, 46(5) pp. 1267-1276, 2013.
23 Gama, João; Kosina, Petr; Recurrent concepts in data streams classification, Knowledge and Information Systems, May 2013, pp. 1-19. DOI 10.1007/s10115-013-0654-6
24 Jeremiah Smith, Naranker Dulay, Mate Attila Toth, Oliver Amft, Yanxia Zhang, Exploring Concept Drift using Interactive Simulations, 10th IEEE Workshop on Context Modeling and Reasoning 2013, San Diego (18 March 2013), pp. 49-54.
25 Ghorab, M.; Zhou, Dong; OConnor, Alexander; Wade, Vincent; Personalised Information Retrieval: survey and classification; User Modeling and User-Adapted Interaction, 23(4), pp. 381-443 SEP 2013, Springer.
26 Carmona-Cejudo, J.M., Castillo, G., Baena-García, M., Morales-Bueno, R., A comparative study on feature selection and adaptive strategies for email foldering using the ABC-DynF framework, Knowledge-Based Systems, volume 46, issue , year 2013, pp. 81 - 94
27 Joung Woo Ryu, Myung Won Kim, An Ensemble Model based on Data Distribution for Streaming Data Classification, Database and Information Science Journal, Vol. 40, No 2, (2013.4), pp. 89-98.
28 Messina, A., Montagnuolo, M., Di Massa, R., Borgotallo, R., Hyper Media News: A fully automated platform for large scale analysis, production and distribution of multimodal news content, (2013) Multimedia Tools and Applications, 63 (2), pp. 427-460.
29 Shahparast, Homeira; Jahromi, MansoorZolghadri; Taheri, Mohammad; Hamzeloo, Sam; A Novel Weight Adjustment Method for Handling Concept-Drift in Data Stream Classification, Arabian Journal for Science and Engineering, Springer, 39(2) pp. 799 807, 2014.
30 Amey Desai, Streaming Algorithms for Matrix Approximation, MSc Thesis, School of Computing, University of Utah, USA, December 2014.
31 Sela, M., Lavie, T., Inbar, O., Oppenheim, I., and Meyer, J. (2014). Personalizing news content: An experimental study. Journal of the Association for Information Science and Technology.
32 Jean Paul Barddal, Heitor Murilo Gomes, and Fabrício Enembreck. 2014. SFNClassifier: a scale-free social network method to handle concept drift. In Proceedings of the 29th Annual ACM Symposium on Applied Computing (SAC '14). ACM, New York, NY, USA, 786-791. DOI=10.1145/2554850.2554855 http://doi.acm.org/10.1145/2554850.2554855
33 Heitor Murilo Gomes and Fabrício Enembreck. 2014. SAE2: advances on the social adaptive ensemble classifier for data streams. In Proceedings of the 29th Annual ACM Symposium on Applied Computing (SAC '14). ACM, New York, NY, USA, 798-804. DOI=10.1145/2554850.2554905 http://doi.acm.org/10.1145/2554850.2554905
34 YANG, Tao, PENG Ru-xiang, Li Ying-na, An Ontology-based Personalized Knowledge Retrieval Method, Computer Knowledge and Technology, Vol. 10, No.7. March 2014, pp. 1382-1386.
35 Hughes, T. (2014). Co-creation: Moving towards a framework for creating innovation in the triple helix. Prometheus, , 1-14. doi:10.1080/08109028.2014.971613
36 Dayrelis Mena Torres, Clasificacion De Flujos De Datos Basada En Similitud, PhD thesis, Departa-mento De Ciencias De La Computacion Inteligencia Artificial, Universidad De Granada, Spain, 2014.