Classifying Web Pages by Aimed Nation Using Machine Learning

Publisher: IGI Global

E-ISSN: 1947-9352

ISSN: 1947-9344

Source: International Journal of Organizational and Collective Intelligence (IJOCI), Vol. 7, Iss. 1, 2017-01, pp. 20-35



Abstract

Web page classification is the task of automatically assigning predefined classes to web pages, and it is one of the main applications of web mining. The authors aim to detect the nation that a web page targets, based on the page's content, which they present as an original application. In this paper, they propose several web mining approaches that use machine learning algorithms such as Naïve Bayes and Support Vector Machines to classify the pages, and they describe the stages of the procedure in detail. The best experimental result, obtained on an original corpus that the authors built themselves, is a noteworthy F-score of 85%.
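As a rough illustration of the kind of classifier the abstract mentions, the sketch below trains a small multinomial Naïve Bayes model on page text and predicts a target nation. It is not the authors' pipeline: the tokenizer, the add-one smoothing, the class name `NaiveBayesNationClassifier`, and the four toy pages with their labels are all assumptions made for this example, and the paper's own features and corpus are not described here.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and return its runs of letters as word tokens."""
    return re.findall(r"[^\W\d_]+", text.lower())


class NaiveBayesNationClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing.

    Hypothetical helper written for this illustration only.
    """

    def fit(self, pages, nations):
        self.word_counts = defaultdict(Counter)  # nation -> word -> count
        self.doc_counts = Counter(nations)       # nation -> number of pages
        self.vocab = set()
        for text, nation in zip(pages, nations):
            tokens = tokenize(text)
            self.word_counts[nation].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(nations)
        return self

    def predict(self, text):
        # Words never seen in training carry no evidence, so skip them.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        best_nation, best_score = None, -math.inf
        for nation, n_docs in self.doc_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / self.total_docs)
            total = sum(self.word_counts[nation].values())
            denom = total + len(self.vocab)
            for t in tokens:
                score += math.log((self.word_counts[nation][t] + 1) / denom)
            if score > best_score:
                best_nation, best_score = nation, score
        return best_nation


# Toy corpus (invented for this example, not the authors' data).
pages = [
    "Paris metro tickets and prices in euros for visitors to Lyon",
    "French regional elections results Marseille Bordeaux",
    "Tokyo train timetable and yen fares for Osaka commuters",
    "Kyoto temple opening hours prices in yen",
]
nations = ["France", "France", "Japan", "Japan"]

clf = NaiveBayesNationClassifier().fit(pages, nations)
print(clf.predict("cheap hotels in Osaka prices in yen"))
print(clf.predict("Bordeaux and Lyon elections"))
```

A Support Vector Machine could be substituted for the same task by representing each page as a word-count or TF-IDF vector; which representation the authors used is not stated in the abstract.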