An extended process model of knowledge discovery in database

Authors: Li Tianrui; Ruan Da

Publisher: Emerald Group Publishing Ltd

ISSN: 1741-0398

Source: Journal of Enterprise Information Management, Vol. 20, Iss. 2, 2007-02, pp. 169-177

Abstract

Purpose - Much research on knowledge discovery in databases (KDD) focuses only on data mining, one of many interacting steps in the process of discovering previously unknown and potentially interesting patterns in large databases, and pays little attention to the process as a whole. Such approaches cannot meet the needs of real KDD applications. The purpose of this work is to extend a KDD process model for practical use.

Design/methodology/approach - A new model, based on research experience with the knowledge discovery process, is formalized as an extension of the model by Fayyad et al. A case study using a reduct method from rough set theory illustrates why the process model is proposed and in which situations it can be used in practice.

Findings - The model incorporates data collection into the KDD process, providing a sound framework that better supports KDD applications.

Research limitations/implications - The model reflects the nature of KDD in the cases tested. Further research may be needed before it can be applied in other situations.

Practical implications - The model can be used in information security, medical treatment and other areas of information management.

Originality/value - Using this model, one can directly collect the data that are essential and useful for the mining results. The model also offers practical help to KDD researchers in both industry and academia.