|Introduction | Technical Background | Applications | Tools | Websites | Mailing lists|
Databases can contain vast quantities of data describing decisions, performance and operations. In many cases the database contains critical information concerning past business performance which could be used to predict the future.
Often the sheer volume of the data can make the extraction of this business information impossible by manual methods. Data mining is a set of techniques which allows you to do this.
Data Mining (also known as Knowledge Discovery) technology helps businesses discover hidden data patterns and provides predictive information which can be applied to benefit the business.
The basic approach is to access a database of historical data and to identify relationships which have a bearing on a specific issue, and then extrapolate from these relationships to predict future performance or behaviour. The human analyst plays an important role in that only they can decide whether a pattern, rule or function is interesting, relevant and useful to an enterprise.
Data Mining and Knowledge Discovery in Databases are terms used interchangeably. Other terms often used are data or information harvesting, data archeology, functional dependency analysis, knowledge extraction and data pattern analysis. A high level definition of Data Mining is: the non-trivial process of identifying valid, novel, potentially useful and ultimately understandable patterns in data. Data mining is not a simple process and there is no tool that can do the job automatically. Data mining can be aided by tools, but it requires both human data mining expertise and human domain expertise. Data mining consists of a number of operations, each of which are supported by a variety of technologies, such as rule induction, neural networks, conceptual clustering. In real world applications information extraction requires the cooperative use of several data mining operations and techniques.The basic data mining process is as follows:
Data mining is typically not used as a business system delivery technology. Rather it is an extremely powerful and effective set of technologies for analysing and clustering data which can be used to form the basis of a system.
The key reason why Data Mining is such a buzzword at the moment is that because many organisations recognised the need to better understand their customers. Data mining can deliver real world results. Data mining has been used for the following types of applications:
The July 1996 Edition of "Intelligent Software Strategies" was devoted to Intelligent Data Mining Tools . They classified the tools by their knowledge discovery techniques. The 7 classifications were: Rule and Decision Tree Discovery; Neural Networks; Conventional Statistical; Advanced Visualisation; Fuzzy Techniques; Knowledge Based; and Multiple Techniques. Over 40 tools are reviewed and addresses of the vendors (some with website addresses) are given.
About AIAI |
Last updated 2nd June 1997
by Ian Harrison