Data Mining

Chapter 1 Exercises

1. What is data mining? In your answer, address the following:
Data mining refers to the process or method that extracts or "mines" interesting knowledge or patterns from large amounts of data.
(a) Is it another hype?
Data mining is not another hype. Instead, the need for data mining has arisen due to the wide availability of huge amounts of data and the imminent need for turning such data into useful information and knowledge. Thus, data mining can be viewed as the result of the natural evolution of information technology.
(b) Is it a simple transformation or application of technology developed from databases, statistics, machine learning, and pattern recognition?
No. Data mining is more than a simple transformation of technology developed from databases, statistics, machine learning, and pattern recognition. It integrates techniques from multiple disciplines, such as database technology, statistics, machine learning, high-performance computing, pattern recognition, neural networks, data visualization, information retrieval, image and signal processing, and spatial data analysis.
(c) We have presented a view that data mining is the result of the evolution of database technology. Do you think that data mining is also the result of the evolution of machine learning research? Can you present such views based on the historical progress of this discipline? Do the same for the fields of statistics and pattern recognition.

(d) Describe the steps involved in data mining when viewed as a process of knowledge discovery.
The steps involved in data mining when viewed as a process of knowledge discovery are as follows:
* Data cleaning, a process that removes or transforms noise and inconsistent data
* Data integration, where multiple data sources may be combined
* Data selection, where data relevant to the analysis task are retrieved from the database
* Data transformation, where data are
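
The first three steps listed above (cleaning, integration, selection) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the record fields (`customer_id`, `amount`, `region`), the rule that a missing or negative amount counts as noise, and the function names are hypothetical and do not come from the textbook.

```python
def clean(sales):
    """Data cleaning: drop records whose amount is missing or negative.

    Treating such values as noise is an assumption for this sketch.
    """
    cleaned = []
    for record in sales:
        amount = record.get("amount")
        if isinstance(amount, (int, float)) and amount >= 0:
            cleaned.append(record)
    return cleaned


def integrate(sales, customers):
    """Data integration: combine two sources on a shared customer_id key."""
    customers_by_id = {c["customer_id"]: c for c in customers}
    merged = []
    for sale in sales:
        customer = customers_by_id.get(sale["customer_id"])
        if customer is not None:
            # Merge the fields of both sources into one record.
            merged.append({**sale, **customer})
    return merged


def select(records, region):
    """Data selection: keep only the records relevant to the analysis task."""
    return [r for r in records if r["region"] == region]


def run_pipeline(sales, customers, region):
    """Chain the three steps in the order given in the list above."""
    return select(integrate(clean(sales), customers), region)


if __name__ == "__main__":
    sales = [
        {"customer_id": 1, "amount": 10.0},
        {"customer_id": 2, "amount": None},   # missing value, removed by clean()
        {"customer_id": 3, "amount": 5.0},
        {"customer_id": 1, "amount": -4.0},   # negative value, removed by clean()
    ]
    customers = [
        {"customer_id": 1, "region": "north"},
        {"customer_id": 3, "region": "south"},
    ]
    print(run_pipeline(sales, customers, "north"))
```

Each step is a separate function so that the order of the stages stays visible. In practice the cleaning rules and the join key depend entirely on the data at hand.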
