Preview

Data Mining

Powerful Essays
Open Document
Open Document
1453 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Data Mining
SMS CUSAT
Reading Material on Data Mining Anas AP & Alex Titty John

• What is Data?
Data is a collection of facts and information or unprocessed information.
Example: Student names, Addresses, Phone Numbers etc.

• What is a Database?
A structured set of data held in a computer which is accessible in various ways.
Example: Electronic Address Book, Phone Book.

• What is a Data Warehouse?
The electronic storage of large amount of data by business.
Concept originated in 1988
IBM researchers Barry Devlin & Paul Murphy
Used in business for DATA MINING & data exploration
Data warehouse is a decision support database that is maintained separately from the organization 's operational data base.
Supports Information processing, by providing a solid platform of consolidated, historical data for analysis.

“A process of transforming data into information and making it available to users in a timely enough manner to make a difference”
[Forrester Research, April 1996]

• What is Data Mining?
“Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner.”

Data mining is the exploration and analysis of large quantities of data in order to discover valid, novel, potentially useful and understandable patterns in data.

Valid: The patterns are true.
Novel: We did not know the pattern beforehand.
Useful: We can devise actions from the patterns.
Understandable: We can interpret and comprehend the patterns.

The relationships and summaries derived through a data mining exercise are often referred to as models or patterns.
Examples include linear equations, rules, clusters, graphs, tree structures, and patterns in time series • What’s the difference between data mining and data warehousing
Data mining is the process of finding patterns in a given data set. These patterns can often



References: Principles of Knowledge Discovery in Databases, Osmar R. Zaïane, 1999 | Principles of Data Mining by David Hand Heikki Mannila & Padhraic Smyth |

You May Also Find These Documents Helpful

  • Good Essays

    Cis 850 Study Guid

    • 499 Words
    • 2 Pages

    * Explain both data warehousing and data mining. How are they related? List at least three uses of data mining.…

    • 499 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Bsc303 Chapter 1 Study Guide

    • 4685 Words
    • 19 Pages

    Data Mining- the process of searching huge amounts of data with the hope of finding a pattern…

    • 4685 Words
    • 19 Pages
    Powerful Essays
  • Satisfactory Essays

    Biology Exam Paper

    • 2143 Words
    • 9 Pages

    Data ____ refers to the process of analyzing information in databases to discover previously unknown and potentially useful information.…

    • 2143 Words
    • 9 Pages
    Satisfactory Essays
  • Better Essays

    Created in many different forms and formats, data is collected, processed, stored, and retrieved by business to support the many informational needs of organizations.�� INCLUDEPICTURE "https://api.turnitin.com/images/spacer.gif" * MERGEFORMATINET �� HYPERLINK "javascript:void(0);" Business data enters an organization 's information system through software applications. The software applications process and code the data with proprietary formats that are difficult to extract or report without the help of sophisticated report writer or data extraction tools.�� INCLUDEPICTURE "https://api.turnitin.com/images/spacer.gif" * MERGEFORMATINET �� HYPERLINK "javascript:void(0);" Data is the heart of any business. Without good data turned into information, management can not make the proper decisions.�� INCLUDEPICTURE "https://api.turnitin.com/images/spacer.gif" * MERGEFORMATINET �� HYPERLINK "javascript:void(0);" The advances in computer processing power, storage capabilities, and the development of more ways to add information to data have paved the way for a radically new approach to collecting, storing, retrieving, and reporting business information: to build an entire information…

    • 1645 Words
    • 7 Pages
    Better Essays
  • Good Essays

    True False

    • 378 Words
    • 2 Pages

    5. Data mining uses business intelligence tools and techniques on a variety of data sources brought together in a data warehouse.…

    • 378 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Bis Midterm Sheet

    • 1467 Words
    • 6 Pages

    A data warehouse is to extract and clean data from operational systems and other sources to store and catalog that data for processing by BI tools. Data warehouses can include external data purchased from outside sources. Meta data is kept in the data warehouse. Physically, a data warehouse consists of a few fast computers with very large storage devices.…

    • 1467 Words
    • 6 Pages
    Good Essays
  • Satisfactory Essays

    Data Analysis

    • 1650 Words
    • 7 Pages

    5. A tabular method that can be used to summarize the data on two variables simultaneously is called…

    • 1650 Words
    • 7 Pages
    Satisfactory Essays
  • Powerful Essays

    Question 1

    • 1978 Words
    • 24 Pages

    Data becomes _____ when it is presented in a context so that it can answer a question or support decision making.…

    • 1978 Words
    • 24 Pages
    Powerful Essays
  • Powerful Essays

    14. Data mining is the process of engineering mathematical patterns from usually large sets of data…

    • 2021 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    Economic Testbank

    • 5300 Words
    • 22 Pages

    CHAPTER 1—DATA AND STATISTICS MULTIPLE CHOICE 1. Methods for developing useful decision-making information from large data bases is known as |a. |data manipulation | |b. |data monitoring | |c.…

    • 5300 Words
    • 22 Pages
    Satisfactory Essays
  • Powerful Essays

    Data mining / predictive analysis is to identify trends, anticipated hot-spots, predict future trends based on the likelihood of specific activity, and refined resource deployment decisions.…

    • 2242 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    MS Access - Part 1

    • 468 Words
    • 2 Pages

    1. Raw data that has been organized so as to become useful is also known as…

    • 468 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    A data warehouse is a relational database that is used for reporting and data analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but it can include data from other sources. It separates analysis workload from transaction workload and enables an organization to consolidate data from several sources.…

    • 348 Words
    • 2 Pages
    Satisfactory Essays
  • Best Essays

    It Essay - Data Mining

    • 1998 Words
    • 8 Pages

    Data mining is a concept with which most of us may not be familiar in terms of its prevalence and importance. Data mining is defined as an “analysis of large pools of data to find patterns and rules that can be used to guide decision making and predict future behaviour” (Laudon, Laudon & Brabston, 2011). This can be used to discover trends for essentially everything. While purchasing our daily cup of Starbucks coffee in the morning may seem meaningless and irrelevant to us, we could very well be part of a compilation of data used for further research. There are many ways in which information can be obtained via data mining. The first of these methods is association. Association refers to the relation between the…

    • 1998 Words
    • 8 Pages
    Best Essays
  • Good Essays

    Best Practices for Msbi

    • 1064 Words
    • 5 Pages

    · The SQL Server database engine holds and manages the tables that make up your data warehouse.…

    • 1064 Words
    • 5 Pages
    Good Essays