Web and Data Mining Introduction

Data Mining: Introduction

Lecture Notes for Chapter 1
Introduction to Data Mining by Tan, Steinbach, Kumar

© Tan,Steinbach, Kumar

Introduction to Data Mining

4/18/2004

1

Why Mine Data? Commercial Viewpoint
• Lots of data is being collected and warehoused
  – Web data, e-commerce
  – Purchases at department/grocery stores
  – Bank/credit card transactions

• Computers have become cheaper and more powerful

• Competitive pressure is strong
  – Provide better, customized services for an edge (e.g. in Customer Relationship Management)


Why Mine Data? Scientific Viewpoint
• Data collected and stored at enormous speeds (GB/hour)
  – remote sensors on a satellite
  – telescopes scanning the skies
  – microarrays generating gene expression data
  – scientific simulations generating terabytes of data

• Traditional techniques infeasible for raw data

• Data mining may help scientists
  – in classifying and segmenting data
  – in hypothesis formation

Mining Large Data Sets - Motivation
• There is often information “hidden” in the data that is not readily evident
• Human analysts may take weeks to discover useful information
• Much of the data is never analyzed at all
[Figure: The Data Gap. Total new disk (TB) since 1995 compared with the number of analysts, 1995–1999; vertical axis from 0 to 4,000,000.]
From: R. Grossman, C. Kamath, V. Kumar, “Data Mining for Scientific and Engineering Applications”

What is Data Mining?
• Many definitions
  – Non-trivial extraction of implicit, previously unknown and potentially useful information from data
  – Exploration & analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns
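As a rough illustration of the second definition, discovering patterns by automatic means might look like the sketch below. It counts which pairs of items are bought together across a handful of store transactions. The transactions, the function name `frequent_pairs`, and the `min_count` threshold are all invented for this example and do not come from the slides; real data mining methods work on far larger data and use more refined techniques.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy transactions (invented for illustration, not from the slides).
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer"},
    {"milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers"},
]

def frequent_pairs(baskets, min_count):
    """Return item pairs that appear together in at least min_count baskets."""
    counts = Counter()
    for basket in baskets:
        # Sort the items so each pair has a single canonical ordering.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_count}

# Assumed threshold: a pair counts as "frequent" if seen in 3 or more baskets.
pairs = frequent_pairs(transactions, min_count=3)
for pair, n in sorted(pairs.items()):
    print(pair, n)
```

With five baskets a person could spot these pairs by eye; the point of automating the count is that the same loop applies unchanged to the warehoused transaction volumes described earlier.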


What is (not) Data Mining?
What is not Data Mining?

What is Data
