Big Data

Only available on StudyMode
  • Download(s) : 69
  • Published : June 14, 2013
Open Document
Text Preview
1

Big Data, Data Mining and Business Intelligence Techniques

2

What is Data?
• Data is information in a form suitable for use with a computer. • There are two types of data ▫ Structured ▫ Unstructured

• The total volume of data is growing 59% every year. • The number of files grow at 88% every year.

3

What is Big Data?
Exa

Analytics on Big Data at Rest
Up to 10,000 Times larger

Peta

Data Scale

Giga

Data at Rest

Tera

Data Scale

Mega

Traditional Data Warehouse and Business Intelligence
Data in Motion Up to 10,000 times faster

Analytics on Big Data in Motion

Kilo yr mo wk day hr min sec … ms s

Occasional

Frequent

Real-time

Decision Frequency

4

Big Data
• Data sets so large and complex that they become awkward to work with using on-hand database management tools. • Late 2011 the term Big Data is applied to data sets that exceed 100 TB.

5

Growth Of Big Data
• Analytics market expect to grow 40% over the next 3 years • Big Data is growing at a rate of about 7 times & within Big Data ▫ 27.3% Growth in servers ▫ 34.2% Growth in software ▫ 61.4% Growth in storage

• Venture money is flowing into Big Data (around $500 millions)

6

Sources
• • • • • Social Network Profiles Social Influences Activity Generated Data Software as a Service & cloud applications Public

7

Big Data Platform
• Hadoop System • Stream Computing • Data Warehouse

8

The Big Data Platform in Action
Opportunity Cost Starts Here

Data Ingest 01011001100001001001 11000100101001001011 100100110100101010011100101001111001000100100010010001000100101 01100100101001001010 0110010010100100101 1100010010100100101 0110010010100100101 Bootstap 0110010010100100101 Enrich 01100100101001001010 0110010010100100101 11000100101001001011 01100100101001001010 01100100101001001010 01100100101001001010 01100100101001001011 Adaptive 01100100101001001010 Analytics 11000100101001001011 01100100101001001010 Model 01100100101001001010 01100100101001001010 11000100101001001011

9

Technologies
• Hadoop

• NoSQL

• Analytics DB

10

Technical Approach
• Columnar DB

• Massively Parallel Processing Databases
• Data Warehouse Appliances

11

Data Analysis Approach

12

Business Intelligence Techniques
• Big Data (General)

13

Business Intelligence Techniques
• Big Data (Conventional)

14

Business Intelligence Techniques
• Big Data (Operational)

15

Extreme Information Processing
• Must be managed along 12 complimentary dimensions. • Some people see Cloud as a solution.

16

Security
• Sensitivity of data and algorithms can dictate sourcing including which Cloud service can be used ▫ ITAR/EAR, IP, competition-sensitive etc.

• Secure images can be built • Application security
▫ Auditing and logging ▫ Authentication and authorization ▫ Data communication and data protection

• Periodic Scanning
▫ System and application

• Monitoring and Alerting • Patching and Updating

17

18

A whole new world of opportunity?
Calling Network Merged Network

Amy Bearn 32, Married, mother of 3, Accountant Telco Score: 91 CPG Score: 76 Fashion Score: 88

Telco company
How valuable is Amy to my mobile phone network? How likely is she to switch carriers? How many other customers will follow

Social Network

Public Database

Retail er
How valuable is Amy to my retail sales? Who does she influence? What do they spend?

19

20

References
• • • • • • • www.wikipedia.com www.idc.com www.jaspersoft.com/bigdata www.infoworld.com www.znet.com www.formtek.com www.forwardthinking.pcmag.com

21

tracking img