Preview

Data Mining Soltions

Good Essays
Open Document
Open Document
1720 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Data Mining Soltions
CS412 Assignment 2 Ref Answer
Question 1: Assume a base cuboid of 10 dimensions contains only three base cells: (1) (a1, b2, c3, d4; ..., d9, d10), (2) (a1, c2, b3, d4, ..., d9, d10), and (3) (b1, c2, b3, d4, ..., d9, d10), where a_i != b_i, b_i != c_i, etc. The measure of the cube is count. 1, How many nonempty cuboids will a full data cube contain? Answer: 210 = 1024 2, How many nonempty aggregate (i.e., non-base) cells will a full cube contain? Answer: There will be 3 ∗ 210 − 6 ∗ 27 − 3 = 2301 nonempty aggregate cells in the full cube. The number of cells overlapping twice is 27 while the number of cells overlapping once is 4 ∗ 27 . So the final calculation is 3 ∗ 210 − 2 ∗ 27 − 1 ∗ 4 ∗ 27 − 3, which yields the result. 3, How many nonempty aggregate cells will an iceberg cube contain if the condition of the 4, iceberg cube is "count >= 2"? Answer: There are in total 5 ∗ 27 = 640 nonempty aggregate cells in the iceberg cube. To calculate the result: fix the first three dimensions as (***), (a1**), (*c1*), (**b3) or (*c1b3), and vary the rest seven ones. 4, How many closed cells are in the full cube? Answer: There’re 6 closed cells in the full cube: 3 base cells; (a1, *, *, d4, …, d10); (*, c2, b3, d4, …, d10) : count 2; (*, *, *, d4, .., d10): count 3. Question 2: (Half open questions, make sure your algorithm and assumptions are correct, no need to be very specific) Suppose a base cuboid has the following tuples:
A B C D Count Sales a1 b1 c1 d1 1 a1 b2 c2 d1 1 a1 b3 c1 d2 1 a2 b4 c1 d2 1 a2 b3 c2 d3 1 6 4 2 10 12

1, Show the representative steps to demonstrate how a complete data cube (with Count and SUM(Sales) as measures) is computed by the multiway array aggregation algorithm; Answer (from fang2): Suppose dimensions A, B, C, D are organized into 2, 4, 2, 3 partitions respectively. So in total there are 2*4*2*3 = 48 chunks. The cardinality of dimensions A, B, C, D is 2, 4, 2, 3 respectively, i.e. A and C have the smallest size, followed by D, and lastly B has

You May Also Find These Documents Helpful

  • Satisfactory Essays

    Acc/531 Week 4

    • 646 Words
    • 3 Pages

    1. A sales manager collected the following data on salespersons’ annual sales and years of experience.…

    • 646 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    b) In the range E40:E42 enter the costs $9.50, $14.50, and $18.50 (in that order).…

    • 724 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Chid Abuse

    • 829 Words
    • 4 Pages

    | On the Main St. worksheet, in cell B25, insert a function that will total all of the sales for Quarter 1 in the range B6:B23. Copy the function in cell B25 to the range C25:F25.…

    • 829 Words
    • 4 Pages
    Satisfactory Essays
  • Good Essays

    the sales in the corresponding cells of the three monthly worksheets. Use 3-D references in the…

    • 369 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Thank you for the opportunity to assess your sales data in order to provide recommendations for increasing your sales. The analysis and recommendations below are based on the data you provided, which covers a period from May 2004 through June 2006. The analysis below is based on this data alone. Therefore, our recommendations should be tempered by your knowledge of business realities and your market. Please let us know if we can answer any questions concerning the analysis or the recommendations provided.…

    • 741 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Datawearhousing

    • 1743 Words
    • 7 Pages

    (1) List the tuples in the complete data cube of R in a tabular form with 4 attributes,…

    • 1743 Words
    • 7 Pages
    Satisfactory Essays
  • Good Essays

    “If a Bag is purchased, a Blush is also purchased at that same transaction.” (“If Bag, then Blush.”) While Bag is antecedent, Blush represents consequent.…

    • 824 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Total sales | $ 39,720,000 | $ 42,620,000 | $ 45,520,000 | $ 38,560,000 |…

    • 991 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Molson Cors

    • 938 Words
    • 4 Pages

    Sales refer to the total of sales across all countries and products (Including not fully owned brands under licensing).…

    • 938 Words
    • 4 Pages
    Powerful Essays
  • Good Essays

    (total amount of sales data adds up to 29, then divided this total by the number of data sets, 30. 29 divided by 30, equals 0.96, which I rounded up to 1.)…

    • 1433 Words
    • 6 Pages
    Good Essays
  • Better Essays

    Xacc 280 Final: Coke/Pepsi

    • 1839 Words
    • 8 Pages

    The significance of the trend analyses on net sales and net income is that PepsiCo has been steadily increasing its sales over the past 5 years (for a total increase of $9 billion), since 2001. Clearly, they must have sound marketing and advertising strategies, in order to not just maintain their sales figures, but to maintain a growing increase each year. Up until 2005, they were performing equally admirably at increasing their net income as well. While their net income for 2005 is still an impressive 69.9% higher than in 2001, they went from 3 years of massive increases, to a slight decrease. Since the sales figures increased from 2004-2005, it’s not a question of having a “sales slump”. This means they must have increased expenses dramatically during 2005. We know this because net income is tabulated from the income statement, which consists of revenues and expenses. Logically, if sales have continued to increase, yet net income has fallen slightly, then there must have been a substantial…

    • 1839 Words
    • 8 Pages
    Better Essays
  • Satisfactory Essays

    Northwind Traders Memo

    • 614 Words
    • 3 Pages

    Thank you for the opportunity to assess your sales data in order to provide recommendations for increasing your sales. The analysis and recommendations below are based on the data you provided, which covers a period from May 2004 through June 2006. The analysis below is based on this data alone. Therefore, our recommendations should be tempered by your knowledge of business realities and your market. Please let us know if we can answer any questions concerning the analysis or the recommendations provided.…

    • 614 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    Assignment 3 P4 P5 M2

    • 1079 Words
    • 4 Pages

    Simple processing would be simply adding up the number of items sold by the business by a variable, such as the store location, the product or the time or date.…

    • 1079 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    term project

    • 639 Words
    • 7 Pages

    Thank you for the opportunity to assess your sales data in order to provide recommendations for increasing your sales. The analysis and recommendations below are based on the data you provided, which covers a period from May 2004 through June 2006. The analysis below is based on this data alone. Therefore, our recommendations should be tempered by your knowledge of business realities and your market. Please let us know if we can answer any questions concerning the analysis or the recommendations provided.…

    • 639 Words
    • 7 Pages
    Satisfactory Essays
  • Satisfactory Essays

    data driven approach

    • 269 Words
    • 1 Page

    The demarcation between Six Sigma and lean is very small. Both are used in the attainment of…

    • 269 Words
    • 1 Page
    Satisfactory Essays

Related Topics