Preview

Advance Analytics Internship Coding Challenge Sai Charan Thotapalli

Satisfactory Essays
Open Document
Open Document
958 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Advance Analytics Internship Coding Challenge Sai Charan Thotapalli
Advance Analytics Internship Coding Challenge
Sai Charan Thotapalli
01/25/2015

Data description
First, is need to know the amount of information this analysis will involve, in this section a general review of data.
Number of rows, this mean the number of observations to be analysed. 21,061 observations are found.
## [1] 21061

Number of columns, this mean the number of variables to be analysed.
## [1] 12

The original names of the variables.
## [1]
## [4]
## [7]
## [10]

"day"
"platform"
"orders"
"add_to_cart"

"site"
"visits"
"gross_sales"
"product_page_views"

"new_customer"
"distinct_sessions"
"bounces"
"search_page_views"

Data dictionary.
Data dictionary is a set of information to explain the variables that are up to be analysed.
Where variables can be explained.













day | The calendar day. site | Company site visited by users. new_customer | 0 = returning customer; 1 = new customer; null = neither platform | The type of device used by a website visitor visits | The number of distinct website visits; 1 session may have multiple visits distinct_sessions | The number of distinct website visitors; 1 session may have multiple visits orders | The number of website orders gross_sales | The total gross sales for website orders bounces | The number of visits that only viewed one page add_to_cart | The number of visits that added a product to cart product_page_views | The number of product pages viewed search_page_views 1 The number of search pages viewed

Exploratory data analysis
First we explore relevant data, company site visited by users is described by next table:
##
##

Acme
7392

Botly Pinnacle
804
5725

Sortly
5532

Tabular Widgetry
804
804

Acme site, result to be the more visited followed by Pinnacle and Sortly.
In platforms according to next table, the most visitors users use iOS, followed by Android devices and Windows systems. In following figure the missing platform is cause of missing data of databe origin.
##
##
##
##
##
##

410

You May Also Find These Documents Helpful