Preview

Analysis of Loan Data

Good Essays
Open Document
Open Document
523 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Analysis of Loan Data
Analysis of Loan data and relationship with various factors
Introduction
As we all know the history of loans as old as the history of money. Earlier there used to be different mechanism of lending money and recovering it. In simple terms it was the process in which the people who have more money than they required used to give money to people who didn’t had enough. Over the years with the evolution of economics the loan process became extremely important for the people who made business out of it. They used to give loans to people who needed it. But there was always a risk of person defaulting on the loan. For this reason before giving the loan the companies analyses various factors such as the credit history of the borrower, loan period, interest rates, income source etc. in order to prevent any default. In this assignment we are trying to find relation between the interest rates and the various factors like amount, loan length, debt to income ratio, monthly income, FICO score etc.
Methods
Data Collection
The data was collected from the link https://spark-public.s3.amazonaws.com/dataanalysis/loansData.csv provided on the coursera page. The data was downloaded on 14th February 2013 using R software
Exploratory Analysis
Exploratory analysis on the data was done by examining table and plotting the data. The exploratory analysis was used to clean the data and determine factors to be used for the linear regression model. The cleaning of data involved removing inconsistent metrics like year/years, removing percentage signs an converting from factors to numeric for the purpose of regression analysis.
Statistical Modelling
A standard model of multiple linear regressions was built using the R software to check and determine the relationship between the outcome variable and the various factors. Coefficients were calculated and the significance was checked using the P value. The R square value and ANOVA table construction helped in interpreting the result.
The



References: 1. Coursera Data Analysis Course page (https://class.coursera.org/dataanalysis-001/lecture/index) 2. R Tutorial Website (http://www.r-tutor.com) 3. Business Statistics by Levin and Rubin, Pearson Publications

You May Also Find These Documents Helpful

  • Powerful Essays

    Dermaplus Analysis

    • 1521 Words
    • 7 Pages

    The regression analysis provided by Selwyn based on the data collected is statistically significant greater than a 99% confidence level based on the p-value with a high correlation between the dependent and independent variables at 95%…

    • 1521 Words
    • 7 Pages
    Powerful Essays
  • Satisfactory Essays

    11. Described the method of data analysis used in this research. (100 -150 words- 10 marks)…

    • 699 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    * Consumer loans are those loans which are required by a person for their personal needs.…

    • 897 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Colonial Broadcasting

    • 3627 Words
    • 15 Pages

    For a detailed description of the variables and the defined statistical terms used in this report, see [ Annex 1 ]. Based on the sample data provided and the statistical analysis, the following regression equation has been derived:…

    • 3627 Words
    • 15 Pages
    Powerful Essays
  • Good Essays

    Student Loan Research

    • 266 Words
    • 2 Pages

    Students could also get pursued by aggressive collection agencies and this could lower their credit score.…

    • 266 Words
    • 2 Pages
    Good Essays
  • Better Essays

    Jet Blue

    • 2688 Words
    • 12 Pages

    Peoples had accumulated assets of $556m. These assets were funded by short term consumer deposits, consisting largely of 3-month fixed rate savings certificates. These savings certificates were highly affected by interest rate fluctuations. The long term loans provided to people generate interest earnings which are do not increase or decrease with the interest rate fluctuations. Therefore, there was a mismatch between the interest rates earned by the bank and the interest rates that it had to give out. This caused large losses over the period 1979-1982 when interest rates rose.…

    • 2688 Words
    • 12 Pages
    Better Essays
  • Satisfactory Essays

    Multiple Regression

    • 742 Words
    • 3 Pages

    Regression Analysis Regression Statistics Multiple R 0.880534596 R Square 0.775341175 Adjusted R Square 0.693647057 Standard Error 1184.124723 Observations 16 ANOVA df Regression Residual Total Significance SS MS F F 4 53230058.9713307514.749.4907833180.001427993 11 15423664.971402151.361 15 68653723.94…

    • 742 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    Strengths Of Abortion

    • 1877 Words
    • 8 Pages

    Throughout the studies, a number of investigators implemented thorough and practical approaches to determine reproductive outcomes following an abortion. Since a majority of the studies are longitudinal and based on factual, reproductive and obstetric data and history, the information obtained regarding the women participants is straightforward and clear. This allows it to be easily measured, coded, and compared with other variables to identify potential correlations. In addition, this also decreases (or eliminates) the researchers individual interpretation and potential bias throughout the studies since the fixed data is being utilized. In analyzing the data, a number of the statistical analysis procedures employed throughout the research include the following:…

    • 1877 Words
    • 8 Pages
    Powerful Essays
  • Satisfactory Essays

    For a detailed description of the variables and the defined statistical terms used in this report, see [ Annex 1 ]. Based on the sample data provided and the statistical analysis, the following regression equation has been derived:…

    • 262 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    The data were expressed as the means±SEM. The differences between the control and treatment groups were tested by ANOVA followed by the Tukey post-hoc test, using SPSS 13.0 software. The probability of p<0.05 was considered to show considerable differences for all comparisons…

    • 3561 Words
    • 15 Pages
    Powerful Essays
  • Powerful Essays

    mamamam

    • 2532 Words
    • 11 Pages

    The choice of statistical analysis is a very important work. The right choice of analytical method should not only help solve the issue to be analysed, but should also have strong robustness. The choice of statistical analytical method is dependent upon mainly three factors, namely the scale of data, the research design and the fundamental assumptions (Nargundkar, 2008, p. 120). The scale f data can also be understood as the type of data, which is usually categorized into categorical and quantitative data. Categorical data can only be analyzed through univariate analysis and some specific types of bivaraite analysis, for example the chi-square analysis. Therefore, research question 3, 5, 6 could not be approached by using multivariate analysis since all of them only involves categorical variables. The second factor that affects the choice of analytical methods is the research design. As it is suggested, univariate analysis is only used to analyse the feature of one simple variable, while bivariate analysis and multivariate analysis are applied to analyse the relationship between two variables or among more than two variables (Rosenthal, 2012, p. 6). Therefore, research question 1 and 2 should be approached through univariate analysis, while research question 3 to 9 should be approached through bivariate…

    • 2532 Words
    • 11 Pages
    Powerful Essays
  • Good Essays

    In the next stage of scientific data analysis, test of significance was performed with 90% confidence level to understand the significance of relationship between independent and dependent variables. These tests of significance (T-test and F-Test) are followed by Regression model building approach to understand in what proportion each of these potential variables are impacting dependent (target) variables. At this stage different permutation and combination of variables were performed to understand the relationship and impact.…

    • 980 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Credit Analysis

    • 7225 Words
    • 29 Pages

    This module will review the key components of fundamental analysis of a borrower’s creditworthiness. The emphasis will be on the analysis of the income statement,…

    • 7225 Words
    • 29 Pages
    Powerful Essays
  • Good Essays

    Chapter 1

    • 510 Words
    • 3 Pages

    This Chapter presents the following essential elements of the study: the introduction, which contains the rationale (an explanation of the reason for the conduct of study); the literature review and statistical foundations; the statement of the general and specific problems; the scope and delimitation which identifies the major variables, sub-variables and indicators; the significance of the study which enumerates the beneficiaries of the study and the corresponding benefits each will receive; and lastly, the notations…

    • 510 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Women Empowerment in India

    • 1821 Words
    • 8 Pages

    Objectives: (i) To equip the students with techniques of data analysis. (ii) To grasp the various…

    • 1821 Words
    • 8 Pages
    Satisfactory Essays