A Multivariate Approach for the Analysis of Spatially Correlated Environmental Data

Powerful Essays
Journal of Environmental Informatics 5 (1) 9-16 (2005)

05JEI00041 1726-2135/1684-8799 © 2005 ISEIS www.iseis.org/jei doi:10.3808/jei.200500041

A Multivariate Approach for the Analysis of Spatially Correlated Environmental Data
A. Lamberti1* and E. Nissi2
2 1 ISTAT - Via C. Balbo, 16 - 00184 Roma, Italy Dipartimento di Metodi Quantitativi e Teoria Economica, Viale Pindaro, 42 - 65127 Pescara, Italy

ABSTRACT. The formulation and the evaluation of environmental policy depend upon a general class of latent variable models known as multivariate receptor models. Estimation of the number of major pollution sources, the source composition profiles and the source contributions are the main interests in multivariate receptor modelling. Many different approaches have been proposed both when the number of sources is unknown (explorative factorial analysis) and when the number and the type of sources are known (regression models). The objective of this work is to propose a flexible approach to the multivariate receptor models that incorporates the extra variability due to the spatial dependence. The method is applied to Lombardia air pollution data. Keywords: Covariance modelling, environmental data, latent variable models, multivariate receptor models, spatio-temporal modelling

1. Introduction
In the past few years interest in air quality monitoring has increased, specifically pertaining to the identification of pollution sources and their information needed to implement air pollution control programs. Since observing the quantity of various pollutants emitted from all potential pollution sources is virtually impossible, receptor models are used to analyze concentrations of pollutants or particles measured over time in order to gain insight concerning the unobserved pollution sources. Multivariate receptor modeling aims to identify the pollution sources and assess the amounts of pollution by resolving the measured mixture of chemical species into the



References: Anderson, T.W. (1984). An Introduction to Multivariate Statistical Analysis, 2nd Edition, John Wiley & Sons, New York, USA. Bartholomew, D.J. and Knot, M. (1999). Latent Variable Models and Factor Analysis, 2nd Edition, Oxford University Press, New York. Gleser, L.J. (1997). Some thoughts on chemical mass balance models. Chemom. Intell. Lab. Syst., 37, 15-22. Guttorp, P. and Sampson P.D. (1994). Methods for estimating heterogeneous spatial covariance functions with environmental applications, in G.P. Patil and C.R. Rao (Eds.), Handbook of Statistics XII: Environmental Statistics, Elsevier/North Holland, New York, pp. 663-690. Henry, R.C. (1987). Current factor analysis models are ill-posed. Atmos. Environ., 21, 1815-1820. Henry, R.C. (1997). History and fundamentals of multivariate air quality receptor models, Chemom. Intell. Lab. Syst., 37, 37-42. Henry, R.C., Park, E.S. and Spiegelman, C.H. (1999). Comparing a new algorithm with the classic methods for estimating the number of factors. Chemom. Intell. Lab. Syst., 48, 91-97. Henry, R.C. (2002). Multivariate receptor models: current practice and future trends. Chemom. Intell. Lab. Syst., 60, 43-48. Henry, R.C. (2003). Multivariate receptor modelling by N-dimensional edge detection. Chemom. Intell. Lab. Syst., 65, 179-189. 15 A. Lamberti and E. Nissi / Journal of Environmental Informatics 5 (1) 9 - 16 (2005) Hopke, P.K. (1991). An introduction to receptor modelling. Chemom. Intell. Lab. Syst., 10, 21-43. Hopke, P.K. (1997). Receptor modelling for air quality management, in R.E. Hester and R.M. Harrison (Eds.), Issues in Environmental Science, Issue 8, Royal Society of Chemistry, Cambridge UK, pp. 95-117. Hopke, P.K. (2003). Recent developments in receptor modelling. J. Chemom., 17, 255-265. Javitz, H.S., Watson, J.G., Guertin, J.P. and Mueller, P.K. (1988). Results of a receptor modelling feasibility study. J. Air Pollut. Control Assoc., 38, 661-667. Kim, E., Hopke, P.K., Paatero, P. and Edgerton, E.S. (2003). Incorporation of parametric factors into multilinear receptor model studies of Atlanta aerosol. Atmos. Environ., 37, 5009-5021. Loader, P.S. (1992). Spatial covariance estimation for monitoring data, in A. Walden and P. Guttorp (Eds.), Statistics in Environmental and Earth Sciences, Edward Arnold, London, pp. 52-70. Meiring, W., Sampson, P.D. and Guttorp, P. (1998). Space-time estimation of grid-cell hourly ozone levels for assessment of a deterministic model. Environ. Ecol. Stat., 5, 197-222. Nott, D.J., Dunsmuir, W.T.M., Speer, M.S. and Glowacki, T.J. (1998). Non-stationary Multivariate Covariance Estimation for Monitoring Data, Technical Report S98-14. Paatero, P. and Hopke, P.K. (2002). Utilizing wind direction and wind speed as independent variables in multilinear receptor modelling studies. Chemom. Intell. Lab. Syst., 60, 25-41. Paatero, P., Hopke, P.K., Hoppenstock, J. and Eberly, S.I. (2003). Advanced factor analysis of spatial distributions of PM2.5 in the eastern United States. Environ. Sci. Technol., 37, 2460-2476. Park, E.S., Henry, R.C. and Spiegelman, C.H. (1999). Determining the Number of Major Pollution Sources in Multivariate air Quality Receptor Models, NRCSE, TSR No.34. Park, E.S., Henry, R.C. and Spiegelman, C.H. (2000). Estimating the number of factors to include in a high-dimensional multivariate bilinear model. Commun. Stat., 29(B), 723-746. Park, E.S., Guttorp, P. and Henry, R.C. (2001). Multivariate receptor modelling for temporal correlated data by using MCMC. J. Am. Stat. Assoc., 96, 1171-1183. Park, E.S., Oh, M.S. and Guttorp, P. (2002). Multivariate receptor models and model uncertainty. Chemom. Intell. Lab. Syst., 60, 49-67. Sampson, P.D. and Guttorp, P. (1992). Nonparametric estimation of nonstationary spatial covariance structure. J. Am. Stat. Assoc., 87, 108-119. Spiegelman, C.H. and Dattner, S. (1993). Multivariate chemometrics, a case study: applying and developing receptor models for the 1990 El Paso winter PM10 receptor modelling scoping study, in G.P. Patil and C.R. Rao (Eds.), Multivariate Environmental Statistics, Elsevier Science publishers, New York, pp. 509-524. 16

You May Also Find These Documents Helpful

  • Powerful Essays

    Multivariate Data Analysis

    • 3047 Words
    • 13 Pages

    Introduction This document presents the regression analysis of customer survey data of Hatco, a large industrial supplier. The data has been collected for 100 customers of Hatco on 14 parameters. The 14 variables are as follows: * Perceptions of Hatco: This data was collected on a graphic measurement rating scale consisting of a 10cm line ranging from poor to excellent. Indicator | Variable | Description | X1 | Delivery speed | amount of time it takes to deliver the product once an order…

    • 3047 Words
    • 13 Pages
    Powerful Essays
  • Powerful Essays

    Chapter 1 Multivariate analysis refers to all statistical techniques that simultaneously analyze multiple measurements on individuals or objects under investigation. Factor analysis identifies the structure underlying a set of variables Discriminant analysis differentiates among groups based on a set of variables. All the variables must be random and interrelated in such ways that their different effects cannot meaningfully be interpreted separately. Nonmetric measurement scales Nominal…

    • 3526 Words
    • 15 Pages
    Powerful Essays
  • Satisfactory Essays

    multivariate analysis

    • 1395 Words
    • 5 Pages

    y4=dotting y2=form relations y5=sensory motor coordination y3=dynamometer y6= perseveration The data are recorded in the table below. Data analysis: Since these two dataset were independent and following normal distribution, but the variances are unknown. For this kind of case, we see 6 variables are measured on each sampling unit in two samples. First thing came into…

    • 1395 Words
    • 5 Pages
    Satisfactory Essays
  • Good Essays

    are caused by multiple phenotypes that are correlated. In many clinical or psychological settings, diagnoses are made through a set of binary phenotypes. Currently, there are several statistical methods available to analyze multiple phenotypes, such as comparing the results from single phenotype analyses, and performing multivariate analysis for quantitative phenotypes. These methods can be extended for related samples. The authors proposed new approach to jointly evaluate a set of binary phenotypes…

    • 885 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Materials 2 2.2 Methods 3 III Results 4 3.1 ANOVA 4 3.2 Multivariate analysis 4 IV Discussion 4 Conclusion 5 * Introduction 1 I Title, Abstract and Introduction 2 1.1 Objective of the study 2 1.2 Title 2 1.3 Abstract 2 1.4 Intoduction 2 II Materials and Methods 2 2.1 Materials 2 2.2 Methods 3 III Results 4 3.1 ANOVA 4 3.2 Multivariate analysis 4 IV Discussion 4 Conclusion 5 outpents.edu.ms outpents…

    • 1640 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Eleven Multivariate Analysis Techniques: Key Tools In Your Marketing Research Survival Kit by Michael Richarme Situation 1: A harried executive walks into your office with a stack of printouts. She says, “You’re the marketing research whiz—tell me how many of this new red widget we are going to sell next year. Oh, yeah, we don’t know what price we can get for it either.” Situation 2: Another harried executive (they all seem to be that way) calls you into his office and shows you three proposed…

    • 2223 Words
    • 9 Pages
    Good Essays
  • Satisfactory Essays

    data driven approach

    • 269 Words
    • 1 Page

    Importance of Data Driven Approach The demarcation between Six Sigma and lean is very small. Both are used in the attainment of positive results accurately and in a very short span of time. Six Sigma mainly concentrates on reducing process variation and enhancing process control, whereas lean drives out waste and promotes work standardization and flow. Six Sigma and Lean almost have the same general purpose of providing the customers with best quality possible, cost, delivery. Lean…

    • 269 Words
    • 1 Page
    Satisfactory Essays
  • Good Essays

    Sleep is directly correlated to the grades students receive on exams Abstract I investigate students attaining higher grades on exams when they receive proper sleep. I discuss 3 possible hypotheses for the reason of this occurrence: 1) more sleep contributes to greater capability in remembering, 2) rest is necessary for the brain to function at its prime, 3) staying awake and studying the night before leaves you with no energy during the actual exam. I will test hypothesis 2, which will include…

    • 1343 Words
    • 6 Pages
    Good Essays
  • Powerful Essays

    A data mining approach to analysis and prediction of movie ratings M. Saraee, S. White & J. Eccleston University of Salford, England Abstract This paper details our analysis of the Internet Movie Database (IMDb), a free, user-maintained, online resource of production details for over 390,000 movies, television series and video games, which contains information such as title, genre, box-office taking, cast credits and user 's ratings. We gather a series of interesting facts and relationships…

    • 4512 Words
    • 19 Pages
    Powerful Essays
  • Powerful Essays

    Environmental Analysis

    • 1040 Words
    • 5 Pages

    Environmental scanning is a strategy that the companies need to use in order to analyze new opportunities and changes. The information sources of environmental analysis data allow the companies to anticipate to events and plan for the future and anticipate changes. Comcast needs to understand the three main external environment components comprised of the remote , industry and the operating environment. The remote environment is made of external factors where these factors have a lot of control over…

    • 1040 Words
    • 5 Pages
    Powerful Essays