I worked for a logistic company. My major responsibility was in charge of the storage and transportation of parts of cars between two areas, which are about 1400 miles apart. One of my jobs is collecting the goods from suppliers and arranging the trucks to deliver them. There are five truck drivers, and each of them is assigned to deliver on each weekday throughout a whole year. Before the delivery, we will check the quality of the goods and Make sure that there are no damaged goods. When arriving at the destination, the staff will check the goods again and record the damaged goods that occurred in transit. At the end of every month, we will pay for the compensation according to the number of the defective goods. In order to reduce the number of the damaged goods during the delivery, I want to identify the reasons why they are damaged. In this study, I want to find out that whether some drivers are more prone to make the goods damaged during their delivery.
Variable to be measured:
Two variables are to be measured. The first variable is just the five truck drivers, and the second one is the quality of the goods after the delivery.
Determination of Population:
Population in this case is defined as the all goods delivered from Tianjin area to Guangzhou area.
To analyze relationship between the two variables above which are both nominal in terms of data type, I decide to use Chi-squared test of a contingency table.
The information about delivery is recorded in our computer system, including the delivery date, name of the driver, the number of damaged goods and so on. I take out the data about 52 weeks during the previous year and record them into the following table:
| Quality| |
Truck Driver| Passed| Damaged| Total|
Driver A| | | |
Driver B| | | |
Driver C| | | |
Driver D| | |...