Reliability Exercise

This test of novel problem solving is a measure of fluid intelligence (Doubleday, King, & Papageorgiou, 2002). People’s ability to solve novel problems is a stable characteristic, as it is largely genetically determined (Nairne, 2009). Test-retest reliability is typically appropriate for measures of stable attributes, but the novel nature of this test makes it an inappropriate technique here. In effect, the test’s novelty diminishes after the initial administration, producing difficulties due to practice effects, reactivity, or both. Furthermore, because the test has just 20 questions, examinees can more easily remember a significant portion of its items and either recall their answers during the retest or seek out the answers during the interval, resulting in spurious score improvements (Yu, 2005). Because the precise influence of any one factor cannot be discerned, a test-retest coefficient is difficult to interpret, and with more appropriate reliability measures available, temporal stability should not be used for this test.
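The difficulty of interpreting a test-retest coefficient can be illustrated with a minimal sketch. It assumes the coefficient is computed as a Pearson correlation between the two administrations, which is the conventional choice but is not stated in the sources above. The scores are hypothetical values out of 20, invented purely for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when scores do not vary")
    return sxy / sqrt(sxx * syy)


# Hypothetical scores out of 20 for five examinees (illustrative only).
first = [8, 11, 13, 15, 17]

# Case 1: every examinee gains 2 points through practice.
retest_uniform = [score + 2 for score in first]

# Case 2: some examinees remembered or looked up answers, others did not.
retest_uneven = [14, 12, 13, 19, 17]

r_uniform = pearson_r(first, retest_uniform)
r_uneven = pearson_r(first, retest_uneven)
mean_gain_uniform = sum(retest_uniform) / 5 - sum(first) / 5
```

In the first case the coefficient stays at its maximum even though every score has changed, so the practice effect is invisible to the coefficient. In the second case the coefficient falls, but the number alone does not reveal whether memory, answer seeking, or ordinary measurement error is responsible.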
Alternate-forms reliability eliminates some of the reactivity associated with test-retest, but it is nonetheless an inappropriate reliability measure for this test because of the possible carryover effects of strategy. Even when the content of each specific item is novel or unfamiliar, examinees may accustom themselves to the test’s style and subsequently apply the principle used to solve one problem to another (Groth-Marnat, 2009). Truly equivalent forms are difficult to develop in any case; combined with the increasing difficulty of this test’s items, and assuming that no two items are the same, this makes generating a reliable alternate form unfeasible.
This test’s dichotomous scoring protocol is designed to assess problem-solving ability objectively, with each question answered either correctly or incorrectly. Such a standardised procedure in itself largely eliminates subjective influence, and assessing inter-rater
