Bez tytulu slajdu - PowerPoint PPT Presentation

1 / 55
About This Presentation
Title:

Bez tytulu slajdu

Description:

Principles and applications: International Edition (6/e) ... Biserial correlation coefficient; Point-biserial correlation coefficient; Phi coefficient; ... – PowerPoint PPT presentation

Number of Views:38
Avg rating:3.0/5.0
Slides: 56
Provided by: kup2
Category:
Tags: bez | biserial | slajdu | tytulu

less

Transcript and Presenter's Notes

Title: Bez tytulu slajdu


1
Lecture 10
STRATEGIES OF TEST DEVELOPMENT
2
References
  • Murphy, K. R. Davidshofer, C. O. (1998).
    Psychological testing. Principles and
    applications International Edition (6/e). Upper
    Saddle River, N.J. Prentice-Hall, Inc. (CHAPTER
    11).

3
Strategy of test development - definition
The strategy of test development means a
well-planned and carefully realized procedure
aimed on maximalizing the validity of the test.
The basis of all strategies are questions
concerning existence of given psychological
phenomena.
4
Sorts of strategies
?Deductive (theoretical). ?Inductive
(internal). ?Criterion-oriented (external).
5
Aspects of strategy
  • The theoretical basis of the strategy
  • Crucial phase of the strategy
  • Psychometric model applied
  • Final psychological, diagnostical and
    psychometric properties of the test
  • The examples of the instruments

6
Theoretical basis of the strategy
?Deductive (theoretical) strategy is based on the
theory. This strategy previously was called as
rational. The goal of this strategy is
diagnosis of the traits, described by a
theoretical concept (traits are known from the
begining), so the whole procedure sould lead to
confirm theoretical expectations and instrument,
assessing a given set of psychological constructs.
7
Theoretical basis of the strategy
?Inductive (internal) strategy is based on
methodology. The goal of this strategy is
diagnosis of the traits, which explain a set of
behaviors - those traits will be known on the end
of this process). The whole procedure sould lead
to discovery of those set of traits and
instrument, which enables to assess them.
8
Theoretical basis of the strategy
?Criterion-oriented (external) strategy is based
on knowledge, concerning the psychological
syndrome related to a given criterion. The goal
of this strategy is diagnosis of the type of
personality (or syndrome of traits), which
diagnoses (or prognoses) of criterion. The whole
procedure sould lead to conform expectations
about this syndrom and instrument, which enables
to assess it.
9
Special assumptions of the external strategy
  • Syndrom of traits diagnoses (prognoses) the
    criterion.
  • Syndrom of traits may be assessed by the
    inventory.
  • Inventory is used to diagnose criterion and -
    indirectly - to identification of this syndrom of
    traits.

10
Crucial phase of the strategy
?Deductive (theoretical) strategy -
representative pool of the items, designed to
measure an a priori choosen traits. In this
strategy procedures of content validity
assessment are applied and items are originally
written out.
11
Crucial phase of the strategy
?Inductive (internal) strategy - very broad and
representative for all forms of human behaviors
the pool of items (otherwise some traits may not
be covered). In this strategy the banks of items
are applied (based on vocabularies or other
instruments, like Cattelian 16PF).
12
Crucial phase of the strategy
?Criterion-oriented (external) strategy - broad
and specific for the given syndrome only pool of
the items (otherwise some aspects of the syndrom
may not be covered or other syndromes will be
also diagnosed). In this strategy typically items
are borrowed from other instruments (like
clinical inventories from MMPI).
13
Psychometric model
?Deductive (theoretical) strategy item-total
correlation, leading to selection of the items,
which are the best measures of the assummed trait
(assessed by the whole set of the
items). ?Criterion-oriented (external) strategy -
item-criterion correlation, leading to selection
of the items, which are the best measures of the
criterion.

14
Psychometric model
?Inductive (internal) strategy exploratory
factor analysis, leading to selection (on the
basis of factor loadings) of the items, which are
the best measures of traits. Factor analysis is
used to discover the structure of traits and the
set of the items measuring those traits.
15
Final psychological properties of the instrument
?Deductive (theoretical) strategy promotes the
theory development or may lead to reformulation
(or even falsification) of the theory. ?Inductive
(internal) strategy may be the basis for the
new theory or leads to simple description of the
personality.
16
Final psychological properties of the instrument
?Criterion-oriented (external) strategy
enriches the knowledge about personality
syndromes and provides a very useful instruments,
but leads to theoretical chaos (thousands
personality constructs violence of the Ockham
razor rule).
17
Final diagnostic properties of the instrument
?Deductive (theoretical) strategy instrument
diagnoses traits from some domains of personality
(typically not the whole personality). ?Inductive
(internal) strategy instrument diagnoses basic
dimensions of personality (very general
traits). ?Criterion-oriented (external) strategy
the tool diagnoses the syndrome of traits (the
personality type).
18
Final psychometric properties of the instrument
?Inductive (internal) strategy scales are very
economical (short) with high reliability, very
high construct-oriented validity (MT-MM matrix,
factor analysis) and low criterion-related
validity (in the case of multiscales instrument
acceptable criterion-related validity of the
whole inventory).
19
Final psychometric properties of the instrument
?Criterion-oriented (external) strategy scales
of very high criterion convergent validity, but
unpredictable discriminant validity of other
criteria scales are not economical (very long)
with low reliability and low construct-oriented
validity (factor analysis).
20
Final psychometric properties of the instrument
  • ?Deductive (theoretical) strategy scales of
    intermediate properties Economical with
    sufficient (but not very high) reliability and
    criterion-related validity acceptable
    construct-oriented validity, especially
    convergent aspect, but weak discriminant one
    (MT-MM matrix, factor analysis).

21
Conclusions
  • Each strategy has its advantages and
    disadvantages for the specific purposes a given
    classical strategy may be applied even in modern
    psychology.
  • It is suggested a golden way of test
    development mixed or sequential strategies
    (which preserve the strong aspects and eliminate
    the weak points).

22
Lecture 11
THE PROCEDURE OF TEST DEVELOPMENT
23
References
  • Murphy, K. R. Davidshofer, C. O. (1998).
    Psychological testing. Principles and
    applications International Edition (6/e). Upper
    Saddle River, N.J. Prentice-Hall, Inc. (CHAPTER
    10-11).

24
The general properties of the procedure of test
development
  • The process of test developement is
    longlasting and contains several phases. For
    these reasons it should be very carefully planned
    and realized. Some phases of the procedure are
    common for all testst, but most of them are
    specific for the chosen strategy of test
    development.

25
Phases of test development
Phase I the choice of the strategy and the plan
of constructing the test. Phase II analysis of
the theoretical basis of the instrument
theoretical constructs, methodology or knowledge
about the syndrom and criterion. Phase III
process of generating of the items (forming the
pool of the items). Phase IV writing down the
test items.
26
The properties of the good personality test item
  • The good personality test item should be
  • Comprehensible
  • short one
  • kept in simple and positive grammatical form no
    negations and double negations, simple in
    grammatical form,
  • written using well-known expressions
  • in Polish female and male forms
  • Unambigous with regard to the content
  • Free from social desirability

27
The properties of the good personality item
  • The good personality test item should have
  • Extended format of options of the items
  • optimal number of cathegories of answers from 4
    up to 7
  • An appropriate scoring rule of the answer numbers
    with regular intervals
  • the best solution using natural numbers,
    anchored at zero

28
Phases of test development
  • Phase V - pilot testing of the items
  • Content validity (in the case of the theoretical
    strategy) and
  • comprehensibility,
  • ambiguity,
  • social desirability checked in the itemmetric
    analyses.

29
Phases of test development
Phase VI preparation of the test to empirical
studies Title of the test, instruction and
random order of the items in inventories, and
progressive order of the items with regard to
their difficulty in ability tests. Phase VII
investigations of the sample of examinees and
psychometric analyses of the data, leading to the
final form of the test.
30
Sample requirements
  • Sample size - sufficiently large 5-10 examinees
    per item (psychometrics does not tolerate small
    samples).
  • Sample composition Heterogenous with regard to
    gender, age, education level, job, region of
    residence, etc.).
  • Number of samples At least two fully comparable
    samples, designed to cross-validation

31
Item analysis a definition and indices
Item analysis psychometric analysis, which
comprises item difficulty, item-total correlation
(discriminative power) and external validity of
the item. Item-total correlation indicator of
the correlation between the item and the test as
a whole indicates how well the item measures
the trait as the whole test does.
32
Item analysis a definition and indices
Item difficulty proportion of the correct
answers on the item (or the mean of item scores
in relation to maximal possible score in the
scale of answers). External validity of the item
correlation between criterion and the item
indicates how well the item diagnose (or
prognose) the criterion.
33
The item-total correlation
  • Biserial correlation coefficient
  • Point-biserial correlation coefficient
  • Phi coefficient
  • Corrected-item total correlation (CITC).

34
Corrected item-total correlation coefficient
Corrected item-total correlation (CITC)
coefficient obtained by analysis of variance (as
simple effect) it indicates correlation between
the item and total test score without the score
of given item (there is no auto-correlation
effects) ? correlation between the item and the
sum of other items in the scale.
35
External validity and factor validity indices
External validity correlation with criterion
(Pearsons coefficient or phi coefficient). Facto
r validity of the item factor loading on own
factor and other factors.
36
Phases of test development
Phase VIII test norms and publications of the
test (handbook). Phase IX revision of the test.
37
The content of the test handbook
  • Theoretical basis of the test
  • Procedure of test development
  • Reliability and validity indices (with sample
    characteristics)
  • The description of test administration and rules
    of scoring and interpreting the test scores
    (standarization and objectivity)
  • Norms (tables of norms).

38
Revision of the test
The test should be revised after some years of
use - the new form of the test should be
developed. The revision should be done on the
items level and the test test as a whole. The
revision should be done after about 20 years of
use the test in psychological practice, due to
the changes in culture or weak points of the test
(constructor has a results from many studies,
necessary to conclusions about desired test
properties).
39
Example development of the FCB-TI
  • Stage I
  • Regulative Theory of Temperament separaction
    and analysis of energetic and temporal
    characteristics of temperament
  • Stage II
  • Generation of items (students, N15
    psychologists, N5)
  • Writing the items introductory pool of 600
    items
  • Stage III
  • Linguistic assessment of items (students, N30)
    linguist)

40
Example development of the FCB-TI
  • Stage IV.
  • Evaluation of content validity
  • Sorting the items into scales (experts, N12)
  • Evaluation of the items within scales (experts,
    N9)
  • Sorting the items into subscales (experts, N12)
  • ? reduction to 392 items

41
Example development of the FCB-TI
  • Stage V.1
  • Evaluation of formal characteristics of the
    items itemmetric analyses (N334)
  • reduction to 381 items
  • Stage V.2
  • Research on preferred format of answers (N334)

42
Example development of the FCB-TI
  • Stage VI.
  • Final preparation works on the inventory title,
    instruction, items order 381 items
  • Stage VII.
  • Administration of the questionnaire 2 samples
    development sample (1012) test sample (1011)

43
Example development of the FCB-TI
  • Stage VIII. Psychometric studies on the inventory
  • Construction of subscales basing on corrected
    item-total correlation
  • Factor analysis separate for energetic and
    temporal characteristics
  • Construction of scales reduction from 12 to 6
    scales, selection of best items with regard to
    corrected item-total correlation
  • ? Reduction to 120 items (6 scales)

44
Example development of the FCB-TI
  • Stage IX.
  • Verfication of psychometric properties of the
    questionnaire
  • Distributions of test scores
  • Internal consistency 5 samples 1?
    N828, 2 ?N475, 3 ?N527, 4? N392, 5?
    N407)
  • Temporal stability (with 2 weeks 6 months
    interval

45
Example development of the FCB-TI
  • Stage X.
  • Evaluation of test validity
  • 1.
  • 2.
  • 3.
  • 4.
  • 4.
  • 5.
  • 7.

46
Schematyczna procedura konstrukcji inwentarza
Formalna Charakterystyka Zachowania -
Kwestionariusz Temperamentu (FCZ-KT)
  • Badanie trafnosci pomiaru FCZ-KT
  • 1. Analiza porównawcza diagnozy cech temperamentu
    za pomoca samoopisu i szacowania
  • 2. Analiza porównawcza diagnozy cech temperamentu
    za pomoca samoopisu i metod laboratoryjnych
    (wybrane wskazniki psychofizyczne i
    psychofizjologiczne)
  • 3. Analiza relacji cech temperamentu do innych
    biologicznych wymiarów osobowosci
  • 4. Badanie zwiazków pomiedzy cechami temperamentu
    a zdolnosciami
  • 5. Analiza funkcjonalnego znaczenia temperamentu
  • cechy temperamentu a adaptacja zawodowa i stan
    zdrowia
  • 6. Analiza relacji pomiedzy struktura
    temperamentu a stylami zachowania wiazacymi sie z
    ryzykiem choroby psychosomatycznej
  • 7. Badanie struktury temperamentu u osób chorych

47
Example development of the FCB-TI
  • Stage XI.
  • Normalisation
  • Stage XII.
  • Preparation of test manual

48
Assumptions of the Classical Test Theory and Item
Response Theory
The relationship between the item score and the
test score is Linear Classical Test Theory
(might be assessed by the simple
correlations) Curvilinear Item Response
Theory (might be assessed by the curvilinear
regression).
49
Fig. 28. The item characteristic curve (ICC)
50
Parameters of the item characteristics curve
  • Parametr of difficulty
  • Parametr of discrimination
  • Parametr of quessing
  • Parametr of carrelessness

51
Parameters of the item characteristics curve
  • Difficulty - intensity of the trait, necessary
    to have a chance higher than 0.50 for correct
    answer
  • Discrimination - the slope of the curve, near
    by the difficulty
  • Guessing - the difference of probability from
    zero among low test scorers

52
Parameters of the item characteristics curve
  • Carrelessness - the difference of probability
    from one among high test scorers.
  • Typically are two-parameter models are applied
    (difficulty and discrimination).

53
Fig. 29. ICCs for different items
54
Fig. 30a. ICC for item no 35 from ER scale
(FCB-TI inventory)
55
Fig. 30b. ICC for item no 102 from ER scale
(FCB-TI inventory)
Write a Comment
User Comments (0)
About PowerShow.com