Title:
1Thinking About Longitudinal Data
- (or Cross-sectional v Longitudinal Analysis)
2A glib claim that longitudinal data analysis is
important because it permits insights into the
processes of change is inadequate and certainly
fails to convince many social science researchers
who are concerned with substantive rather than
methodological challenges...
3What is required is an understanding on the
limitations of cross-sectional analysis.
4Cross-sectional V Longitudinal Data
5Four Main Issues
- Age and Cohort Effects
- Direction of Causality
- State Dependence
- Residual Heterogeneity
6Age And Cohort Effects
7Should I buy a new car?
8Owners Experience Of Car Reliability Over The
Last Twelve Months - Specific Model
9What happens when these cars get older (ageing
effects)?
10Owners Experience Of Car Reliability Over The
Last Twelve Months - Specific Model
11The manufacture tells me that there is a cohort
effect The more recently made cars are now much
more reliable than the ones made five years
ago.Could this be true?
12Owners Experience Of Car Reliability Over The
Last Twelve Months - Specific Model
13- Cross-sectional data are completely
uninformative as to whether age or cohort effects
(or a combination of each) provide correct
explanations. We would need longitudinal data to
find out!
14Direction Of Causality
15There is unequivocal evidence from
cross-sectional data that, overall, the
unemployed have poorer health.
16This is consistent with botha) unemployment
causing ill health andb) ill health causing
unemployment
17Ill Health
?
Unemployment
18Ill Health
Unemployment
19If we had a cross-sectional survey that asked how
long people had been unemployed and also their
level of health, generally, we would find a
negative relationship.
20Negative Lower levels of health for people who
had been unemployed for longer.
21This is consistent with a) unemployment causing
ill health
Ill Health
Unemployment
22HOWEVER.
23If ill health causes unemploymentthen people
with comparatively modest levels of ill health
will tend to recover more quickly and return to
work.
24This is consistent with b) ill health causing
unemployment
Ill Health
Unemployment
25 With the increasing duration of unemployment
those with less severe ill health will be
progressively under represented while those with
more severe ill health will be over represented.
26This is known as asample selection bias and
could therefore explain the cross-sectional
picture of declining ill health with duration of
unemployment.
27It is not possible to untangle this conundrum
with cross-sectional data.Longitudinal data are
required!
28 State Dependence
29Past Behaviour
Current Behaviour
30Young People Aged 19
APRIL
MAY
Employed
Unemployed
Employed
31Residual Heterogeneity(Omitted Explanatory
Variables)
32- The advantage of longitudinal data over
cross-sectional data is that it not only
facilitates analysis between cases but also
facilitates analysis within cases.
33A simplified view of a difficult concept!
34CROSS-SECTIONAL EXPLANATORY VARIABLES
OUTCOME
Person A
Person B
35My two hypothetical identical twin daughters
The Gayle sisters.
36CROSS-SECTIONAL EXPLANATORY VARIABLES
OUTCOME A
WENDY VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
BELOWNA VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
37This is often called between cases analysis.
38There is no way of accounting for omitted
explanatory variables in cross-sectional analysis.
39CROSS-SECTIONAL EXPLANATORY VARIABLES
OUTCOME A
WENDY VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
BELOWNA VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
40LONGITUDINAL EXPLANATORY VARIABLES
TIME POINT 1
TIME POINT 2
Person A
Person A
Person B
Person B
41This is often called within cases analysis.
There are techniques for accounting for omitted
explanatory variables if we have data at more
than one time point.
42TIME 2
OUTCOME A
WENDY VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
OUTCOME B
BELOWNA VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
43HOW GOOD HAVE THE EXPLANATORY VARIABLES BEEN AS
FAR AS HELPING US TO UNDERSTAND THE OUTCOME?
44TIME 2
WENDY VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
OUTCOME
Unexplained
BELOWNA VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
Unexplained
45RESIDUAL HETEROGENEITY
WENDY VARIABLE A1 VARIABLE B1 VARIABLE
C2 VARIABLE D2
Omitted Explanatory Variables
Unexplained
Unmeasured or Unmeasurable Variables
46- It is sometimes claimed that the main advantage
of longitudinal data is that it facilitates
improved control for the plethora of variables
that are omitted from any analysis.
47- Because surveys fail to capture the detailed
nature of social life there is, almost
inevitably, considerable heterogeneity in
response variables even amongst respondents that
share the same characteristics across all of the
explanatory variables.
48- The possibility of substantial variation between
similar individuals due to unmeasured and
possibly unmeasureable variables is known as
residual heterogeneity.
49BEWARE
- We can begin to see why cross-sectional analysis
might incorrectly estimate the effects of
explanatory variables, and therefore result in
misleading conclusions being drawn.
50- Cross-sectional V Longitudinal
-
- 0 4
51Four Main Issues
- Age and Cohort Effects
- Direction of Causality
- State Dependence
- Residual Heterogeneity
52THINKING ABOUT CHANGE
- COHORT A common group being studied.
- AGE Amount of time since cohort was
constituted. - PERIOD Moment of observation.
53THREE YOUTH COHORT STUDIES
AGE 16 17 18 19 20 21 (COHORT 1) AGE 16 17 18 19
(COHORT 2) AGE 16 17 (COHORT 3)
We can study the effects of age or ageing.
54THREE YOUTH COHORT STUDIES
AGE 16 17 18 19 20 21 (COHORT 1) AGE 16 17 18 1
9 (COHORT 2) AGE 16 17 (COHORT 3)
We can study the effects of cohort.
55THREE YOUTH COHORT STUDIES
AGE 16 17 18 19 20 21 (COHORT 1) AGE 16 17 18 1
9 (COHORT 2) AGE 16 17 (COHORT 3)
Period of high unemployment
Period of low unemployment
We can study the effects of period.
56Beware Age, Cohort and Period effects are often
very hard to untangle See the relevant
literature to become frightened and confused!
57Longitudinal data are not a panacea there are
problems