Tackling over-dispersion in NHS performance indicators

1 / 31

About This Presentation

Title:

Tackling over-dispersion in NHS performance indicators

Description:

Tackling over-dispersion in NHS performance indicators. Robert Irons ... Options for tackling the problem. Our solution an additive random effects model ... – PowerPoint PPT presentation

Number of Views:48

Avg rating:3.0/5.0

Slides: 32

Provided by: rir6

more less

Transcript and Presenter's Notes

Title: Tackling over-dispersion in NHS performance indicators

1
Tackling over-dispersion in NHS performance
indicators

Robert Irons (Analyst Statistician)
Dr David Cromwell (Team Leader)

20/10/2004
2
Outline of presentation

NHS Star Ratings Model
Criticism of some of the indicators
The reason overdispersion
Options for tackling the problem
Our solution an additive random effects model
Effects on the ratings indicators

3
Performance Assessment in the UK

1990s Government focused on efficiency
1997 Labour replaces Conservative government
Late 90s Labour focus on quality efficiency
Define Performance Assessment Framework
Publish NHS Plan in 2000
Commission for Health Improvement (CHI) created
Performance ratings first published in 2001,
responsibility passed to CHI for 2003 publication
Healthcare Commission replaces CHI on April 2004,
has broader inspection role

4
NHS Performance Ratings

An at a glance assessment of NHS trusts
performance
Performance rated as 0, 1, 2, or 3 stars
Yearly publication
Focus on how trusts deliver government priorities
Linked to implementation of key policies
Priorities and Planning framework
National Service Frameworks
Have limited role in direct quality improvement
Modernisation agency helps trusts with low rating

5
Scope of NHS ratings
2001 2002 2003 2004
Acute trusts ? ? ? ?
Ambulance trusts ? ? ?
Mental health trusts ? ?
Primary care trusts ? ?
6
The ratings model

Overall rating derived from many different
indicators
and affected by Clinical Governance Reviews
Two types of indicators, organised in 4 groups
Key targets Balanced Scorecard indicators
BS indicators grouped into 3 focus areas
Patient focus, clinical focus, capacity
capability

7
Combining the indicators

Indicators are measured on different scales
Categorical (eg. Yes/No)
Proportional (eg. proportion of patients waiting
longer than 15 months)
Rates (eg. mortality rate within 30 days
following selected surgical procedures)
Further complication
Performance on some indicators is measured
against published targets define thresholds
Performance on other indicators is based on
relative differences between trusts

8
Combining the indicators

Indicators first transformed so they are all on
an equivalent scale
Key targets assigned to three levels
achieved
under-achieved
significantly under-achieved
Balanced scorecard indicators
1 significantly below average (worst
performance)
2 below average
3 average
4 above average
5 significantly above average (best
performance)

9
Transforming the indicators

Key target indicators transformed using
thresholds defined by government policy
Balanced scorecard indicators transformed via
several methods
Percentile method
Statistical method
Absolute method, if policy target exists
Mapping method (for indicators with ordinal
scales)

Trust type Trust type Trust type Trust type
Acute trusts Ambulance trusts Mental health trusts Primary care trusts
Percentile 11 3 9 11
Statistical 12 8 9 11
Absolute 8 3 5 4
Defined mapping 4 5 8 7
10
Transforming the indicators- the statistical
method
Trust type Trust type Trust type Trust type
Indicators Acute trusts Ambulance trusts Mental health trusts Primary care trusts
Clinical indicators 4 2
Patient survey 5 5 4 5
Staff survey 3 3 3 3
Change in rate indicators 3
11
The old statistical method

Based on simple confidence intervals
95 and 99 confidence intervals calculated for a
trusts indicator value
Trust confidence interval compared with the
overall national rate (effectively a single point)

12
The old statistical method- problematic

Not a proper statistical hypothesis test
Differentiating between trusts based on
differences that exceed levels of sampling
variation
On some indicators, this led to the assignment of
too many NHS trust to the significantly good/ bad
bands on some indicators

13
Working example- standardised readmission rate
of patients within 28 days of initial discharge
Significantly below average Below average Average Above average Significantly above average Total
32 6 40 13 49 140
14
Readmissions within 28 days of discharge- funnel
plot (2003/04 data)
15
Mortality within 30 days of selected surgical
procedures- funnel plot (2003/04 data)
16
Z scores

Standardised residual
Z scores are used to summarise extremeness of
the indicators
Funnel plot limits approximate to the naïve Z
score
Naïve Z score given by
Zi (yi t)/si
Where yi is the indicator value, and si is the
local standard error

17
Dealing with over-dispersion

Three options were considered
Use of an interval null hypothesis
Allow for over-dispersion using a multiplicative
variance model
or a random-effects additive variance model

18
Interval null hypothesis

Similar to the naïve Z score or standard funnel
limits
Uses a judgement of what constitutes a normal
range for the indicator
Define normal range (eg percentiles, national
rate x)
Funnel limits then defined as
Upper/ lower limit Range limit (x si0)
Reduces number of significant results
But might be considered somewhat arbitrary
Interval could be defined based on previous
years data, or prior knowledge
Makes minimal use of the sampling error

19
Interval null hypothesis-a funnel plot
20
Multiplicative variance model

Inflates the variance associated with each
observation by an over-dispersion factor (? )
? Zi2 Pearson X2
? X2 / I
Limits on funnel plot are then expanded by ? ?
Do not want ? to be influenced by the outliers we
are trying to identify
Data are first winsorised (shrinks the extreme
z-values in)
Over dispersion factor could be provisionally
defined based on previous years data
Statistically respectable, based on a
quasi-likelihood approach

21
Multiplicative over-dispersion-a funnel plot
(not winsorised, ? 21.45)
22
Multiplicative over-dispersion-a funnel plot
(10 winsorised, ? 13.97)
23
Winsorising

Winsorising consists of shrinking in the extreme
Z-scores to some selected percentile, using the
following method.
Rank cases according to their naive Z-scores.
Identify Zq and Z1-q, the (100q) most extreme
top and bottom naive Z-scores, where q might, for
example, be 0.1
Set the lowest (100q) of Z-scores to Zq, and
the highest (100q) of Z-scores to Z1-q. These
are the Winsorised statistics.
This retains the same number of Z-scores but
discounts the influence of outliers.

24
Winsorising
Non winsorised

Winsorising

10 winsorised
25
Random effects additive variance model

Based on a technique developed for meta-analysis
Originally designed for combining the results of
disparate studies into the same effect
In meta-analysis terms, consider the indicator
value of each trust to be a separate study
Essentially seeks to compare each trust to a
null distribution instead of a point
Assumes that Eyi ?i, and V?i
Uses a method-of-moments method to estimate
(Dersimonian and Laird, 1986)
Based on winsorised estimate of ?

26
Random effects additive variance model

If ( I ? ? ) lt ( I 1) then
the data are not over-dispersed, and 0
use standard funnel limits/ naïve Z scores
Otherwise
Where wi 1 / si2
The new random-effects Z score is then calculated
as

27
Comparing to a null distribution
28
Additive over-dispersion-a funnel plot (20
winsorised)
29
Effects on the banding of trusts- Readmissions
2002/03 data
Significantly below average Below average Average Above average Significantly above average
Previous banding method 32 6 40 13 49
Random-effects (20 winzorised) 3 9 101 21 6
30
Why we chose the additive variance method

Generally avoids situations where two trusts
which have the same value for the indicator get
put in different bands because of precision
A multiplicative model would increase the
variance at some trusts more than at others
e.g. a small trust with large variance would be
affected much more than a large trust with small
variance
By contrast, an additive model increases the
variance at all trusts by the same amount
Better conceptual fit with our understanding of
the problem, that the factors inflating variance
affect all trusts equally, so an additive model
is preferable

31
References DJ Spiegelhalter (2004) Funnel plots
for comparing institutional performance.
Statistics in Medicine, 24, (to appear) DJ
Spiegelhalter (2004) Handling over-dispersion of
performance indicators (submitted) R DerSimonian
N Laird (1986) Meta-analysis in clinical
trials. Controlled Clinical Trials,
7177-188 Acknowledgements David
Spiegelhalter Adrian Cook Theo Georghiou Thank you

Write a Comment

User Comments (0)