Title: Coding the Data: Creating Codebooks
1Coding the Data Creating Codebooks
2Stages in the Research Process
Formulate Problem
Determine Research Design
Determine Data Collection Method
Design Data Collection Forms
Design Sample and Collect Data
Analyze and Interpret the Data
Prepare the Research Report
3Coding
- The process of transforming raw data into symbols
(usually numbers) that can be utilized for
analysis.
4Example of Likert Scale
Neither Agree nor Disagree
Strongly Disagree
Disagree
Agree
Strongly Agree
5Coding Likert Scales
- 1. Give each statement a name
- Trustworthy
- Unattractive
- Expert
- Knowledge
- 2. Assign numbers to each response
- 1 Strongly disagree
- 2 Disagree
- 3 Neither Agree nor Disagree
- 4 Agree
- 5 Strongly Agree
6Example of Semantic-Differential Scale
SALESPERSON
7Coding Semantic Differential Scales
- 1. Name each set of bipolar adjectives
- Helpful
- Friendly
- Pester
- Knowledge
- 2. Assign numbers to each blank
- Example for Helpful
- 1 not helpful
- 2
- 3
- 4 Do this for each set.
- 5 Beware of Reverse Coded Items
- 6
- 7 helpful
8Coding Closed-ended Items
What is your overall opinion of SEARS department
stores?
unfavorable
favorable
Typical coding 1unfavorable 2 3
4 5 6 7 favorable
9Coding Closed-ended ItemsCheck All That Apply
Typical coding 6 different variables (1 if
checked 0 if not) (1 if checked 0 if not) (1 if
checked 0 if not) (1 if checked 0 if not) (1 if
checked 0 if not) (1 if checked 0 if not)
10Coding Open-ended Items
Open-ended items seeking concrete, or factual,
responses are relatively easy to code numeric
answers are typically recorded as given by the
respondent, while other types of responses are
given a specific code number.
11Coding Open-ended Items
Open-ended items seeking less structured
responses are much more difficult to code.
12Process for Coding (Abstract) Open-ended
Questions
- Develop initial response categories (before
reading responses) - Identify usable responses
- Review responses add, delete, revise categories
- Sort responses into categories, using multiple
coders compare results - Repeat 3 and 4 if one or more categories are
too broad - Assign code numbers for each category use these
codes to represent responses in the data file - Assess interrater reliability (the degree of
agreement between coders) low interrater
reliability suggests that the categories are not
well-defined, and 3-6 should be repeated
13Developing a Codebook
- SPORTING GOODS SURVEY
- Please answer the following questions about
buying sporting goods over the internet - During the past year, what percentage of the
sporting goods you purchased were ordered through
the internet? - ________ percent
- 2. How willing are you to purchase merchandise
offered through the Avery Sporting Goods web
site? - Not at all willing Somewhat
willing Very willing - 3. Please provide some reasons why someone might
not want to purchase sporting goods over the
internet
14Developing a Codebook
Avery Sporting Goods CODEBOOK (partial) Var.
Name Description ID questionnaire
identification number PERCENT products
purchased through internet (record
response) WILLING willingness to purchase through
web site 1not at all willing 2somewhat
willing 3very willing REASON1 first
reason for not purchasing over internet 1security
issues (open ended) 2no internet
access 3cant examine goods 4diffi
cult to return 5dont want to
wait 6prior bad exper. w/internet 7
other REASON2 second reason SAME REASON3 third
reason SAME
15What do the data look like?
ID PERCENT WILLING REASON1 REASON2 REASON3
1 50 2 4 3
2 0 1 1 6 7
3 20 2 3 4 5
4 90 3 5
5 80 3 5 7