Qualitative - PowerPoint PPT Presentation

1 / 36
About This Presentation
Title:

Qualitative

Description:

To distinguish Partridge from Porter (FISK = PRTR) an additional letter g is added to make the new FISK for Partridge (PRTRG) ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 37
Provided by: Sar9159
Category:

less

Transcript and Presenter's Notes

Title: Qualitative


1
Qualitative Quantitative Research
  • Session Tutor Sarah Richardson
    sarah.richardson_at_warwick.ac.uk

2
Overview
  • Assessment procedures
  • Qualitative research methods
  • Source assessment and data-modelling
  • The challenge of sources
  • How will relational databases help?
  • Source analysis
  • Database design and creation
  • Free text databases
  • Methodological issues
  • Practical sessions

3
Assessment
  • Link http//www2.warwick.ac.uk/fac/arts/history/p
    ostgraduate/tsm/quan_research

4
Qualitative/Quantitative Divide
  • Elliot blurs distinction between the two (but
    hers is a social science definition)
  • Quantitative standardised questions generates
    data expressed in numerical form
  • Qualitative less structured research yielding
    rich textual or observational data
  • She suggests focus on narrative is the way forward

5
How does this relate to TSM?
  • A more critical framework for the collection of
    data
  • Source appreciation and analysis
  • Encouragement to engage with a wider theoretical
    context eg on historical change, causality and
    identity
  • Awareness of the skills of both qualitative and
    quantitative researchers

6
Challenges
  • Unstructured source material
  • Missing data
  • Complications with numbers and dates
  • Data comes from more than one source

7
Databases should look like this?
Unique identifier or primary key
Column or field or attribute
Row or record
Field name or attribute name
8
But what do you do with this?
Letter from the Medici Granducal Archive
9
How will relational databases help?
  • A relational database is a database created with
    many tables linked together
  • Each table has a common factor which links it to
    others in the database
  • For complex sources a number of tables may be
    created to deal with different aspects of the data

10
Relational model
Offences Table Defendant ID Case Number Offence
Type Place of Offence Date of Offence Description
Comments
Defendant Table Defendant ID First
name Surname Address Age Sex Occupation
Title Comments
Sentence Table Defendant ID Case
Number Verdict Sentence Comments
Witnesses Table Case Number Witness 1 First
name Witness 1 Surname Witness 1 Address Witness
1 Sex Witness 2 First name Witness 2
Surname Witness 2 Address Witness 2 Sex Comments
Occupational Categorisation Table Occupation
Title Occupational Categorisation 1 Occupational
Categorisation 2
11
A more complex relational database
12
Source analysis
  • Data should be broken down into components that
    collects groups of information into objects or
    events.
  • For example information relating to a person, an
    organisation, a document, an object or a
    building, or to events such as a marriage, a
    transaction, the making of a will, or an
    election.
  • In database terminology these are referred to as
    entities.
  • Each entity will form a table in the final
    database.

13
Attributes
  • Once each entity has been identified, list the
    data associated with each.
  • For example, the Defendant table has information
    on the first name, surname, address, age, sex and
    occupation of each defendant.
  • This information will produce the fields for each
    table.
  • The fields are also known as attributes.

14
Field types
15
Issues for field types
  • Size
  • Calculations
  • Dates
  • Currency
  • Unstructured data
  • Unique identifiers

16
Relationships
  • One-to-one relationships records in one table
    have only one match with records in a second
    table.
  • One-to-many relationships records in the first
    table match many in the second, but those in the
    second table only have one match.
  • Many-to-many relationships records from both
    tables have relationships between them

17
Data entry tips
  • Fields may be designated as required.
  • Default values may be entered.
  • Use the tool to allow one of only two options to
    be entered such as Yes/No, True/False, Guilty/Not
    Guilty.
  • Look-up tables a fixed list of values that may
    be entered into a particular field.
  • Validation rules.
  • Automatic generation of unique numbers.

18
Free Text Databases
  • Free text databases search unstructured texts and
    images provided in digital form
  • They work by tagging the text in a mark-up
    language (eg HTML, XML, SGML). In the past users
    had to do this. Now most programmes will do it
    for you.
  • The database may then be searched in a number of
    ways full-text wildcard searches with and ?
    Boolean searches (AND, OR, and NOT) proximity
    searches numeric searches (gt, lt, gt, lt, ltgt)
    Date searches Fuzzy searches

19
Examples of free text databases
20
Methodological Issues
  • Nominal record linkage
  • Coding
  • Occupational analysis
  • Prosopography
  • Community reconstruction

21
Nominal Record Linkage
  • Concerns all historians using data containing
    names
  • How do we determine that sources relate to the
    same person and not another person with the same
    name?
  • Particularly difficult for early modern sources
    where names are not fixed.
  • Two problems
  • The existence of multiple common names. This
    problem is particularly acute in local
    communities where certain surnames are dominant.
  • Variation in spellings.

22
Solutions
  • Coding surnames using standardisation schemes, eg
    SOUNDEX or FISKs
  • Using multiple passes through the data changing
    variables each time as the data is matched
  • Using a combination of computer and manual
    techniques

23
SOUNDEX
24
SOUNDEX rules
  • Names With Double Letters If the surname has any
    double letters, they should be treated as one
    letter.
  • Names with Letters Side-by-Side that have the
    Same SOUNDEX Code Number should be treated as
    one letter. For example, Jackson or Schmidt.
  • Names with Prefixes such as Van or De should be
    coded twice with and without the prefix
  • Consonant Separators If a vowel (A, E, I, O, U)
    separates two consonants that have the same
    SOUNDEX code, the consonant to the right of the
    vowel is coded.

25
Problems with SOUNDEX
  • Does not work so well for European names. Works
    best with names of English origin
  • Does not work as well with early modern names and
    spelling variants
  • One solution for early modern historians is FISK

26
Four Letter Initial Surname Codes (FISK)
  • Consists of letters and punctuation marks
  • Generated from first letter of a surname variant
    plus up to three further consonants from the
    surname.
  • Vowels only used when they are the first letter
    of the surname
  • A full stop is used where no second, third or
    fourth letter is available for use.

27
  • If surname variants are deduced to be of the same
    surname base these names are considered to form a
    distinct surname group and the same FISK is
    allocated
  • Thus Eyres is coded as ARS. Group Ayres. Morrice
    is coded as MRS. Group Morris
  • Bowyer is coded with Boyer and Springall with
    Springold.
  • Davies and Davidson are placed in one group. ap
    Howell is included in the group Powell

28
Five letter FISKs
  • Used to differentiate between similar but
    distinct surname groups.
  • Fifth letter would normally be a distinctive
    letter from the end of the surname, but any
    letter could be used, and often a vowel from the
    start of the surname would be convenient.
  • To distinguish Partridge from Porter (FISK
    PRTR) an additional letter g is added to make the
    new FISK for Partridge (PRTRG). The code for
    Porter remains as (PRTR).
  • To distinguish Bailey from Bloy (FISK BLY.) an
    additional letter y is added to make the new FISK
    for Bailey (BLY.Y) The code for Bloy remains as
    (BLY.)

29
Coding
  • Used to be necessary because databases could not
    handle large amounts of text
  • Historians still code
  • data entry may be speeded up by using simple
    codes eg. M for married, U for unmarried, and
    W for widowed but complicated coding may slow
    data entry down
  • Is a form of close assessment of the data and may
    lead to the development of categories for ease
  • May facilitate the process of record linkage

30
Deciding to code
  • Should coding take place before or after data
    entry?
  • Should codes be letters or numbers? Numbers mean
    high level of error
  • Coding schemes should make decisions in the light
    of other classification systems used by
    historians.
  • Full code book should be developed as part of the
    documentation to accompany the database.

31
Occupational analysis
  • Form of post-coding
  • Assist in analysing fields with numerous values
  • Most common type is categorisation of
    occupational information.
  • Must be able to compare with other research in
    the field and to provide as complete a picture as
    possible regarding the status and occupation of
    the population

32
Coding schemes
  • Modern historians use standardised occupational
    classification systems
  • Early modern historians often each devise their
    own schema
  • A compromise is to use a multi-dimensional
    approach each occupation is classified using
    several different methods. Occasionally
    individual occupational titles may be isolated
    where any categorisation would destroy the
    nuances of work experiences.

33
Prosopography
  • Mostly used for study of elites
  • Database is created not from a single source but
    many bringing biographical data together
  • Use relational design to avoid very large,
    multi-field databases containing many blank
    fields
  • Consider issues of nominal record linkage

34
Community Reconstruction
  • Concentrates on bringing together all records
    from one place
  • Needs careful design
  • Primary methodological issue is one of record
    linkage, so documents, place names and
    individuals may all have their own ID codes

35
Practical Exercises
  • Source analysis and database construction
  • Occupational categorisation
  • Nominal record linkage and coding using SOUNDEX

36
Sources
  • Charles Harvey and Jon Press, Databases in
    Historical Research (Palgrave, 1996)
  • Sonja Cameron and Sarah Richardson, A Computing
    Guide for Historians (Palgrave forthcoming)
  • History Data Service www.ahds.ac.uk
  • Free text database www.asksam.com
Write a Comment
User Comments (0)
About PowerShow.com