Writer identification by writers invariants - PowerPoint PPT Presentation

1 / 15
About This Presentation
Title:

Writer identification by writers invariants

Description:

The documents D and T are represented by their respective set of grapheme ... Identification results on grapheme sequences with compression (a) and without ... – PowerPoint PPT presentation

Number of Views:178
Avg rating:3.0/5.0
Slides: 16
Provided by: kimjoo
Category:

less

Transcript and Presenter's Notes

Title: Writer identification by writers invariants


1
Writer identification bywriters invariants
2
? ?
  • 1. Introduction
  • 2. State of the art
  • 3. Proposed approach
  • 4. Experiment
  • 5. Conclusion

3
1. Introduction
  • Pre-processing the image is cleaned by noise
    reduction,then lines and words are extracted
  • Feature extraction features which are
    quantitative measurements are used to
    discriminate the writers as well as possible
    they can be global or local and structural or
    statistical.
  • Classification the search of the nearest writer
    is guided by the extracted features using an
    adapted metric.
  • Assumption Use the elementary patterns the
    writing is made up of graphemes
  • Writing is individual, the elements which make it
    up are also individual.

4
2. State of the art
  • Writer verification approach two documents read
    in input determine whether the two documents are
    written by the same writer or by two different
    writers.
  • cf) writer identification approach identify a
    writer among a set of N candidates
  • The verification approach simply formulated
    as a two-class discrimination problem,
  • the identification approach requires the
    use of a nearest neighbor based decision, due to
    the potentially large number of candidates.

5
  • Features from texture the document image is
    seen in this case as simple image and not as a
    writing
  • EX) Garbor filter and cooccurence matrics
  • Structural features in this case the extracted
    features attempt to describe the writing
    properties.
  • EX) The average height, width, slope and
    legibility of the characters

6
3. Proposed approach
  • 3.1 identification
  • D(each handwritten document)
  • A similarity measure between the handwritten
    document D and an unspecified handwritten
    document T
  • Similarity measure between two unspecified
    graphemes.

7
  • Sim(x,y) is 1 two D is exact same.
  • Finally classified as the writer of the document
    of the reference set

8
3.2 identification with writers invariants
  • The documents D and T are represented by their
    respective set of grapheme
  • Each individual writing a certain level of
    redundancy of the elementary patterns writers
    invariants
  • Based on these invariants, we can define
    variability measure of the writing.
  • In order to accelerate the procedure of writer
    identification, we propose to represent the
    handwritten texts by their invariant graphemes.
  • We need the compression of the handwritten
    information.

9
4.1 Database
4. Experiment
  • A database of 88 writers who have been asked to
    copy out one letter chosen among two suggested
  • Each one is made up of 107 and 98 words
    respectively.
  • The texts obtained were cut in nonequal parts
  • Two thirds, one thirds.
  • The first two thirds were used as the reference
    set of writers, and the remaining third was used
    for testing.

10
Total 88 writers
107 words
Choose one letter copy out the letter
test
88 words
reference
Cut in nonequal parts
11
  • 1 series of test the last third of each text is
    used to identify the writer
  • 2 series of test we extract from each third of
    each text 5 samples of graphemes sequences of 5
    various length
  • for each writer, 5 examples for each of
    the 5 lengths selected(10,20,30,40 and 50
    graphemes).

12
4.2 writer identification from texts
  • Method1 based on template matching using the
    correlation measure, uncompressed.
  • Method2 unknown text-compressed , reference set
    - all graphemes.
  • Method3 references set compressed , unknown
    text - all graphemes
  • Method4 references set, unknown text -
    compressed

13
  • Figure1 identification results on tests using
    various compressed representation

14
4.3 writer identification from sequences of
graphemes
Identification results on grapheme sequences with
compression (a) and without compression (b).
.
15
5.Conclusion
  • Identification is based on pattern matching of
    individual components of handwriting.
  • Give good identification performance even using
    small samples of handwriting.
  • The correct writer can be selected within a list
    of 5 candidates in almost all cases, using only a
    sample of 50 graphemes.
Write a Comment
User Comments (0)
About PowerShow.com