Dr. Istv - PowerPoint PPT Presentation

About This Presentation

Title:

Dr. Istv

Description:

B/W Scanning. Gray Scanning. Color Scanning. Load from ... Photo. Text recognition. User assisted correction. Result exportation. 04 Jul 2005. Istvan Marosi ... – PowerPoint PPT presentation

Number of Views:70

Avg rating:3.0/5.0

Slides: 70

Provided by: marosi

Category:

Tags: istv

more less

Transcript and Presenter's Notes

Title: Dr. Istv

1
SSIP 2005, Szeged
Character Recognition Internals

Dr. István Marosi
Scansoft-Recognita, Inc., Hungary

2
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text recognition
User assisted correction
Result exportation

3
OCR Internals

Main tasks of an OCR system
Image acquisition
Get image
B/W Scanning
Gray Scanning
Color Scanning
Load from image file
Preprocess image
Layout recognition
Text recognition
User assisted correction
Result exportation

4
OCR Internals

Main tasks of an OCR system
Image acquisition
Get image
Preprocess image
Color separation
Thresholding
Despeckling
Rotation
Deskewing
Layout recognition
Text recognition
User assisted correction
Result exportation

Color Separation
De-speckle, de-skew
5
The Preprocessed Image
Joined chars
6
The Preprocessed Image
Joined chars
7
The Preprocessed Image
Joined chars
8
The Preprocessed Image
Broken chars
9
The Preprocessed Image
Broken chars
10
The Preprocessed Image
Broken chars
11
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text zones
Columns of flowed text
Tables
Inverse text
Graphic zones
Text recognition
User assisted correction
Result exportation

12
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text zones
Graphic zones
Line Art
Photo
Text recognition
User assisted correction
Result exportation

13
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text recognition
Segmentation
Calculation of Feature Vector Elements
Classification
Language Analysis
Voting
User assisted correction
Result exportation

14
Segmentation

What are those pixel groups belonging to a single
letter?

15
Segmentation

What are those pixel groups belonging to a single
letter?

16
Segmentation

What are those pixel groups belonging to a single
letter?

17
Segmentation

What are those pixel groups belonging to a single
letter?

18
Segmentation

What are those pixel groups belonging to a single
letter?

19
Segmentation

What are those pixel groups belonging to a single
letter?

20
Segmentation

What are those pixel groups belonging to a single
letter?

21
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text recognition
Segmentation
Calculation of Feature Vector Elements
Classification
Language Analysis
Voting
User assisted correction
Result exportation

22
Calculation of FV Elements Contour Tracing

Find a (new) white-black transition
Follow the edge of the pixels using the MIN or
MAX rule
Administrate the already traced white-black
transitions
Collect information while going around
And repeat the process on new shapes ...

23
Contour Tracing

Find a (new) white-black transition
Follow the edge of the pixels using the MIN or
MAX rule
Administrate the already traced white-black
transitions
Collect information while going around
And repeat the process on new shapes ...

24
Contour Tracing

Find a (new) white-black transition
Follow the edge of the pixels using the MIN or
MAX rule

if black(a) then turn(ccw) else if black(b) then
forward else turn(cw)
a
b
25
Contour Tracing

Find a (new) white-black transition
Follow the edge of the pixels using the MIN or
MAX rule

if black(a) then turn(ccw) else if black(b) then
forward else turn(cw)
a
b
a
if white(b) then turn(cw) else if white(a) then
forward else turn(ccw)
b
26
Contour Tracing

Find a (new) white-black transition
Follow the edge of the pixels using the MIN or
MAX rule
Administrate the already traced white-black
transitions
Collect information while going around
And repeat the process on new shapes ...

27
Some Easily Calculatable Data

Problem 1

Turning CW InIn-11 Turning CCW
InIn-1-1 Going Forward InIn-1
28
Some Easily Calculatable Data

Problem 2

Turning CW InIn-11 Turning CCW
InIn-1-1 Going Forward InIn-1
29
Some Easily Calculatable Data

Problem 3

Going Up InIn-1-Xn Going Down
InIn-1Xn Going Right InIn-1 Going Left
InIn-1
30
Some Easily Calculatable Data

Problem 4

Going Up InIn-1-Xn Going Down
InIn-1Xn Going Right InIn-1 Going Left
InIn-1
31
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text recognition
Segmentation
Calculation of Feature Vector Elements
Classification
Language Analysis
Voting
User assisted correction
Result exportation

32
Classification Training models

Restricted Coulomb Energy (RCE) Network(Dr. Leon
Cooper, Dr. Charles Elbaum)

B
A
B
A
33
Classification Training models

Restricted Coulomb Energy (RCE) Network(Dr. Leon
Cooper, Dr. Charles Elbaum)

B
A
B
A
34
Classification Training models

Nestor Learning System (NLS)

35
Classification Training models

Nestor Learning System (NLS)

Default radius Rmax
36
Classification Training models

Nestor Learning System (NLS)

37
Classification Training models

Nestor Learning System (NLS)

Default radius Rmax
38
Classification Training models

Nestor Learning System (NLS)

39
Classification Training models

Nestor Learning System (NLS)

40
Classification Training models

Nestor Learning System (NLS)

Default radius Rmax
41
Classification Training models

Nestor Learning System (NLS)

42
Classification Training models

Nestor Learning System (NLS)

Decreased radius
43
Classification Training models

Nestor Learning System (NLS)

44
Classification Training models

Nestor Learning System (NLS)

Decreased radius Rmin
45
Classification Training models

Nestor Learning System (NLS)

Pass 2
Decreased radius
46
OCR Internals

Main tasks of an OCR system
Image acquisition
Layout recognition
Text recognition
Segmentation
Calculation of Feature Vector Elements
Classification
Language Analysis
Voting
User assisted correction
Result exportation

47
Voting

Text recognition in OmniPage Pro
OCR Engines available
Caeres engine (codename Salt Pepper)
Recognitas engine (codename Paprika)
ScanSofts engine (codename Fireworx)

48
Voting

Text recognition in OmniPage Pro
OCR Engines available
Caeres engine (Salt Pepper)
Uses a Matrix Matching based algorithm
feature set 40 cells of an 8x5 grid
good overall description of a shape
weaker at detailed structure
Recognitas engine (Paprika)
Uses a Contour Tracing based algorithm
feture set convex and concave arcs on the
contour
good detailed description of a shape
weaker at overall structure

49
Voting

Text recognition in OmniPage Pro
OCR Engines available
Caeres engine (Salt Pepper)
Recognitas engine (Paprika)
ScanSofts engine (Fireworx)
Segmentation algorithms

50
Voting

Text recognition in OmniPage Pro
OCR Engines available
Caeres engine (Salt Pepper)
Recognitas engine (Paprika)
ScanSofts engine (Fireworx)
Segmentation algorithms
Developed by independent groups
Have different strengths and weaknesses

51
Voting

Text recognition in OmniPage Pro
OCR Engines available
Segmentation algorithms
Conclusion
They are complementary
Lets create a voting system

52
Voting
Image

Voting strategies
External Black boxvoting20 gain

Paprika
Salt Pepper
Fire- worx
Txt 3
Txt 1
Txt 2
Dict
Vote
Final Txt
53
Voting
Image

Voting strategies
External Black boxvoting
Internal Shapevoting

Fire- worx
Salt Pepper
Paprika
Txt 2
Txt 1
Txt 3
Dict
Bronze
Final Txt
54
Voting
Image
Recognize originalsegmentation

Paprika
Original segmentation
Every independent connected component is a
character
Good segmentation recognize
Bad segmentation reject

K.B.
55
Voting
Image
Recognize originalsegmentation

Paprika

K.B.
Txt 1
Train adaptive classifierfrom original shapes
Txt 2
AdaptiveK.B.
56
Voting
Image
Recognize originalsegmentation

Paprika
Try several segmentations
Loop if unrecognizable

K.B.
Txt 1
Train adaptive classifierfrom original shapes
Txt 2
Recognize broken andjoined shapes
AdaptiveK.B.
Dict
57
Voting
Image
Recognize originalsegmentation

Paprika

K.B.
Txt 1
Train adaptive classifierfrom original shapes
Txt 2
Recognize broken andjoined shapes
AdaptiveK.B.
Train adaptive classifierfrom ugly shapes
Dict
58
Voting
Image
Recognize originalsegmentation

Paprika

K.B.
Txt 1
Train adaptive classifierfrom original shapes
Txt 2
Recognize broken andjoined shapes
AdaptiveK.B.
Train adaptive classifierfrom ugly shapes
Dict
Recognize more brokenand joined shapes