Title: Development of an OCR System
Nathan Harmata, Period 5
What is OCR?
Optical Character Recognition
Font and handwriting based
Goals of My Project
Generic recognition for Latin-based fonts
Proper handling of most formatting
Potential Issues
"Noise"
Weird formatting or page content
How the current version works
Input: PNG file
Courier font, size 24
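Because Courier is a fixed-width font, one plausible first step is to slice the page into equal-sized character cells. The sketch below assumes the PNG has already been decoded and binarized into rows of 0s and 1s; `split_into_cells`, `cell_width`, and `cell_height` are hypothetical names for illustration, not taken from the project.

```python
def split_into_cells(bitmap, cell_width, cell_height):
    """Slice a binary page bitmap into a grid of fixed-size character cells.

    `bitmap` is a list of equal-length rows (1 = dark pixel, 0 = light pixel).
    Returns a list of text lines; each line is a list of cells, and each cell
    is a list of rows. Partial cells at the right or bottom edge are dropped.
    """
    lines = []
    for top in range(0, len(bitmap) - cell_height + 1, cell_height):
        band = bitmap[top:top + cell_height]  # one line of text
        width = len(band[0])
        line = [
            [row[left:left + cell_width] for row in band]
            for left in range(0, width - cell_width + 1, cell_width)
        ]
        lines.append(line)
    return lines


# Hypothetical 2x4 page holding two 2x2 character cells on one line.
PAGE = [
    [1, 0, 0, 1],
    [1, 1, 0, 1],
]
cells = split_into_cells(PAGE, cell_width=2, cell_height=2)
```

Real cell dimensions would depend on the rendering resolution of the size 24 font, which the slides do not give.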
How the current version works
Integer pixel counts for each quadrant
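A minimal sketch of the quadrant idea, assuming each character arrives as a small binary bitmap (1 = dark pixel). The function name `quadrant_counts` and the sample glyph are made up for illustration; how the project handles odd dimensions or the middle row and column is not stated, so this version simply assigns them to the bottom and right quadrants.

```python
def quadrant_counts(glyph):
    """Count dark pixels in each quadrant of a binary glyph bitmap.

    `glyph` is a list of equal-length rows (1 = dark pixel, 0 = light pixel).
    Returns (top_left, top_right, bottom_left, bottom_right).
    """
    height = len(glyph)
    width = len(glyph[0]) if height else 0
    mid_row, mid_col = height // 2, width // 2
    counts = [0, 0, 0, 0]
    for r, row in enumerate(glyph):
        for c, pixel in enumerate(row):
            if pixel:
                # Index 0..3: add 2 for the bottom half, 1 for the right half.
                index = (2 if r >= mid_row else 0) + (1 if c >= mid_col else 0)
                counts[index] += 1
    return tuple(counts)


# Hypothetical 4x4 glyph shaped like an "L".
GLYPH_L = [
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
]
print(quadrant_counts(GLYPH_L))  # (2, 0, 3, 2)
```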
How the current version works
Cache system
Uppercase, lowercase, and punctuation
Spaces and newlines
Compares with 's, not integers
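One way a cache of known characters could be matched against an unknown glyph is sketched below. It assumes the comparison uses each quadrant's share of the glyph's dark pixels instead of the raw integer counts; that choice, the distance measure, and all names and sample values (`build_cache`, `recognize`, `SAMPLES`) are assumptions for illustration, not details confirmed by the slides.

```python
def quadrant_shares(counts):
    """Convert four quadrant pixel counts into fractions of the total."""
    total = sum(counts)
    if total == 0:
        return (0.0, 0.0, 0.0, 0.0)  # blank cell, e.g. a space
    return tuple(c / total for c in counts)


def build_cache(samples):
    """Map each known character to its quadrant shares."""
    return {char: quadrant_shares(counts) for char, counts in samples.items()}


def recognize(counts, cache):
    """Return the cached character whose shares are closest to `counts`."""
    target = quadrant_shares(counts)

    def distance(char):
        # Sum of absolute differences across the four quadrants.
        return sum(abs(a - b) for a, b in zip(target, cache[char]))

    return min(cache, key=distance)


# Hypothetical reference counts (top_left, top_right, bottom_left, bottom_right).
SAMPLES = {"L": (2, 0, 3, 2), "T": (2, 2, 1, 1), ".": (0, 0, 1, 0)}
CACHE = build_cache(SAMPLES)

print(recognize((4, 0, 6, 4), CACHE))  # L
```

Comparing shares instead of raw counts makes the match independent of glyph size: the query above has twice as many dark pixels as the cached "L" but the same proportions.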