[aclug-L] OCR Success
I needed an OCR program several months ago and tried 'clara'. It was
very disappointing, being unable to identify five percent of the
characters in a scanned document, even after several hours trying to
train it.
Today I found 'gocr' and played with it for several hours. With no
training, it can identify probably 99% of a 12 pt. Arial font document
- not perfect, but a definite time saver. Gocr is not too good on
smaller text and my scanner only goes to 300x300 dpi. In a few
paragraphs from Linux Mag it didn't identify a single lower case 't'.
There were about 40 of them and most were identified as being the
symbol for the Euro. A few times through it with sed s/EUR/t/ made it
readable, but just barely. A scanned newspaper is not much better.
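
Several passes were probably needed because sed's s command, without the g flag, replaces only the first match on each line. Here is a minimal sketch of the difference. It assumes the misread character shows up in the gocr output as the literal string "EUR", as the command above suggests. The sample line is made up for illustration and is not real gocr output.

```shell
# Hypothetical sample line standing in for gocr output, where every
# lower-case 't' was misread and written out as the literal string "EUR".
sample='EURhe scanner goes EURo 300 dpi, buEUR noEUR higher.'

# Without the g flag, sed replaces only the first match on each line.
first_only=$(printf '%s\n' "$sample" | sed 's/EUR/t/')

# With the g flag, every match on each line is replaced in one pass.
all_matches=$(printf '%s\n' "$sample" | sed 's/EUR/t/g')

printf '%s\n' "$first_only"
printf '%s\n' "$all_matches"
```

A blanket replacement like this would also rewrite any genuine "EUR" in the text, so it only makes sense where the scanned page has none.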
So I got an OCR program and learned a little sed. Love my Linux more
every day.
bruce