OCR Toolkit for Gamera
======================

Purpose
-------

"Optical character recognition" (OCR) means the extraction of the
text content from a document image.

This toolkit provides

 - python library functions for building custom ocr applications
 - a ready to use script ocr4gamera


Requirements
------------

This toolkit has been written for the Gamera framework and requires
a working Gamera installation. See the Gamera homepage:

	http://gamera.informatik.hsnr.de/


Documentation
-------------

For a user's guide and a developer's guide see 'doc/html/index.html'.
For release notes and a revision history see 'CHANGES'.

A comprehensive overview of design, usage and customization of the OCR
toolkit can be found in the paper

    C. Dalitz, R. Baston: Optical Character Recognition with the 
    Gamera Framework. In C. Dalitz (Ed.): "Document Image Analysis 
    with the Gamera Framework." Schriftenreihe des Fachbereichs 
    Elektrotechnik und Informatik, Hochschule Niederrhein, vol. 8,
    pp. 53-65, Shaker Verlag (2009)


Installation
------------

See the section "Installation" in 'doc/html/index.html' or 
'doc/src/index.txt'.


Authors
-------

Rene Baston, 2009
Christoph Dalitz, <christoph dot dalitz at hsnr dot de>, 2009

Please contact Christoph Dalitz for questions about this toolkit.


License
-------

This toolkit is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License, either version 2
of the license, or (at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
file LICENSE for more details.
