ocr.rst
author Oleksandr Gavenko <gavenkoa@gmail.com>
Sat, 08 Sep 2012 18:48:13 +0300

.. -*- coding: utf-8; -*-
.. include:: HEADER.rst

======
 OCR.
======
.. contents::

gocr.
=====

Using::

  $ gocr $IN.pnm >$OUT.txt
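
The command above expects PNM input. A minimal sketch for other image formats, assuming ImageMagick's ``convert`` is installed next to gocr (``ocr_with_gocr`` is a hypothetical helper name, not part of gocr):

```shell
# Sketch: run gocr on an image that is not yet in PNM format.
# ocr_with_gocr is a hypothetical helper; it assumes ImageMagick's
# "convert" and gocr are both on PATH.
ocr_with_gocr() {
  img=$1
  base=${img%.*}                    # strip the extension: scan.png -> scan
  convert "$img" "$base.pnm" &&     # ImageMagick picks PNM from the suffix
    gocr "$base.pnm" >"$base.txt"   # recognized text goes to scan.txt
}
```

For example, ``ocr_with_gocr scan.png`` should leave ``scan.pnm`` and ``scan.txt`` next to the original image.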

ocrfeeder.
==========

Document layout analysis and optical character recognition system.

Installing::

  $ sudo apt-get install ocrfeeder

Using::

  $ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm

tesseract.
==========

Installing::

  $ sudo apt-get install tesseract-ocr

Using::

  $ tesseract $IN.tif $OUT
  $ cat $OUT.txt
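
To process many pages, the same call can be wrapped in a loop. A minimal sketch, assuming the pages are ``.tif`` files in one directory and that the language data for the chosen ``-l`` code is installed (``ocr_dir`` is a hypothetical helper name):

```shell
# Sketch: OCR every .tif file in a directory with tesseract.
# ocr_dir and its arguments are hypothetical names for this example.
ocr_dir() {
  dir=$1
  lang=${2:-eng}                  # tesseract language code, default eng
  for img in "$dir"/*.tif; do
    [ -e "$img" ] || continue     # skip the literal pattern when nothing matches
    # tesseract appends .txt to the output base name itself
    tesseract "$img" "${img%.tif}" -l "$lang"
  done
}
```

Calling ``ocr_dir scans`` should produce one ``.txt`` file next to each ``.tif`` in ``scans``.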

ocropus.
========

Using::

  $ ocropus hocr-to-text screen.ppm

ocrad.
======

Optical Character Recognition program.

Installing::

  $ sudo apt-get install ocrad
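
A minimal usage sketch, assuming the input is already a PNM (pbm/pgm/ppm) image; ``ocr_with_ocrad`` is a hypothetical helper name:

```shell
# Sketch: recognize text in a PNM image with ocrad.
# ocr_with_ocrad is a hypothetical helper name.
ocr_with_ocrad() {
  img=$1
  # -o names the output file; without it ocrad prints to stdout
  ocrad -o "${img%.*}.txt" "$img"
}
```

``ocr_with_ocrad page.pbm`` should write the recognized text to ``page.txt``.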

Misc.
=====

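unpapper: post-processing tool for scanned pages; cleans up images (deskewing, removing dark edges) before OCR.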
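
A minimal sketch of running unpapper in front of an OCR step, assuming a PNM scan, unpapper's default settings, and gocr as the recognizer (``clean_and_ocr`` is a hypothetical helper name):

```shell
# Sketch: clean a scanned page with unpapper, then recognize it with gocr.
# clean_and_ocr is a hypothetical helper; unpapper runs with its defaults.
clean_and_ocr() {
  img=$1                                  # input scan, e.g. page.pgm
  clean=${img%.*}-clean.${img##*.}        # page.pgm -> page-clean.pgm
  unpapper "$img" "$clean" &&
    gocr "$clean" >"${img%.*}.txt"        # text ends up in page.txt
}
```

Keeping the cleaned image as a separate file makes it easy to check what unpapper changed before trusting the recognized text.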