ocr.rst
changeset 1136 8d9c9a102827
child 1334 9bf0d5a1f0cf
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/ocr.rst	Tue Dec 13 12:52:38 2011 +0200
@@ -0,0 +1,49 @@
+
+======
+ OCS.
+======
+
+gocr.
+=====
+
+  $ gocr $IN.pnm >$OUT.txt
+
+ocrfeeder.
+==========
+
+Document layout analysis and optical character recognition system::
+
+  $ sudo apt-get install ocrfeeder
+
+Using::
+
+  $ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm
+
+tesseract.
+==========
+
+Installing::
+
+  $ sudo apt-get install tesseract-ocr
+
+Using::
+
+  $ tesseract $IN.tif $OUT
+  $ cat $OUT.txt
+
+ocropus.
+========
+
+  $ ocropus hocr-to-text screen.ppm
+
+ocrad
+=====
+
+Optical Character Recognition program::
+
+  $ sudo apt-get install ocrad
+
+Misc.
+=====
+
+unpapper