About OCR program.
authorOleksandr Gavenko <gavenkoa@gmail.com>
Tue, 13 Dec 2011 12:52:38 +0200
changeset 1136 8d9c9a102827
parent 1135 d98ac8df0e85
child 1137 161ffe7b7daf
About OCR program.
ocr.rst
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/ocr.rst	Tue Dec 13 12:52:38 2011 +0200
@@ -0,0 +1,49 @@
+
+======
+ OCS.
+======
+
+gocr.
+=====
+
+  $ gocr $IN.pnm >$OUT.txt
+
+ocrfeeder.
+==========
+
+Document layout analysis and optical character recognition system::
+
+  $ sudo apt-get install ocrfeeder
+
+Using::
+
+  $ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm
+
+tesseract.
+==========
+
+Installing::
+
+  $ sudo apt-get install tesseract-ocr
+
+Using::
+
+  $ tesseract $IN.tif $OUT
+  $ cat $OUT.txt
+
+ocropus.
+========
+
+  $ ocropus hocr-to-text screen.ppm
+
+ocrad
+=====
+
+Optical Character Recognition program::
+
+  $ sudo apt-get install ocrad
+
+Misc.
+=====
+
+unpapper