author: Oleksandr Gavenko <gavenkoa@gmail.com>
date: Fri, 13 Jul 2012 22:42:13 +0300
changeset: 1336:80c5eff010a1
parent: 1334:9bf0d5a1f0cf
child: 1346:a2fbf50a43f4

.. -*- coding: utf-8; -*-
.. include:: HEADER.rst

======
OCR.
======

gocr.
=====

Using::

  $ gocr $IN.pnm >$OUT.txt

ocrfeeder.
==========

Document layout analysis and optical character recognition system::

  $ sudo apt-get install ocrfeeder

Using::

  $ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm

tesseract.
==========

Installing::

  $ sudo apt-get install tesseract-ocr

Using::

  $ tesseract $IN.tif $OUT
  $ cat $OUT.txt

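The commands above handle a single image. For a directory of scans, a loop along the following lines may help. This is a minimal sketch, assuming the images are ``.tif`` files; ``ocr_dir`` and the ``OCR_CMD`` override are hypothetical names introduced here and are not part of tesseract.

```shell
# Run an OCR command over every .tif file in a directory.
# ocr_dir and OCR_CMD are hypothetical names for this sketch;
# OCR_CMD defaults to tesseract and can be swapped for a dry run.
ocr_dir() {
    dir=${1:-.}
    for img in "$dir"/*.tif; do
        # Skip the unexpanded pattern when no .tif files exist.
        [ -e "$img" ] || continue
        # tesseract appends .txt to the output base name itself.
        "${OCR_CMD:-tesseract}" "$img" "${img%.tif}"
    done
}
```

Setting ``OCR_CMD=echo`` before calling ``ocr_dir`` prints the argument pairs instead of running OCR, which is a cheap way to check the file list first.
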
ocropus.
========

Using::

  $ ocropus hocr-to-text screen.ppm

ocrad
=====

Optical Character Recognition program::

  $ sudo apt-get install ocrad

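A script meant to work with more than one of these engines first needs to know which one is installed. The following is a minimal sketch, assuming only the command names already used in these notes; ``first_ocr_engine`` is a hypothetical helper, and the order of the candidate list is arbitrary.

```shell
# Print the name of the first OCR engine found in PATH.
# first_ocr_engine is a hypothetical helper for this sketch; the
# candidate list is just the command names mentioned in these notes.
first_ocr_engine() {
    for cmd in tesseract gocr ocrad; do
        if command -v "$cmd" >/dev/null 2>&1; then
            printf '%s\n' "$cmd"
            return 0
        fi
    done
    return 1   # none of the engines is installed
}
```

The caller still has to build the right command line for the engine it gets back, since each tool takes its input and output arguments differently.
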
Misc.
=====

unpaper