author | Oleksandr Gavenko <gavenkoa@gmail.com> |
Sat, 03 Oct 2015 18:29:10 +0300 | |
changeset 1765 | 2132765de2f4 |
parent 1346 | a2fbf50a43f4 |
child 1905 | fba288d59662 |
permissions | -rw-r--r-- |
1334
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
1 |
.. -*- coding: utf-8; -*- |
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
2 |
.. include:: HEADER.rst |
1136 | 3 |
|
4 |
====== |
|
5 |
OCS. |
|
6 |
====== |
|
1346
a2fbf50a43f4
Fix: Has no 'contents::' directive.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1334
diff
changeset
|
7 |
.. contents:: |
1136 | 8 |
|
9 |
gocr. |
|
10 |
===== |
|
11 |
||
12 |
$ gocr $IN.pnm >$OUT.txt |
|
13 |
||
14 |
ocrfeeder. |
|
15 |
========== |
|
16 |
||
17 |
Document layout analysis and optical character recognition system:: |
|
18 |
||
19 |
$ sudo apt-get install ocrfeeder |
|
20 |
||
21 |
Using:: |
|
22 |
||
23 |
$ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm |
|
24 |
||
25 |
tesseract. |
|
26 |
========== |
|
27 |
||
28 |
Installing:: |
|
29 |
||
30 |
$ sudo apt-get install tesseract-ocr |
|
31 |
||
32 |
Using:: |
|
33 |
||
34 |
$ tesseract $IN.tif $OUT |
|
35 |
$ cat $OUT.txt |
|
36 |
||
37 |
ocropus. |
|
38 |
======== |
|
39 |
||
40 |
$ ocropus hocr-to-text screen.ppm |
|
41 |
||
42 |
ocrad |
|
43 |
===== |
|
44 |
||
45 |
Optical Character Recognition program:: |
|
46 |
||
47 |
$ sudo apt-get install ocrad |
|
48 |
||
49 |
Misc. |
|
50 |
===== |
|
51 |
||
52 |
unpapper |