author | Oleksandr Gavenko <gavenkoa@gmail.com> |
Mon, 22 Feb 2016 13:31:27 +0200 | |
changeset 1909 | 6b76120bd2d3 |
parent 1905 | fba288d59662 |
child 1912 | 8b81a8f0f692 |
permissions | -rw-r--r-- |
1334
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
1 |
.. -*- coding: utf-8; -*- |
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
2 |
.. include:: HEADER.rst |
1136 | 3 |
|
4 |
====== |
|
5 |
OCS. |
|
6 |
====== |
|
1346
a2fbf50a43f4
Fix: Has no 'contents::' directive.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1334
diff
changeset
|
7 |
.. contents:: |
1905
fba288d59662
Include only local subsections into TOC. This prevent duplication of
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1346
diff
changeset
|
8 |
:local: |
1136 | 9 |
|
10 |
gocr. |
|
11 |
===== |
|
12 |
||
13 |
$ gocr $IN.pnm >$OUT.txt |
|
14 |
||
15 |
ocrfeeder. |
|
16 |
========== |
|
17 |
||
18 |
Document layout analysis and optical character recognition system:: |
|
19 |
||
20 |
$ sudo apt-get install ocrfeeder |
|
21 |
||
22 |
Using:: |
|
23 |
||
24 |
$ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm |
|
25 |
||
26 |
tesseract. |
|
27 |
========== |
|
28 |
||
29 |
Installing:: |
|
30 |
||
31 |
$ sudo apt-get install tesseract-ocr |
|
32 |
||
33 |
Using:: |
|
34 |
||
35 |
$ tesseract $IN.tif $OUT |
|
36 |
$ cat $OUT.txt |
|
37 |
||
38 |
ocropus. |
|
39 |
======== |
|
40 |
||
41 |
$ ocropus hocr-to-text screen.ppm |
|
42 |
||
43 |
ocrad |
|
44 |
===== |
|
45 |
||
46 |
Optical Character Recognition program:: |
|
47 |
||
48 |
$ sudo apt-get install ocrad |
|
49 |
||
50 |
Misc. |
|
51 |
===== |
|
52 |
||
53 |
unpapper |