author | Oleksandr Gavenko <gavenkoa@gmail.com> |
Sat, 10 Feb 2018 01:28:53 +0200 | |
changeset 2227 | 1e9323e7ec88 |
parent 2058 | b6533d1e3019 |
child 2228 | 837f1337c59b |
permissions | -rw-r--r-- |
1334
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
1 |
.. -*- coding: utf-8; -*- |
1136 | 2 |
|
3 |
====== |
|
4 |
OCS. |
|
5 |
====== |
|
1346
a2fbf50a43f4
Fix: Has no 'contents::' directive.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1334
diff
changeset
|
6 |
.. contents:: |
1905
fba288d59662
Include only local subsections into TOC. This prevent duplication of
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1346
diff
changeset
|
7 |
:local: |
1136 | 8 |
|
9 |
gocr. |
|
10 |
===== |
|
2058 | 11 |
:: |
1136 | 12 |
|
13 |
$ gocr $IN.pnm >$OUT.txt |
|
14 |
||
15 |
ocrfeeder. |
|
16 |
========== |
|
17 |
||
18 |
Document layout analysis and optical character recognition system:: |
|
19 |
||
20 |
$ sudo apt-get install ocrfeeder |
|
21 |
||
22 |
Using:: |
|
23 |
||
24 |
$ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm |
|
25 |
||
26 |
tesseract. |
|
27 |
========== |
|
28 |
||
29 |
Installing:: |
|
30 |
||
31 |
$ sudo apt-get install tesseract-ocr |
|
32 |
||
33 |
Using:: |
|
34 |
||
35 |
$ tesseract $IN.tif $OUT |
|
36 |
$ cat $OUT.txt |
|
37 |
||
38 |
ocropus. |
|
39 |
======== |
|
40 |
||
41 |
$ ocropus hocr-to-text screen.ppm |
|
42 |
||
43 |
ocrad |
|
44 |
===== |
|
45 |
||
46 |
Optical Character Recognition program:: |
|
47 |
||
48 |
$ sudo apt-get install ocrad |
|
49 |
||
50 |
Misc. |
|
51 |
===== |
|
52 |
||
53 |
unpapper |