author | Oleksandr Gavenko <gavenkoa@gmail.com> |
Sat, 10 Feb 2018 01:49:07 +0200 | |
changeset 2230 | 9e6ad6607a9e |
parent 2228 | 837f1337c59b |
permissions | -rw-r--r-- |
1334
9bf0d5a1f0cf
Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1136
diff
changeset
|
1 |
.. -*- coding: utf-8; -*- |
1136 | 2 |
|
3 |
====== |
|
4 |
OCS. |
|
5 |
====== |
|
1346
a2fbf50a43f4
Fix: Has no 'contents::' directive.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1334
diff
changeset
|
6 |
.. contents:: |
1905
fba288d59662
Include only local subsections into TOC. This prevent duplication of
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
1346
diff
changeset
|
7 |
:local: |
1136 | 8 |
|
9 |
gocr. |
|
10 |
===== |
|
2058 | 11 |
:: |
1136 | 12 |
|
13 |
$ gocr $IN.pnm >$OUT.txt |
|
14 |
||
15 |
ocrfeeder. |
|
16 |
========== |
|
17 |
||
18 |
Document layout analysis and optical character recognition system:: |
|
19 |
||
20 |
$ sudo apt-get install ocrfeeder |
|
21 |
||
22 |
Using:: |
|
23 |
||
24 |
$ ocrfeeder-cli --o $OUTDIR --format HTML --images $IN.pnm |
|
25 |
||
26 |
tesseract. |
|
27 |
========== |
|
28 |
||
29 |
Installing:: |
|
30 |
||
31 |
$ sudo apt-get install tesseract-ocr |
|
32 |
||
33 |
Using:: |
|
34 |
||
35 |
$ tesseract $IN.tif $OUT |
|
36 |
$ cat $OUT.txt |
|
37 |
||
38 |
ocropus. |
|
39 |
======== |
|
2228
837f1337c59b
Removed indentation that compiled into <blockquote>.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
2058
diff
changeset
|
40 |
:: |
1136 | 41 |
|
42 |
$ ocropus hocr-to-text screen.ppm |
|
43 |
||
44 |
ocrad |
|
45 |
===== |
|
46 |
||
47 |
Optical Character Recognition program:: |
|
48 |
||
49 |
$ sudo apt-get install ocrad |
|
50 |
||
51 |
Misc. |
|
52 |
===== |
|
53 |
||
54 |
unpapper |