notesalexp.org

[Source: ocrodjvu]

Package: ocrodjvu (0.12.0+git1361-9b31731-9reb~2401)

tool to perform OCR on DjVu documents

Ocrodjvu is a wrapper around the Optical Character Recognition (OCR) systems Cuneiform, Gocr, Ocrad, OCRopus and (standalone) Tesseract. It is designed for OCR on documents in DjVu format, which is especially suited for high-quality archiving of books.

After processing, the DjVu document contains an embedded text layer. Other programs can then use that layer to read the document, search it for specific terms, print it, or improve its accessibility.
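As a rough illustration of that workflow, the Python sketch below runs OCR on a DjVu file and then searches the resulting text layer. It assumes that `ocrodjvu` accepts the `--engine`, `--language`, and `--in-place` options as described in its manual page, and that `djvutxt` (from djvulibre-bin) prints the text layer to standard output. The helper names (`build_ocr_command`, `find_term`, `ocr_and_search`) and the defaults `tesseract` and `eng` are hypothetical choices made for this example. The script is a standalone Python 3.7+ program and is not part of the package.

```python
import shutil
import subprocess


def build_ocr_command(djvu_path, engine="tesseract", language="eng"):
    """Return an argv list for ocrodjvu (hypothetical helper).

    Assumption: --in-place writes the text layer back into djvu_path,
    overwriting the original file, so work on a copy.
    """
    return ["ocrodjvu", "--engine", engine, "--language", language,
            "--in-place", djvu_path]


def find_term(text, term):
    """Return (line_number, line) pairs that contain term, ignoring case."""
    needle = term.lower()
    return [(number, line)
            for number, line in enumerate(text.splitlines(), start=1)
            if needle in line.lower()]


def ocr_and_search(djvu_path, term):
    """Run OCR on the file, then search the embedded text layer."""
    for tool in ("ocrodjvu", "djvutxt"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    subprocess.run(build_ocr_command(djvu_path), check=True)
    # Assumption: djvutxt with a single file argument prints the text layer.
    result = subprocess.run(["djvutxt", djvu_path], check=True,
                            capture_output=True, text=True)
    return find_term(result.stdout, term)
```

Calling `ocr_and_search("book-copy.djvu", "archive")` would return the matching lines with their line numbers, provided both tools and the chosen OCR engine are installed. The file name here is a placeholder.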

Other Packages Related to ocrodjvu

  Legend: dep = depends, rec = recommends, sug = suggests
  • dep: djvulibre-bin
    Utilities for the DjVu image format
  • dep: python (>= 2.7)
    interactive high-level object-oriented language (default version)
    or python-argparse
    Interactive high-level object-oriented language (standard library, version 2.7)
  • dep: python-djvu
    Python support for the DjVu image format
  • dep: python:any (>= 2.7.5-5~)
  • rec: python-html5lib
    HTML parser/tokenizer based on the WHATWG HTML5 specification
  • rec: python-lxml
    pythonic binding for the libxml2 and libxslt libraries
  • rec: python-pyicu (>= 1.0~)
    Python extension wrapping the ICU C++ API
  • rec: python-subprocess32
    backport of the Py3 stdlib subprocess module for Py2
  • rec: tesseract-ocr
    Tesseract command line OCR tool
  • sug: cuneiform
    multi-language OCR system
  • sug: gocr
    Command line OCR
  • sug: ocrad
    optical character recognition program

Download ocrodjvu

Download for all available architectures
Architecture  Version                           Package Size  Installed Size  Files
all           0.12.0+git1361-9b31731-9reb~2401  51.5 KiB      191 KiB         [list of files]