tesseract: 3.04.01 -> 3.05.00

Upstream changelog:

 * Fine-tuned the hOCR output.
 * Added TSV as another optional output format.
 * Fixed ABI break introduced in 3.04.00 with the AnalyseLayout()
   method.
 * text2image tool - Enable all OpenType ligatures available in a font.
   This feature requires Pango 1.38 or newer.
 * Training tools - Replaced asserts with tprintf() and exit(1).
 * Fixed Cygwin compatibility.
 * Improved multipage tiff processing.
 * Improved the embedded pdf font (pdf.ttf).
 * Enable selection of OCR engine mode from command line.
 * Changed tesseract command line parameter '-psm' to '--psm'.
 * Added new C API for orientation and script detection, removed the old
   one.
 * Increased minimum autoconf version to 2.59.
 * Removed dead code.
 * Fixed many compiler warnings.
 * Fixed memory and resource leaks.
 * Fixed some issues with the 'Cube' OCR engine.
 * Fixed some OpenCL issues.
 * Added option to build Tesseract with CMake build system.
 * Implemented CPPAN support for easy Windows building.

The upstream URL of the change log is:

https://github.com/tesseract-ocr/tesseract/releases/tag/3.05.00

Tested by building against the following packages that directly depend
on it:

 * vapoursynth (with ocrSupport = true)
 * pyocr (fails)
 * vobsub2srt

Also tested against the following NixOS VM tests that have OCR enabled:

 * nixos/tests/chromium.nix -A stable
 * nixos/tests/emacs-daemon.nix
 * nixos/tests/installer.nix -A luksroot
 * nixos/tests/lightdm.nix
 * nixos/tests/plasma5.nix
 * nixos/tests/sddm.nix

All of the packages build and all of the tests pass on x86_64-linux,
with the exception of pyocr.

Fixing pyocr is outside the scope of this commit and will follow in a
separate change very soon.
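
The package builds and one of the VM tests listed above could be
collected into a single Nix expression. This is only a minimal sketch and
not part of the commit. It assumes evaluation from the root of a nixpkgs
checkout, that vobsub2srt is exposed as a top-level attribute of the same
name, and that the test file evaluates to a function whose arguments all
have defaults (as the `-A stable` form above suggests). The file name
check-tesseract.nix is hypothetical. pyocr is left out because it is
known to fail.

```nix
# Sketch only; save as a hypothetical check-tesseract.nix in the nixpkgs
# root and build everything with `nix-build check-tesseract.nix`.
let
  pkgs = import ./. { };
in {
  # The commit message tests vapoursynth with ocrSupport = true.
  vapoursynth = pkgs.vapoursynth.override { ocrSupport = true; };

  # Assumed attribute name, taken from the package name in the list above.
  vobsub2srt = pkgs.vobsub2srt;

  # One of the OCR-enabled VM tests; assumes the file is a function whose
  # arguments all have defaults.
  chromiumStable = (import ./nixos/tests/chromium.nix { }).stable;
}
```

The remaining VM tests could be added the same way, one attribute per
test file.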

Signed-off-by: aszlig <aszlig@redmoonstudios.org>
aszlig 2017-04-08 02:43:18 +02:00
parent 42bb63f803
commit c381fa9b63

@@ -1,5 +1,5 @@
-{ stdenv, fetchFromGitHub, pkgconfig, leptonica, libpng, libtiff
-, icu, pango, opencl-headers
+{ stdenv, fetchFromGitHub, autoreconfHook, pkgconfig
+, leptonica, libpng, libtiff, icu, pango, opencl-headers
 # Supported list of languages or `null' for all available languages
 , enableLanguages ? null
@@ -7,13 +7,13 @@
 stdenv.mkDerivation rec {
   name = "tesseract-${version}";
-  version = "3.04.01";
+  version = "3.05.00";
   src = fetchFromGitHub {
     owner = "tesseract-ocr";
     repo = "tesseract";
     rev = version;
-    sha256 = "0h1x4z1h86n2gwknd0wck6gykkp99bmm02lg4a47a698g4az6ybv";
+    sha256 = "11wrpcfl118wxsv2c3w2scznwb48c4547qml42s2bpdz079g8y30";
   };
   tessdata = fetchFromGitHub {
@@ -23,7 +23,7 @@ stdenv.mkDerivation rec {
     sha256 = "1v4b63v5nzcxr2y3635r19l7lj5smjmc9vfk0wmxlryxncb4vpg7";
   };
-  nativeBuildInputs = [ pkgconfig ];
+  nativeBuildInputs = [ pkgconfig autoreconfHook ];
   buildInputs = [ leptonica libpng libtiff icu pango opencl-headers ];
   LIBLEPT_HEADERSDIR = "${leptonica}/include";