Overview

Request 337049 accepted

- Update to version 3.04.00:
* Added OpenCL support (experimental).
* Many bug fixes.
From version 3.03.00:
* Added new training tool text2image to generate box/tif file
pairs from text and truetype fonts.
* Added support for PDF output with searchable text.
* Removed entire IMAGE class and all code in image directory.
* Tesseract executable: support for output to stdout; limited
support for one page images from stdin (especially on Windows)
* Added Renderer to API to allow document-level processing and
output of document formats, like hOCR, PDF.
* Major refactor of word-level recognition, beam search,
eliminating dead code.
* Refactored classifier to make it easier to add new ones.
* Generalized feature extractor to allow feature extraction from
greyscale.
* Improved sub/superscript treatment.
* Improved baseline fit.
* Added set_unicharset_properties to training tools.
* Many bug fixes.
* More training source data included.
- Added new build requirements cairo-devel, doxygen, libicu-devel
and pango-devel.
- Recommend tesseract-ocr-traineddata-english instead of
tesseract-ocr-traineddata-american (based on new (3.04.00)
tesseract-ocr traineddata files).

Request History
Ismail Dönmez's avatar

namtrac created request

- Update to version 3.04.00:
* Added OpenCL support (experimental).
* Many bug fixes.
From version 3.03.00:
* Added new training tool text2image to generate box/tif file
pairs from text and truetype fonts.
* Added support for PDF output with searchable text.
* Removed entire IMAGE class and all code in image directory.
* Tesseract executable: support for output to stdout; limited
support for one page images from stdin (especially on Windows)
* Added Renderer to API to allow document-level processing and
output of document formats, like hOCR, PDF.
* Major refactor of word-level recognition, beam search,
eliminating dead code.
* Refactored classifier to make it easier to add new ones.
* Generalized feature extractor to allow feature extraction from
greyscale.
* Improved sub/superscript treatment.
* Improved baseline fit.
* Added set_unicharset_properties to training tools.
* Many bug fixes.
* More training source data included.
- Added new build requirements cairo-devel, doxygen, libicu-devel
and pango-devel.
- Recommend tesseract-ocr-traineddata-english instead of
tesseract-ocr-traineddata-american (based on new (3.04.00)
tesseract-ocr traineddata files).


Stephan Kulow's avatar

coolo added as a reviewer

Being evaluated by staging project "openSUSE:Leap:42.1:Staging:adi:1"


Stephan Kulow's avatar

coolo accepted review

Picked openSUSE:Leap:42.1:Staging:adi:1


Stephan Kulow's avatar

coolo accepted review

ready to accept


Stephan Kulow's avatar

coolo approved review

ready to accept


Stephan Kulow's avatar

coolo added as a reviewer

Being evaluated by staging project "openSUSE:Leap:42.1:Staging:adi:1"


Stephan Kulow's avatar

coolo accepted review

ready to accept


Stephan Kulow's avatar

coolo approved review

ready to accept


Stephan Kulow's avatar

coolo accepted request

Accept to openSUSE:Leap:42.1

openSUSE Build Service is sponsored by