Releases: ropensci/tesseract
Releases · ropensci/tesseract
tesseract 5.0.0
tesseract 4.1
tesseract 4.1
- Fix memory leak in ocr_data()
- Windows / MacOS: update to libtesseract 4.1.0. This re-enables
the whitelist/blacklist options that were missing in Tesseract 4.0
tesseract 4.0
tesseract 4.0
- Windows, MacOS: Upgrade to upstream Tesseract 4.0! Completely new OCR engine.
- Tesseract 4 has a new training data format. On Windows / MacOS you need to
re-download your language data with tesseract_download(). The package uses
separate directories for storing Tesseract 3 vs 4 data so they shouldn't get
mixed up (hopefully). - Drop hard-dependency on tibble (only load if available)
tesseract 2.3
tesseract 2.3
- Fix problem with setlocale() not properly restoring locale.
- Switch examples from dontrun{} to donttest{}, and '--run-donttest' on travis/appveyor
tesseract 2.2
Tesseract 2.2
- Fixes for breaking changes in Tesseract 4.0.0 beta.3
- Set LC_ALL = C when initiating tesseract
- Include <tesseract/*> to support Tesseract 4
tesseract 2.1
v2.1 Bump
tesseract 2.0
tesseract 2.0
- Set tesseract parameters at init so that all parameters types now actually work!
- New function tesseract_params() lists all supported parameters and their default
- Added 'config' argument to tesseract() which specifies a file with parameter values
- Internally validate paremeter names before init to revent tesseract crashes
- Rewrite the ocr_data() function in C++ to make it much faster
- Tesseract 4 now gets data from the tessdata_fast repo as recommended upstream
- Use default resolution of 300dpi when image does not contain resolution info
tesseract 1.8
1.8
- Add support for HOCR output (#20)
- Remove 'script' and 'orientation' attributes in output (doesn't work in Tesseract 4)
1.7 (internal)
- Add support upcoming Tesseract 4 (compiler fix + separate tessdata dir)
- Configure script now explicitly tests for CXX11 (required by Tesseract 4)
tesseract 1.6
Tesseract 1.6
- Windows: update libtesseract to 3.05.01
- tesseract_download now uses 3.04 tree (instead of 4.00) as suggested in readme
- For static packags on Win/Mac, languages stored in: rappdirs::user_data_dir('tesseract')
- Use 'png' instead of 'tiff' to read magick images
- Compile with $(C_VISIBILITY) to hide internal symbols (requires Rcpp 0.12.12)
- Use Rcpp symbol registration
CRAN release 1.4
Tesseract 1.4
- Run engine finalizer on R exit (requires Rcpp 0.12.10)
- Move autobrew script to separate repository
- Add symbol registration