February 29th, 2012· Update Tesseract engine to v3.02 Alpha (r684)
· Enable text entry in the combobox for Tesseract 3.02's multi-language OCR support
· Fit Image now retains image aspect ratio
February 13th, 2012· Update Tesseract engine to v3.02 Alpha (r671)
· Use Tesseract 3.02 language data packs
· Enable text entry in the combobox for Tesseract 3.02's multi-language OCR support
· Update Hunspell to v1.3.2
January 23rd, 2012· Fix a context menu's font issue with displaying Unicode characters for spellcheck suggestions
January 16th, 2012· Fix an issue with opening Help file on OS X
· Update JACOB to 1.16-M2 version
· Update JNA to 3.4.0 version
October 24th, 2011· Update Tesseract 3.01 to r638 (final release version)
· Remove unneeded liblept168.dll
· Update lists of language codes
· Update JACOB to 1.16-M1 version
· Add PSM support to execution from command line
October 3rd, 2011· Update Tesseract 3.01 to r625
· Provide Page Segmentation Mode options for Tesseract engine
August 23rd, 2011· Update Tesseract 3.01 to r622
· Provide PSM options to Tesseract engine
August 2nd, 2011· Update Tesseract 3.01 to r597
June 6th, 2011· Refactoring
· Improve program usability, enabling image nagivation and manipulation with keyboard
· Fix an EOL issue that broke Remove Line Breaks functionality on Windows
· Fix an issue with restart notification after language pack downloads
· Update Tesseract 3.01 to r585
· Replace Vietnamese language pack with an improved version
May 30th, 2011· Incorporate deskew functionality using GMSE Deskew algorithm
· Fix a MissingResourceException associated with Font dialog (Java only)
May 30th, 2011· Port changes from version 2.0
· Update Tesseract OCR engine to 3.01 (r551)
May 30th, 2011· Upgrade Tesseract OCR engine to 3.0
· Replace old format (2.0x) language data with new format (3.0) language data
· Change datafile suffix from .inttemp to .traineddata
May 30th, 2011· Incorporate deskew functionality using GMSE Deskew algorithm
· Fix a MissingResourceException associated with Font dialog (Java only)
May 30th, 2011· Fix a bug which hangs the program if x.DangAmbigs.txt contains entries starting with an equal symbol
· Improve postprocessing performance by caching the word list used; reload only if changes
· Fix a bug that crashes the program when inline spellcheck suggests on empty text (.NET only)
· Incorporate Apple Java Extensions (Java only)
May 30th, 2011· Upgrade JACOB library to version 1.15-M4 (Java only)
· Add support for spellcheck suggestion in context menu
· Improve program accessibility and usability
· Add support for downloading and installing language data packs and appropriate spell dictionaries
· Add UI localization for Lithuanian and Slovak
· Refactor by breaking up large classes into smaller ones
May 30th, 2011· Integrate a Java binding for Hunspell library to provide spellchecking and spellcheck-as-you-type functionality. Include English and Vietnamese dictionaries
· Add support for a custom dictionary
· List in correct order files generated from PDF conversion
· Upgrade JACOB library to version 1.15-M3
· Preset Tesseract path on Linux to /usr/bin, the default install location of Tesseract
May 30th, 2011· Display image information
· Add Screenshot Mode, which rescales low-resolution images to 300 DPI to be more suitable for OCR operations
· Read output and error streams to prevent subprocess to block or deadlock due to limited buffer size for standard output streams (Java version)
· Fix a problem in which paste (image) event fires twice (Java version)
· Fix an issue with subimages generated by selection box on Linux (Java version)
May 30th, 2011· Add provision to load UTF-8 text file into textbox
· Add Recent Files submenu
· Add Save button on toolbar
· Fix scale factor, offset issues in image manipulation
· Improve postprocessing for Vietnamese
· Add support for more VNI fonts to Vietnamese language data
May 30th, 2011· Fix an image size issue and associated scale factor when toggling between Fit Image vs. Actual Size after (Java) resizing window or (.NET) scrolling in picturebox
· Add unit test
· Improve post-OCR correction for Vietnamese
· Bundle Vietnamese language data for VNI & TCVN3 (ABC) fonts
May 30th, 2011· Add support for execution from command line
· Add support for paste image from clipboard
· Add support for JPEG2000 and PNM image types (Java version)
October 28th, 2009· Publish OCR interim results to produce more responsive UI performance, improving user experience
· Support for cancellation of running OCR tasks
· Merge PDF functionality
October 28th, 2009· Improved exception handling with appropriate error messages
· Improved handling of PDF documents that has many pages. Putting too many images, as a result of PDF extraction, in a multi-page TIFF eventually will generate out-of-memory exceptions
· Split PDF functionality
October 28th, 2009· Integrated PDF support using GPL Ghostscript
October 28th, 2009· Merge TIFF functionality
October 28th, 2009· Refactored for improvements
October 28th, 2009· Updated to Tesseract 2.04 engine (bundled Windows executable)
· Added more language codes to ISO639-3.xml file
· Added a pangram.xml file for displaying appropriate Preview text in the Font Dialog for the OCR language currently selected
· Moved various settings to the Options dialog
· Removed the option of Locating Tesseract on Windows. Current Tesseract is the executable bundled inside the program
· Added support for custom text replacement in postprocessings
October 28th, 2009· Updated to Tesseract 2.04RC engine
· Added indeterminate progressbar for (more animated) task status
· Added All Image Files filter
· Removed Vietnamese-glyph font filter to now show all system fonts
· Changed FontDialog's default Preview text to the standard English pangram to make it more universal
· Modified SimpleFilter to accept multiple file extensions
October 28th, 2009· Fixed the way TESSDATA_PREFIX environment variable handled in Linux
· Clean up temporary files if errors occur during OCR operations
· Fixed a regression EOL bug with output files in Windows
· Display appropriate error message during batch process
October 28th, 2009· Added text formatting functionality
October 28th, 2009· Added watch folder functionality for Batch Processing support
October 28th, 2009· Revamped localization codes
· Added rudimentary support for English postprocessing
October 28th, 2009· Minor fixes and various improvements
October 28th, 2009· Implemented image rotation functionality
October 28th, 2009· Fixed an error with path in Linux
· Additional instruction for configuring Tesseract on Linux
October 28th, 2009· Integrated scanning support via WIA Automation Library v2.0
October 28th, 2009· Localized user interface
October 28th, 2009· Proof-of-concept design
· Support TIFF image formats
· Added support for JPEG, GIF, BMP, PNG formats
· Added post-processing for Vietnamese to improve accuracy
· Added Vietnamese input methods
· Added recognition of selected area on image
· Added file drag-drop
· Added a context menu for the textarea
· Added support for selection of Look and Feel
· Display appropriate message when Tesseract engine crashes
· Fixed the issue involving filepaths containing spaces
· Bundled JAI Image I/O 1.1 library
· Use Java 6.0
· Use Tesseract 2.03 OCR engine
· Use Vietnamese language data for Tesseract 2.03 (data for 2.01 crashes frequently with Tesseract 2.03)