VietOCR is an easy-to-use application that provides optical character recognition (OCR) solutions for Vietnamese language.
Optical character recognition or OCR is the process through which images of handwritten, typewritten or printed text are converted to computer-editable text.
Here are some key features of "VietOCR":
· PDF, TIFF, JPEG, GIF, PNG, BMP image formats
· Multi-page images
· Selection box
· File drag-and-drop
· Postprocessing for Vietnamese to boost accuracy rate
· Vietnamese input methods
· Integrated scanning support (on Windows only)
· Watch folder monitor for support of batch processing
· Custom text replacement in postprocessing
Requirements:
· Java
What's New in This Release: [ read full changelog ]
· Update Tesseract engine to v3.02 Alpha (r671)
· Use Tesseract 3.02 language data packs
· Enable text entry in the combobox for Tesseract 3.02's multi-language OCR support
· Update Hunspell to v1.3.2