Nicomsoft OCR SDK Tutorials
OCR MRZ
What is MRZ? MRZ is the machine-readable-zone with the data on the identity page, encoded in optical character recognition format. It is widely used in passports, vises, travel documents, identity cards, etc. According ISO/IEC-7501, MRZ consists of two or three lines, every line consists of 30, 36 or 44 characters. Most data in MRZ is [...]
Video – Embedding OCR within a few minutes
This video demonstrates how to include NSOCR into your application quickly: Download video
Working with multi-page images
How to check if an image consists of several pages? It is easy to do: Load the image to an IMG object (for example, using the “Img_LoadFile” function), and then call the “Img_GetPageCount” function, which returns the number of pages in the image. If the value returned by the second function is more than 1, [...]
OCR in Java
The Nicomsoft OCR library (NSOCR) is a set of native x86/x64 binary files and data files. To use such a library in Java, you need a proxy library developed in compliance with the Java Native Interface (JNI). In NSOCR SDK, we have included the “JNSOCR” proxy library that implements the JNE interface and works between [...]
OCR in Linux
Nicomsoft OCR SDK for Linux consists of a set of precompiled dynamic libraries (lib*.so) for the x86 and x64 platforms, additional files required for OCR (they are placed in the same directory), and sample projects with source code. We have tested NSOCR on the following Linux-based operating systems: Ubuntu Mint Fedora Mageia OpenSUSE Nicomsoft OCR [...]
OCR steps in details
Any OCR process consists of several steps like image scaling, deskewing, noise removing, binarization, lines detection, and so on. Sometimes it is very useful to have some kind of control at these steps, for example, to pause the OCR process after some step, analyze the intermediate results and change some settings of the next step [...]
General description of Nicomsoft OCR architecture
The Nicomsoft OCR engine (NSOCR) consists of the following modules: Main engine module: Engine_XXXXX functions. It is used for initialization/uninitialization of the library and general library configuration. Configuration module: Cfg_XXXXX functions. It is used to create, destroy, and manage CFG objects. A CFG object stores all settings that are used in the OCR process. OCR [...]
Optical Character Recognition (OCR) – How it works
OCR is a complex technology that converts images containing text into formats with editable text. OCR allows you to process scanned books, screenshots, and photos with text, and get editable documents like TXT, DOC, or PDF files. This technology is widely used in many areas. The most advanced OCR systems can handle almost any types [...]