Smart and powerful OCR tools

Nicomsoft OCR SDK Knowledge base

Video – Embedding OCR within a few minutes

This video demonstrates how to include NSOCR into your application quickly: Download video

>>

Working with multi-page images

How to detect if image has several page? It is easy: load image to IMG object (for example, with “Img_LoadFile” function) and then call “Img_GetPageCount” function, it returns number of pages of the image. So if this function returns value more than 1, we have loaded multipage image. Only few image formats can contain multiple [...]

>>

OCR in Java

Nicomsoft OCR library (NSOCR) is a set of native x86/x64 binary files and data files, in order to use such library in Java it is necessary to use a proxy library developed in compliance with Java Native Interface (JNI). In NSOCR SDK we have included “JNSOCR” proxy library that implements JNE interface and works between [...]

>>

OCR in Linux

Nicomsoft OCR SDK for Linux consists of a set of precompiled dynamical libraries (lib*.so) for x86 platform, additional files that are needed for OCR (they are placed in the same directory) and sample projects with source codes. We have tested NSOCR on the following Linux-based operating systems: Ubuntu Mint Fedora Mageia OpenSUSE Files of Nicomsoft [...]

>>

OCR steps in details

Any OCR process always contains several steps like image scaling, deskew, noise removing, binarization, lines detection and so on. Sometimes it is very useful to have some kind of control on these steps, for example, pause OCR process after some step, analyze intermediate results and change some settings of next step before applying it, or [...]

>>

General description of Nicomsoft OCR architecture

Nicomsoft OCR engine (NSOCR) has the following modules: Main engine module: Engine_XXXXX functions. It is used for initialization/uninitialization of the library and general library configuration. Configuration module: Cfg_XXXXX functions. It is used to create, destroy and manage CFG objects. CFG object stores all settings that are used in OCR process. OCR module: Ocr_XXXXX functions. It [...]

>>

Optical Character Recognition (OCR) – How it works

OCR is a complex technology that converts images with text into editable formats. OCR allows you to process scanned books, screenshots and photos with text and get editable documents like TXT, DOC or PDF files. This technology is widely used in many areas and the most advanced OCR systems can handle almost all types of [...]

>>