JetStream Recognition

JetStream Recognition

Key Highlight for Data Capture with Unparalleled OCR & ICR Recognition


Experience unparalleled accuracy in data capture with IDA Recognition. Our advanced OCR (Optical Character Recognition) and ICR (Intelligent Character Recognition) technology excel even in challenging scenarios such as distorted, poor-quality scans, machine-printed text, and difficult-to-read handwriting. With IDA Recognition, you can significantly reduce the need for manual corrections, ensuring enhanced straight-through processing and increased efficiency. Moreover, our Recognition Feature is available as an SDK, providing you with greater flexibility and seamless integration options into your existing systems.


Recognition Datasheet

JetStream Recognition Demo Video

Intelligent Document Recognition Key Features

Graphic related to document  management with two large document folder icons as well as some people and other icons.


PATENTED CORE TECHNOLOGY


  • Compex documents
  • Preserves all transcripts
  • Highest accuracy



WIDE SCOPE OF SCENARIOS



  • Machine printed
  • Handwritten text
  • Barcodes & more


Grapgic of a woman standing in front of some sort of overview with boxes and lines that she is inspecting.
Graphic of a woman with a laptop standing next to some other graphics that are related to document management.


COMPREHENSIVE PDF CAPABILITIES


  • Stores Smart PDFs with text layers
  • Supporting PDF/A
  • High efficiency archiving


AUTO LANGUAGE DETECTION



  • Multiple language simultaneously
  • Based on refined language models
  • Pre-trained models


Graphic of two people standing next to a light bulb.
Graphic of a person standing in front of a computer and phone icon with a loading bar and a settings icon.


EASY DEPLOYMENT & INTEGRATION


  • On-premises or cloud-based
  • Java app or Docker
  • Integrating seamlessly with gRPC API



Versatile JSON output format


  • Output: JSON format
  • Easy access to extracted data fields
  • Seamless downstream processing
A graphic of a person sitting on a box with a settings icon with another box next to her with icons related  to coding languages.

JetStream Recognition Add-Ons

Enhance the capabilities of JetStream Recognition with a diverse selection of Recognition Add-ons. These add-ons, available as separate licenses, expand the functionality of JetStream Recognition to include a wide range of features. These include capturing 1D and 2D codes, redaction of sensitive information, recognition of historical scripts, Arabic and Chinese character recognition, and advanced table extraction. With these add-ons, you have the flexibility to customize JetStream Recognition to meet your specific document processing needs, unlocking even more possibilities for automation and efficiency.

Barcode Recognition

Supported codes:

1D, 2D, CodaBar, Code39, Code93, Code128, EAN8, EAN13, ITF, UPC-E, Aztec, DataMatrix, PDF417, QR

Historical Scripts

  • Historical blackletter scripts (Gothic)
  • Historical handwriting (f.e. Kurrent, Sütterlin)
  • English, French, German, Latin, Italian


Entity Finder

Based on either a list of words or regular expressions.


  • Highlighting
  • RedactionBETA

Beta Add-Ons

  • Non-Latin Script LanguagesBETA


  • Table RecognitionBETA



What Is Document Recognition?

Document recognition, also known as document analysis or document understanding, is the process of automatically analyzing and understanding the content and structure of a document. It involves extracting meaningful information from documents, such as text, images, tables, and other elements, and interpreting their meaning and context. Document recognition techniques typically include optical character recognition (OCR) for extracting text from images, as well as techniques for layout analysis, document segmentation, and classification. Document recognition is widely used in various industries, such as finance, healthcare, and legal, to automate document processing tasks and improve efficiency.

What is OCR?

OCR, Optical Character Recognition, is a technology that enables computers to recognize and extract text from images or scanned documents. OCR systems use algorithms and pattern recognition techniques to identify characters and transform them into machine-readable text. This technology is commonly used to convert physical documents, such as invoices, receipts, or printed text, into editable and searchable digital files. OCR has various applications, including document digitization, data entry automation, and improving accessibility for visually impaired individuals.

Difference between OCR and AI Document Recognition

AI document recognition uses advanced techniques to analyze and understand entire documents, extracting meaningful information such as text, images, and tables while interpreting their context with normally a very high accuracy. It can combine different aspects such as OCR with document segmentation, layout analysis, and classification.


Traditional OCR, however, mainly focuses on recognizing and converting text from images into machine-readable form. It relies on pixel-based pattern recognition, identifying letters by analyzing their shapes. In contrast, AI-powered recognition uses neural networks and considers the entire document’s context, enabling more sophisticated comprehension and adaptability beyond basic text conversion.


The Importance of OCR & AI Recognition


1) Digitization and Document Management:


OCR enables the conversion of physical documents, such as paper files or scanned images, into editable and searchable digital text. This facilitates efficient document management, storage, and retrieval, reducing reliance on physical paperwork and enabling easy access to information.


2) Time & Cost Savings:


OCR automates data entry processes, eliminating the need for manual typing and reducing human error. This saves significant time and reduces labor costs associated with data entry tasks, enabling employees to focus on more value-added work.


3) Improved Accuracy & Efficiency:


OCR technology has advanced significantly, achieving high accuracy rates in text recognition. Automated data extraction with OCR reduces the risk of human error and improves data accuracy. It also speeds up processing times, allowing businesses to handle large volumes of documents quickly and efficiently.


4) Enhanced Searchability & Accessability:


By converting physical documents into searchable digital files, OCR enables users to easily locate specific information within documents. This improves efficiency in information retrieval and enables faster decision-making. Additionally, OCR helps make printed text accessible to individuals with visual impairments by converting it into speech or Braille.


5) Integration with Business Systems:


OCR can seamlessly integrate with various business systems, such as content management systems (CMS), enterprise resource planning (ERP) software, or customer relationship management (CRM) platforms. This enables streamlined workflows, data synchronization, and efficient data exchange between different systems.


6) Compliance and Regulatory Requirements:


Many industries have compliance and regulatory requirements related to document management and data processing. OCR technology helps organizations meet these requirements by ensuring accurate data capture, retention, and retrieval.


OCR technology offers significant advantages in terms of efficiency, accuracy, cost savings, and compliance. It has become an essential tool for businesses across industries, improving productivity, data management, and decision-making processes.




AI Recognition & OCR FAQ



  • What does OCR Stand for

    OCR stands for Optical Character Recognition, a technology used to convert different types of documents, such as scanned paper documents or images, into machine-readable text.

  • is OCR AI

    Traditional OCR by itself is not typically considered AI, as it relies on basic pattern recognition to convert text from images or scanned documents into digital text. However, modern OCR systems often incorporate AI techniques, such as neural networks, to enhance their capabilities. Neural networks enable OCR to better understand text context, adapt to various fonts and layouts, and improve accuracy by learning from data over time. This AI-driven approach allows OCR systems to move beyond simple text extraction to more sophisticated document comprehension and processing.

  • What is Intelligent Document Recognition

    Intelligent Document Recognition (IDR) is an advanced technology that automatically identifies, categorizes, and extracts data from documents using AI-powered methods. Unlike basic OCR, which primarily focuses on text extraction, IDR leverages machine learning, natural language processing (NLP), and other AI techniques to understand the context, structure, and relationships within a document. This enables it to handle complex documents with varying formats, such as invoices, contracts, or forms, and accurately extract relevant data. 

Share by: