| Back | Main view

Processed objects in IMiS/OCR Server Notes Application job

Product:IMiS/OCR Server
Release:7.6.907
Date:08/03/2009

Case: One of the way to use IMiS/OCR Server is through a Notes application. Administrator needs to define a Notes Application job that more or less consists of setting up Domino server and Source/Target settings. With IMiS enabled Notes application users can than use IMiS/OCR Server to OCR image objects. This article describes the way processed objects are handled in Notes documents with IMiS/OCR Server.

Description:

IMiS/OCR Server Notes application job saves information of processed objects in a document multi-value text field item called IMiSOCRServerProcessedObjs. Before IMiS/OCR Server 7.6.907 object identifiers of processed objects were added to IMiSOCRServerProcessedObjs item and recognized fulltext was stored in a richtext item called IMiSOCRServerFT. This allowed Notes application to know which objects were processed and also to perform a fulltext search of document associated with image objects.



With IMiS/OCR Server 7.6.907 this design is changed in the following way. IMiSOCRServerProcessedObjs item now holds OCR process information listed below including so-called version tag, which will be used to handle future changes. Current version tag is set to "$20001". IMiSOCRServerFT is obsolete as far as IMiS/OCR Server is concerned and is now replaced with richtext field items with root name IMiSOCRServerFTItem followed by a number starting with 1 (e.g. IMiSOCRServerFTItem1) that hold only one processed object's recognized fulltext per item.

IMiSOCRServerProcessedObjs item value has the following form:

      "<version_tag>","<src_object_id>","<ft_item_name>","<object_name>","<ocr_result_name>", "<ocr_stats>","<ocr_error_msg>"

Item value consists of double-quated, comma-separated string values which represent:

<version_tag>version tag of the IMiSOCRServerProcessedObjs value format (currently $20001)
<src_object_id>source object identifier (e.g. for object on IMiS/Arc: ARCDEV/b264aa02790fbf3057621f6819a15e393da9b59b1a1490eb9db0a653fc05116e)
<ft_item_name>name of the richtext item that holds processed object's recognized fulltext. It starts with IMiSOCRServerFTItem followed by a number (e.g. IMiSOCRServerFTItem1)
<object_name>object name, that shows up in a menu under View button (e.g: test.tif)
<ocr_result_name>name of the processed object, that shows up in a menu under View button with postfix ' (OCR)' (e.g. test.tif (OCR))
<ocr_stats>statistics of the processed object, with comma-separated numerical values (e.g. 114106,69558,3,4169,0,0)
<ocr_error_msg>error message that occurred during OCR process if any, otherwise empty double-quates

and <ocr_stats> value has the following form:

       <object_size>,<ocr_result_size>,<page_count>,<total_chars>,<susp_chars>,<unrec_chars>

where values are numbers and represent:

<object_size>object size in bytes
<ocr_result_size>processed object size in bytes
<page_count>number of pages
<total_chars>number of total characters
<susp_chars>number of suspicious characters
<unrec_chars>number of unrecognized characters


Example of IMiSOCRServerProcessedObjs item value:

"$20001","ARCDEV/b264aa02522b325db0d7f9892a2f85e34fc0f323d420fce58ba354fea88a0a31","IMiSOCRServerFTItem1","test.tif","test.tif (OCR)","114106,69558,3,4169,0,0",""

Related Documents:

Database 'IMiS Knowledge database', View 'All Documents', Document 'IMiS/OCR Server 7.6.907' IMiS/OCR Server 7.6.907

| Back | Main view