| Back | Main view | Parent doc
CSV format
CSV format section provides tuning parameters of recognized text export in CSV format.
Code page: this property specifies the code page to which the recognized text is exported. If this property is (Automatic) the code page is selected automatically using the Code page type property value (default: (Automatic))
Text encoding: specifies the encoding type of the output file in CSV format (default: Auto)
Options:
- ASCII: ASCII encoding, one byte per symbol.
- Unicode UTF8: UTF8 is a code page that uses a string of bytes to represent a 16-bit Unicode string where ASCII text remains unchanged as a single byte, other text is converted to a 2-byte sequence (including Latin, Greek, Cyrillic, Hebrew, and Arabic) or a 3-byte sequence (Chinese, Japanese, Korean, and others).
- Unicode UTF16: native Unicode format where every symbol is represented by two-byte sequence.
- Auto: encoding is selected automatically.
Separator: stores the character with which the table separators are replaced in the exported text (default: Tabulator)
Options:
- Tabulator: separator character is tabulator '\t'
- Comma: separator character is comma ','
- Semicolon: separator character is comma ';'
Export text from table blocks only: this option tells OCR engine to export recognized text from table blocks only (default: Unchecked)
Use page break character as page separator: this option specifies if page break symbols (0x12) should be inserted between pages in case multiple pages are exported into CSV format (default: Unchecked)
Use blank line as paragraph separator: option specifies if an empty line should be inserted between paragraphs and act as paragraph separator (default: Unchecked)
Keep line breaks: this option specifies if original lines in recognized text are retained during export in CSV format (default: Checked)
Append to the end of file: this option specifies if exported text is to be appended to the end of file if it already exists (default: Unchecked)
Append EOF: specifies if the EOF symbol is inserted at the end of file (default: Unchecked)
| Back | Main view | Parent doc