| Back | Main view | Parent doc
DOCX format
DOCX format section provides tuning parameters of recognized text export in DOCX (Microsoft Word 2007) format.
Size settings: use this option to set paper size of the output DOCX file (default: A4 (210 x 297 mm))
Layout: use this parameter to tell OCR engine to what degree it should retain page layout, font sizes and other formatting parameters or output result without any formatting (default: Retain full page layout)
Options:
- Remove all formatting: the text in output file is formatted in a single column. Frames are not used. Paragraphs are retained, while types and sizes of fonts are not retained.
- Retain font and font size: paragraphs and fonts types and sizes are retained. The text formatting inside paragraphs is not retained.
- Retain full page layout: full formatting is retained using columns and frames. This is the most convenient type for further text editing.
- Retain exact page layout: produces a document that maintains the formatting of the original. This option is recommended for documents with complex layouts, such as promotion booklets. Note, however, that this option limits the ability to change the text and formatting of the output document.
- Retain editable page layout: produces a document that preserves the original format and text flow but allows easy editing.
Remove optional hyphens: this parameter tells OCR Engine to remove optional hyphens when exporting recognized text in DOCX format. If checked optional hyphens are replaced with hyphens (default: Unchecked)
Keep line breaks: this option specifies if original lines in recognized text are retained during export in DOCX format (default: Checked)
Keep page breaks: specifies if original page arrangement and breaks in recognized text is retained during export in DOCX format (default: Checked)
Retain text color: specifies if original colors of text and background are retained during export of the recognized text in DOCX format (default: Checked)
Background color saving mode: Specifies the mode of background color saving when exporting to DOCX format (default: Save in color)
Options:
- Don't save: the background color is not saved.
- Save in black and white: the background is saved in black-and-white.
- Save inverted blocks only: the background color is saved only for inverted blocks.
- Save in color: the background color is saved.
Keep pictures: specifies if pictures are written in files in DOCX format
Picture format: specifies the image format which will be used during export to an DOCX file with embedded pictures (default: Automatic)
Options:
- Automatic: format is defined automatically.
- JPEG Color: color JPEG format.
- JPEG Gray: gray JPEG format.
- PNG Color: color PNG format.
- PNG Gray: gray PNG format.
- PNG Black And White: black and white PNG format.
Picture resolution: stores the value of picture resolution (dpi) that is used for exporting pictures for DOCX format (default: 200)
JPEG quality: stores the value of the JPEG quality for color pictures saved in DOCX format in percent (default: 50)
Highlight uncertain characters: this option specifies if uncertainly recognized symbols are highlighted with text or background color when exported in DOCX format.
Options:
- With text color: uncertainly recognized characters are highlighted in output DOCX file with the selected text color (default: Unchecked; default color: Green)
- With background color: uncertainly recognized characters are highlighted in output DOCX file with the selected background color (default: Unchecked; default color: Green)
The color with which to highlight the text or background can be set in the Color dialog available through the Color button. Select color by clicking on it and confirming it with OK or abort this selection by pressing Cancel.
| Back | Main view | Parent doc