Multiple recognition languages in IMiS/OCR Server
Product: IMiS/OCR Server
Release: 7.6.907
Date: 07/30/2009
Case:
IMiS/OCR Server 7.6.907 is the first version that allows administrators and users to set more than one OCR language. The language setting is crucial to successful OCR of image documents: characters that do not belong to the character set of the selected language(s) are either not recognized or replaced by wrong characters. Users can now OCR documents that contain text in several languages or alphabets. For example, they can select both a Latin-based and a Cyrillic-based language to properly recognize scanned documents.
Description:
The IMiS/OCR Server administrator can set the default language(s) in the Preferences dialog, available through the Tools menu or the tray icon popup menu. In the General section of the OCR Settings, you can choose from all available languages. A simple interface allows you to add or remove languages.
Languages set in the preferences can be overridden by job language settings. Job settings are available through the Tools menu or the tray icon popup menu. To enable job language settings, uncheck the option to inherit the default OCR language and set the new recognition languages in the same way as in the IMiS/OCR Server preferences.
Language(s) set in the preferences or job settings can in turn be overridden by user requests to OCR attached documents. The subject of an IMiS/OCR Server email request uses the following syntax:
IMiS OCR Server Request, <languages>
IMiS OCR Server Request, L=<languages>, O=<output-format>
Examples:
IMiS OCR Server Request, English
IMiS OCR Server Request, L=English, Slovenian, O=pdf
IMiS OCR Server Request, L=Serbian (Cyrillic), Serbian (Latin)
As the examples above show, a user can set custom language(s) specific to a request. If no languages are set, the job or default languages are used to recognize the attachments. Languages must be selected from the lists below. They have to be comma-separated, and their names are case-insensitive and can be given in any order.
Choose each value of the <languages> parameter from the following options:
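The subject syntax and the fallback rule described above can be illustrated with a short Python sketch. It is not the parser used by IMiS/OCR Server. The function names parse_subject and effective_languages are hypothetical, and the handling of the L= and O= prefixes is an assumption inferred from the examples.

```python
PREFIX = "IMiS OCR Server Request"


def parse_subject(subject):
    """Return (languages, output_format) parsed from a request subject.

    Language names are case-folded because the article states that they
    are case-insensitive. output_format is None when no O= part is given.
    """
    subject = subject.strip()
    if not subject.lower().startswith(PREFIX.lower()):
        raise ValueError("not an IMiS/OCR Server request subject")

    languages = []
    output_format = None
    # Everything after the fixed prefix is a comma-separated list.
    for token in subject[len(PREFIX):].split(","):
        token = token.strip()
        if not token:
            continue
        key = token[:2].upper()
        if key == "O=":
            output_format = token[2:].strip().lower()
            continue
        if key == "L=":
            # Assumption: "L=" marks the first language of the list.
            token = token[2:].strip()
        name = token.casefold()
        if name and name not in languages:
            languages.append(name)
    return languages, output_format


def effective_languages(request_langs, job_langs, default_langs):
    """Request languages win over job languages, which win over defaults.

    job_langs is None (or empty) when the job inherits the defaults.
    """
    return request_langs or job_langs or default_langs
```

For instance, parse_subject("IMiS OCR Server Request, L=English, Slovenian, O=pdf") yields the languages english and slovenian with the output format pdf. A subject without any language falls back to the job or default languages through effective_languages.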
MAIN LANGUAGES:
Armenian (Eastern), Armenian (Grabar), Armenian (Western), Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, Dutch (Belgian), English, Estonian, Finnish, French, German, German (new spelling), Greek, Hungarian, Italian, Latvian, Lithuanian, Norwegian, Norwegian (Bokmal), Norwegian (Nynorsk), Polish, Portuguese, Portuguese (Brazilian), Romanian, Russian, Slovak, Swedish, Tatar, Turkish, Ukrainian.
ADDITIONAL LANGUAGES:
Abkhaz, Adyghian, Afrikaans, Agul, Albanian, Altai, Avar, Aymara, Azeri (Cyrillic), Azeri (Latin), Bashkir, Basque, Bemba, Blackfoot, Breton, Bugotu, Buryat, Belarusian, Cebuano, Chamorro, Chechen, Chukchee, Chuvash, Corsican, Crimean Tatar, Crow, Dargwa, Dungan, Dakota, Eskimo (Cyrillic), Eskimo (Latin), Even, Evenki, Faroese, Fijian, Frisian, Friulian, Scottish Gaelic, Gagauz, Galician, Ganda, German (Luxembourg), Guarani, Hani, Hausa, Hawaiian, Icelandic, Indonesian, Ingush, Irish, Kabardian, Kalmyk, Karachay-balkar, Karakalpak, Kasub, Kawa, Kazakh, Khakass, Khanty, Kikuyu, Kirghiz, Kongo, Koryak, Kpelle, Kumyk, Kurdish, Lak, Sami (Lappish), Latin, Lezgi, Luba, Macedonian, Malagasy, Malay (Malaysian), Malinke, Maltese, Mansi, Maori, Mari, Maya, Miao, Minangkabau, Mohawk, Moldavian, Mongol, Mordvin, Nahuatl, Nenets, Nivkh, Nogay, Nyanja, Ojibway, Ossetian, Papiamento, Tok Pisin, Provencal, Quechua, Rhaeto-Romanic, Romany, Rwanda, Rundi, Russian (Old Spelling), Samoan, Selkup, Serbian (Cyrillic), Serbian (Latin), Shona, Slovenian, Somali, Sorbian, Sotho, Sunda, Swahili, Swazi, Tabasaran, Tagalog, Tahitian, Tajik, Jingpo, Tongan, Tswana, Tun, Turkmen, Tuvinian, Udmurt, Uzbek (Cyrillic), Uzbek (Latin), Welsh, Wolof, Xhosa, Yakut, Zapotec, Zulu.
ARTIFICIAL LANGUAGES:
Esperanto, Interlingua, Ido, Occidental.
FORMAL LANGUAGES:
Basic, C/C++, COBOL, Digits, Fortran, Pascal, Simple chemical formulas, MICR (E-13B), MICR (CMC-7).
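With language names taken from the lists above, a request message could be assembled roughly as in the following Python sketch, which uses the standard library email package. The build_request function, the addresses and the attachment name are hypothetical. That the server accepts an ordinary MIME attachment of this kind is an assumption, not something this article confirms.

```python
from email.message import EmailMessage


def build_request(sender, recipient, languages, output_format,
                  attachment_name, attachment_bytes):
    """Build an email whose subject follows the documented request syntax."""
    if not languages:
        raise ValueError("at least one language is required in this sketch")

    # Subject form: IMiS OCR Server Request, L=<languages>, O=<output-format>
    subject = "IMiS OCR Server Request, L=" + ", ".join(languages)
    if output_format:
        subject += ", O=" + output_format

    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content("OCR request")
    # The scanned document travels as a generic binary attachment.
    msg.add_attachment(attachment_bytes,
                       maintype="application",
                       subtype="octet-stream",
                       filename=attachment_name)
    return msg


# Hypothetical addresses and file name, for illustration only.
request = build_request("user@example.com", "ocr@example.com",
                        ["English", "Slovenian"], "pdf",
                        "scan.tif", b"placeholder image bytes")
```

The resulting message can be handed to any mail client or SMTP library. Only the subject line format is taken from this article.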
Related Documents:
IMiS/OCR Server 7.6.907