Arabic documents processing is one of the urgent tasks to be reviewed today. Spread of digitized resources makes the informational society use modern technologies in order to retrieve exact data within a minimum period. Documents processing demands developing of new approaches on the bases of analysis of the Arabic graphic in particular and peculiarities of the Arabic language as a whole. Currently different principles of document processing in Arabic are applied. Some of them are based on the grammar structure of the language, while others use techniques that rely on images classification. The paper critically analyses main techniques for different types of Arabic document classification and suggests the most efficient methods for their processing. Choice of the most relevant approach depends on the character of a certain extra linguistic task, for example principles of classification of modern handwritten documents varies from the technique, used for historical manuscript processing.

Язык оригиналаанглийский
Название основной публикацииProceedings of the 9th International Multi-Conference on Society, Cybernetics and Informatics (July 12-15. 2015, Orlando, USA)
ИздательInternational Institute of Informatics and Systemics
Страницы81-86
Число страниц6
ISBN (электронное издание)9781941763308
СостояниеОпубликовано - 2015
Опубликовано для внешнего пользованияДа
Событие9th International Multi-Conference on Society, Cybernetics and Informatics, IMSCI 2015 - Orlando, Соединенные Штаты Америки
Продолжительность: 11 июл 201514 июл 2015

конференция

конференция9th International Multi-Conference on Society, Cybernetics and Informatics, IMSCI 2015
Страна/TерриторияСоединенные Штаты Америки
ГородOrlando
Период11/07/1514/07/15

    Предметные области Scopus

  • Информационные системы
  • Искусственный интеллект

ID: 4746239