Belarusian National Technical University
Repository of the Belarusian National Technical University
ISSN: 2310-7405

Optimizing the performance of a server-based classification for a large business document flow

DOI
10.21122/2309-4923-2022-4-60-64
Authors
Slavin, O. A.
Date
2022
Publisher
BNTU
Bibliographic entry
Slavin, O. A. Optimizing the performance of a server-based classification for a large business document flow / O. A. Slavin // Системный анализ и прикладная информатика (System Analysis and Applied Information Science). – 2022. – No. 4. – P. 60-64.
Abstract
This paper considers the problem of categorizing documents in a large business document flow. Both textual and visual embeddings were employed for classification: textual embeddings were extracted with the Tesseract OCR engine, and visual embeddings were generated with the Viola-Jones method. The paper describes the performance optimization techniques applied to the implemented classification algorithm, which was executed on servers with Intel CPUs. For the single-threaded implementation, both high-level and low-level optimizations were performed. High-level optimization was based on parametrizing the recognition algorithms and using intermediate data. Low-level optimization was carried out with compiler tools that support an extended set of SIMD instructions. Parallelization using several multithreaded applications on multiple servers is also described. The proposed solution was tested on the author's own test data sets of business documents. The proposed method can be applied in modern information systems to analyze the content of a large flow of digital document images.
URI
https://rep.bntu.by/handle/data/126534

Belarusian National Technical University | Science Library | About Repository | Depositing in the Repository | Contact Us