Software Performance Optimization for Classification and Linking of Administrative Documents

Guardat en:
Dades bibliogràfiques
Publicat a:Programming and Computer Software vol. 50, no. 6 (Dec 2024), p. 457
Autor principal: Slavin, O. A.
Publicat:
Springer Nature B.V.
Matèries:
Accés en línia:Citation/Abstract
Full Text
Full Text - PDF
Etiquetes: Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!

MARC

LEADER 00000nab a2200000uu 4500
001 3130548260
003 UK-CbPIL
022 |a 0361-7688 
022 |a 1608-3261 
024 7 |a 10.1134/S0361768824700324  |2 doi 
035 |a 3130548260 
045 2 |b d20241201  |b d20241231 
100 1 |a Slavin, O. A.  |u Federal Research Center “Computer Science and Control,” Russian Academy of Sciences, Moscow, Russia (GRID:grid.4886.2) (ISNI:0000 0001 2192 9124); LLC Smart Engines Service, Moscow, Russia (GRID:grid.518849.9) 
245 1 |a Software Performance Optimization for Classification and Linking of Administrative Documents 
260 |b Springer Nature B.V.  |c Dec 2024 
513 |a Journal Article 
520 3 |a This paper discusses technologies for software performance optimization. Optimization methods are divided into high-level and low-level, as well as parallelization. The described optimization methods are applied to programs and software systems for processing large volumes of information, which have hot spots. An algorithm for classifying and linking fields in a recognized image of an administrative document is described. The implementation features of the classification and linking tasks, which consist in using constellations of text key points and a modified Levenshtein distance, are considered. For optical character recognition (OCR), Smart Document Engine and Tesseract are employed. Several methods used to optimize the performance of functions for document classification and linking are described. The performance optimization of the system for sorting administrative document image streams is considered. The proposed methods for software performance optimization are suitable not only for image processing algorithms but also for computational algorithms with cyclic information processing. The approach can also be used in modern CAD systems to analyze the content of recognized text files. 
653 |a Operating systems 
653 |a Parallel processing 
653 |a Software 
653 |a Data processing 
653 |a Software development 
653 |a Classification 
653 |a Optical character recognition 
653 |a Optimization techniques 
653 |a Documents 
653 |a Optimization 
653 |a Methods 
653 |a Algorithms 
653 |a Performance evaluation 
653 |a Sorting algorithms 
653 |a Image processing 
653 |a Workloads 
653 |a Product development 
773 0 |t Programming and Computer Software  |g vol. 50, no. 6 (Dec 2024), p. 457 
786 0 |d ProQuest  |t Advanced Technologies & Aerospace Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3130548260/abstract/embedded/6A8EOT78XXH2IG52?source=fedsrch 
856 4 0 |3 Full Text  |u https://www.proquest.com/docview/3130548260/fulltext/embedded/6A8EOT78XXH2IG52?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3130548260/fulltextPDF/embedded/6A8EOT78XXH2IG52?source=fedsrch