Standard

Универсальная модель архитектуры краудсорсинговой системы разметки и подготовки медицинских данных [A universal architecture model of a crowdsourcing system for medical data labeling and preparation]. / Коваленко, Лев Алексеевич; Блеканов, Иван Станиславович; Ежов, Федор Валерьевич; Ларин, Евгений Сергеевич; Ким, Глеб Ирламович.

In: Научно-технический вестник СПбГУ ИТМО, Vol. 25, No. 5, 27.10.2025, p. 844-855.

Research output: Contribution to journal › Article › peer-review

Author

Коваленко, Лев Алексеевич ; Блеканов, Иван Станиславович ; Ежов, Федор Валерьевич ; Ларин, Евгений Сергеевич ; Ким, Глеб Ирламович. / Универсальная модель архитектуры краудсорсинговой системы разметки и подготовки медицинских данных [A universal architecture model of a crowdsourcing system for medical data labeling and preparation]. In: Научно-технический вестник СПбГУ ИТМО. 2025 ; Vol. 25, No. 5. pp. 844-855.

BibTeX

@article{3599eedd037c40019db016e55be8af73,
title = "Универсальная модель архитектуры краудсорсинговой системы разметки и подготовки медицинских данных",
abstract = "Machine learning (ML) and artificial intelligence (AI) methods are used to process and analyze medical data. Applying these methods requires large, specialized sets of labeled medical data. Organizing high-quality medical data labeling requires a large number of assessors and specialists in the relevant field of medicine, as well as specialized tools that optimize the labeling process while accounting for the specifics of medical data processing. This paper proposes a universal architectural model of a crowdsourcing system designed specifically for medical data labeling. The model supports diverse medical data formats, incorporates data anonymization mechanisms and multi-level quality control, and enables a distributed annotation process that involves the expert community. As a result, current problems in medical data labeling and data collection were classified, and quality and safety criteria for the comparative analysis of medical data labeling systems were formulated. A generalized scenario of how user groups interact with a crowdsourcing system when solving AI problems in medicine was proposed. A universal model of such a system architecture was designed, and a specialized crowdsourcing system for medical data labeling based on Computer Vision Annotation Tool was implemented on its basis. The implemented system was tested and evaluated at the Pirogov Clinic of High Medical Technologies. The proposed universal model can be used to improve the efficiency and safety of organizing the labeling of patients{\textquoteright} medical data for various applied ML/AI tasks, such as semantic segmentation of internal organs and their pathologies and the detection and classification of diseases from medical images (e.g. computed tomography scans). The developed solution can be used by doctors of various specializations, researchers, and developers working on AI methods and technologies in medicine.",
keywords = "crowdsourcing, medical data annotation, medical data processing, quality criteria for crowdsourcing systems, software architecture model, use case",
author = "Коваленко, {Лев Алексеевич} and Блеканов, {Иван Станиславович} and Ежов, {Федор Валерьевич} and Ларин, {Евгений Сергеевич} and Ким, {Глеб Ирламович}",
year = "2025",
month = oct,
day = "27",
doi = "10.17586/2226-1494-2025-25-5-844-855",
language = "Russian",
volume = "25",
pages = "844--855",
journal = "Scientific and Technical Journal of Information Technologies, Mechanics and Optics",
issn = "2226-1494",
publisher = "НИУ ИТМО",
number = "5",

}

RIS

TY - JOUR

T1 - Универсальная модель архитектуры краудсорсинговой системы разметки и подготовки медицинских данных

AU - Коваленко, Лев Алексеевич

AU - Блеканов, Иван Станиславович

AU - Ежов, Федор Валерьевич

AU - Ларин, Евгений Сергеевич

AU - Ким, Глеб Ирламович

PY - 2025/10/27

Y1 - 2025/10/27

AB - Machine learning (ML) and artificial intelligence (AI) methods are used to process and analyze medical data. Applying these methods requires large, specialized sets of labeled medical data. Organizing high-quality medical data labeling requires a large number of assessors and specialists in the relevant field of medicine, as well as specialized tools that optimize the labeling process while accounting for the specifics of medical data processing. This paper proposes a universal architectural model of a crowdsourcing system designed specifically for medical data labeling. The model supports diverse medical data formats, incorporates data anonymization mechanisms and multi-level quality control, and enables a distributed annotation process that involves the expert community. As a result, current problems in medical data labeling and data collection were classified, and quality and safety criteria for the comparative analysis of medical data labeling systems were formulated. A generalized scenario of how user groups interact with a crowdsourcing system when solving AI problems in medicine was proposed. A universal model of such a system architecture was designed, and a specialized crowdsourcing system for medical data labeling based on Computer Vision Annotation Tool was implemented on its basis. The implemented system was tested and evaluated at the Pirogov Clinic of High Medical Technologies. The proposed universal model can be used to improve the efficiency and safety of organizing the labeling of patients’ medical data for various applied ML/AI tasks, such as semantic segmentation of internal organs and their pathologies and the detection and classification of diseases from medical images (e.g. computed tomography scans). The developed solution can be used by doctors of various specializations, researchers, and developers working on AI methods and technologies in medicine.

KW - crowdsourcing

KW - medical data annotation

KW - medical data processing

KW - quality criteria for crowdsourcing systems

KW - software architecture model

KW - use case

UR - https://www.mendeley.com/catalogue/f2c2bac8-e6e8-3a52-8c96-304d42f9f6eb/

U2 - 10.17586/2226-1494-2025-25-5-844-855

DO - 10.17586/2226-1494-2025-25-5-844-855

M3 - Article

VL - 25

SP - 844

EP - 855

JO - Scientific and Technical Journal of Information Technologies, Mechanics and Optics

JF - Scientific and Technical Journal of Information Technologies, Mechanics and Optics

SN - 2226-1494

IS - 5

ER -

ID: 143039188