Text segmentation on photorealistic images

Standard

Text segmentation on photorealistic images. / Grishkin, Valery ; Ebral, Alexander ; Stepenko, Nikolai ; Sene, Jean.

в: CEUR Workshop Proceedings, Том 2267, 01.01.2018, стр. 369-373.

Результаты исследований: Научные публикации в периодических изданиях › статья в журнале по материалам конференции › Рецензирование

BibTeX

@article{e27f2e9ca8c341a0a9a7c4e171d7decb,

title = "Text segmentation on photorealistic images",

abstract = "The paper proposes an algorithm for segmentation of text, applied or presented in photorealistic images, characterized by a complex background. The algorithm is able to determine the exact location of image regions containing text. It implements the method for semantic segmentation of images, while the text symbols serve as detectable objects. The original images are pre-processed and fed to the input of the pre-trained convolutional neural network. The paper proposes a network architecture for text segmentation, describes the procedure for the formation of the training set, and considers the algorithm for pre-processing images, reducing the amount of processed data and simplifying the segmentation of the object {"}background{"}. The network architecture is a modification of well-known ResNet network and takes into account the specifics of text character images. The convolutional neural network is implemented using CUDA parallel computing technology at the GPU. The experimental results for evaluating quality of text segmentation with IoU (Intersection over Union) criterion have proved effectiveness of the proposed method.",

keywords = "Convolution neural network, Semantic segmentation, Text segmentation",

author = "Valery Grishkin and Alexander Ebral and Nikolai Stepenko and Jean Sene",

year = "2018",

month = jan,

day = "1",

language = "English",

volume = "2267",

pages = "369--373",

journal = "CEUR Workshop Proceedings",

issn = "1613-0073",

publisher = "RWTH Aahen University",

note = "8th International Conference {"}Distributed Computing and Grid-Technologies in Science and Education{"}, GRID 2018 ; Conference date: 10-09-2018 Through 14-09-2018",

}

RIS

TY - JOUR

T1 - Text segmentation on photorealistic images

AU - Grishkin, Valery

AU - Ebral, Alexander

AU - Stepenko, Nikolai

AU - Sene, Jean

PY - 2018/1/1

Y1 - 2018/1/1

N2 - The paper proposes an algorithm for segmentation of text, applied or presented in photorealistic images, characterized by a complex background. The algorithm is able to determine the exact location of image regions containing text. It implements the method for semantic segmentation of images, while the text symbols serve as detectable objects. The original images are pre-processed and fed to the input of the pre-trained convolutional neural network. The paper proposes a network architecture for text segmentation, describes the procedure for the formation of the training set, and considers the algorithm for pre-processing images, reducing the amount of processed data and simplifying the segmentation of the object "background". The network architecture is a modification of well-known ResNet network and takes into account the specifics of text character images. The convolutional neural network is implemented using CUDA parallel computing technology at the GPU. The experimental results for evaluating quality of text segmentation with IoU (Intersection over Union) criterion have proved effectiveness of the proposed method.

AB - The paper proposes an algorithm for segmentation of text, applied or presented in photorealistic images, characterized by a complex background. The algorithm is able to determine the exact location of image regions containing text. It implements the method for semantic segmentation of images, while the text symbols serve as detectable objects. The original images are pre-processed and fed to the input of the pre-trained convolutional neural network. The paper proposes a network architecture for text segmentation, describes the procedure for the formation of the training set, and considers the algorithm for pre-processing images, reducing the amount of processed data and simplifying the segmentation of the object "background". The network architecture is a modification of well-known ResNet network and takes into account the specifics of text character images. The convolutional neural network is implemented using CUDA parallel computing technology at the GPU. The experimental results for evaluating quality of text segmentation with IoU (Intersection over Union) criterion have proved effectiveness of the proposed method.

KW - Convolution neural network

KW - Semantic segmentation

KW - Text segmentation

UR - http://www.scopus.com/inward/record.url?scp=85060102982&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85060102982

VL - 2267

SP - 369

EP - 373

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

T2 - 8th International Conference "Distributed Computing and Grid-Technologies in Science and Education", GRID 2018

Y2 - 10 September 2018 through 14 September 2018

ER -

ID: 47803632

Standard

Harvard

APA

Vancouver

Author

BibTeX

RIS