DOI

The rapid technological development has created new opportunities for language digitalization and the development of language technology applications. The core element of language technology is language resources, which is in a broad sense, can be considered as a scope of the databases that consists of the myriad of texts both in oral and written forms and used in the machine-learning algorithm. The creation of language resources requires two processes: the first one is language digitalization, meaning the transformation of the speech and texts into the machine-responsible form. The second process refers to text mining, which analyzes data by using a machine-learning algorithm. Adoption of the General Data Protection Regulation (GDPR) and Directive on copyright and related rights in the Digital Single Market (DSM Directive) has been building a renewed legal framework that addresses the demands of the digital economies and unseals challenges, opens prospects for further development. We examine the language resources from two perspectives. Firstly, the language resources are considered a database covered by the protection regulation (the person’s rights who created the LR database). Within the second perspective, the legal analysis focuses on the materials used for the language resource creation (data subject’s rights, copyright, related rights). The result of the research can be used for further legal investigations and policy design in the field of language technology development.
Язык оригиналаанглийский
Название основной публикацииLanguage and Technology Conference LTC 2019
Подзаголовок основной публикацииHuman Language Technology. Challenges for Computer Science and Linguistics
ИздательSpringer Nature
Страницы367-376
Число страниц10
ISBN (печатное издание)9783031053276
DOI
СостояниеОпубликовано - 2022
Событие9th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics - Познань, Польша
Продолжительность: 17 мая 201919 мая 2019
http://ltc.amu.edu.pl/a2019/

Серия публикаций

НазваниеLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Том13212 LNAI

конференция

конференция9th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics
Страна/TерриторияПольша
ГородПознань
Период17/05/1919/05/19
Сайт в сети Internet

ID: 111202967