Rapid technological development has created new opportunities for language digitalization and for the development of language technology applications. The core element of language technology is language resources (LRs), which, in a broad sense, can be regarded as databases consisting of a myriad of texts in both oral and written form that are used in machine-learning algorithms. The creation of language resources requires two processes. The first is language digitalization, meaning the transformation of speech and texts into machine-readable form. The second is text mining, which analyzes data by using a machine-learning algorithm. The adoption of the General Data Protection Regulation (GDPR) and the Directive on copyright and related rights in the Digital Single Market (DSM Directive) has been building a renewed legal framework that addresses the demands of digital economies, reveals challenges, and opens prospects for further development. We examine language resources from two perspectives. First, language resources are considered as a database covered by protection regulation (the rights of the person who created the LR database). Second, the legal analysis focuses on the materials used to create language resources (data subjects' rights, copyright, and related rights). The results of the research can be used for further legal investigations and policy design in the field of language technology development.
Original language: English
Title of host publication: Language and Technology Conference LTC 2019
Subtitle of host publication: Human Language Technology. Challenges for Computer Science and Linguistics
Publisher: Springer Nature
Pages: 367-376
Number of pages: 10
ISBN (Print): 9783031053276
State: Published - 2022
Event: 9th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics - Poznań, Poland
Duration: 17 May 2019 to 19 May 2019
http://ltc.amu.edu.pl/a2019/

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13212 LNAI

Conference

Conference: 9th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics
Country/Territory: Poland
City: Poznań
Period: 17/05/19 to 19/05/19

    Research areas

  • Copyright, Intellectual property protection, Language technology, Text mining

ID: 111202967