Standard

Progress in Natural Language Processing Technologies: Regulating Quality and Accessibility of Training Data. / Ильин, Илья Геннадьевич.

In: Legal Issues in the Digital Age, Vol. 5, No. 2, 20.07.2024, p. 36-56.

Research output: Contribution to journalArticlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{b7b980833adb45bca5d0fd7c2da13e46,
title = "Progress in Natural Language Processing Technologies: Regulating Quality and Accessibility of Training Data",
abstract = "Progress in natural language processing technologies (NLP) is a cardinal factor of major socioeconomic importance behind innovative digital products. However, inadequate legal regulation of quality and accessibility of training data is a major obstacle to this technological development. The paper is focused on regulatory issues affecting the quality and accessibility of data needed for language model training. In analyzing the normative barriers and proposing ways to remove them, the author of the paper argues for the need to develop a comprehensive regulatory system designed to ensure sustainable development of the technology.",
author = "Ильин, {Илья Геннадьевич}",
year = "2024",
month = jul,
day = "20",
doi = "10.17323/2713-2749.2024.2.36.56",
language = "English",
volume = "5",
pages = "36--56",
journal = "Legal Issues in the Digital Age",
issn = "2713-2749",
publisher = "Издательский дом НИУ ВШЭ",
number = "2",

}

RIS

TY - JOUR

T1 - Progress in Natural Language Processing Technologies: Regulating Quality and Accessibility of Training Data

AU - Ильин, Илья Геннадьевич

PY - 2024/7/20

Y1 - 2024/7/20

N2 - Progress in natural language processing technologies (NLP) is a cardinal factor of major socioeconomic importance behind innovative digital products. However, inadequate legal regulation of quality and accessibility of training data is a major obstacle to this technological development. The paper is focused on regulatory issues affecting the quality and accessibility of data needed for language model training. In analyzing the normative barriers and proposing ways to remove them, the author of the paper argues for the need to develop a comprehensive regulatory system designed to ensure sustainable development of the technology.

AB - Progress in natural language processing technologies (NLP) is a cardinal factor of major socioeconomic importance behind innovative digital products. However, inadequate legal regulation of quality and accessibility of training data is a major obstacle to this technological development. The paper is focused on regulatory issues affecting the quality and accessibility of data needed for language model training. In analyzing the normative barriers and proposing ways to remove them, the author of the paper argues for the need to develop a comprehensive regulatory system designed to ensure sustainable development of the technology.

UR - https://www.mendeley.com/catalogue/54ea1de9-4a50-3091-82d2-8d1a07b4d8b5/

U2 - 10.17323/2713-2749.2024.2.36.56

DO - 10.17323/2713-2749.2024.2.36.56

M3 - Article

VL - 5

SP - 36

EP - 56

JO - Legal Issues in the Digital Age

JF - Legal Issues in the Digital Age

SN - 2713-2749

IS - 2

ER -

ID: 122054853