Standard

Extending the applicability of the Zipf’s laws to the sequences of byte data. / Сергеев, Сергей Львович; Блеканов, Иван Станиславович; Ежов, Федор Валерьевич; Тарасов, Никита Андреевич.

In: Вестник Санкт-Петербургского университета. Прикладная математика. Информатика. Процессы управления, Vol. 20, No. 3, 2024, pp. 391–403.

Research output: Scientific publications in periodicals, article, peer-reviewed

Harvard

Сергеев, СЛ, Блеканов, ИС, Ежов, ФВ & Тарасов, НА 2024, 'Extending the applicability of the Zipf’s laws to the sequences of byte data', Вестник Санкт-Петербургского университета. Прикладная математика. Информатика. Процессы управления, vol. 20, no. 3, pp. 391–403. https://doi.org/10.21638/spbu10.2024.307

APA

Сергеев, С. Л., Блеканов, И. С., Ежов, Ф. В., & Тарасов, Н. А. (2024). Extending the applicability of the Zipf’s laws to the sequences of byte data. Вестник Санкт-Петербургского университета. Прикладная математика. Информатика. Процессы управления, 20(3), 391–403. https://doi.org/10.21638/spbu10.2024.307

Vancouver

Сергеев СЛ, Блеканов ИС, Ежов ФВ, Тарасов НА. Extending the applicability of the Zipf’s laws to the sequences of byte data. Вестник Санкт-Петербургского университета. Прикладная математика. Информатика. Процессы управления. 2024;20(3):391–403. https://doi.org/10.21638/spbu10.2024.307

Author

Сергеев, Сергей Львович ; Блеканов, Иван Станиславович ; Ежов, Федор Валерьевич ; Тарасов, Никита Андреевич. / Extending the applicability of the Zipf’s laws to the sequences of byte data. In: Вестник Санкт-Петербургского университета. Прикладная математика. Информатика. Процессы управления. 2024 ; Vol. 20, No. 3. pp. 391–403.

BibTeX

@article{955a063a6a924d2c84156b423f42b732,
title = "Extending the applicability of the Zipf{\textquoteright}s laws to the sequences of byte data",
abstract = "Zipf{\textquoteright}s law has been shown to hold true in many places. From its first idea of a statistical phenomenon related to natural language to its later adaptations for economic, social and many other fields, it has been shown to work almost universally. In all of these cases authors discuss the applicability of Zipf{\textquoteright}s law in terms of semantically complex structures. We take this notion a step further and show how this law can work for data analysis, in particular for sequences of byte data obtained from various sources. We show that, using the basic chunking methodology, Zipf{\textquoteright}s law can be shown to hold true for many different types of raw sequences of byte data. In particular, the law holds true in all cases for the “middle point” of data, where it is present with a degree of certainty of more than 90%. We conclude by discussing the implications and potential use cases of these findings.",
keywords = "Zipf{\textquoteright}s laws, byte data, chunking, frequency analysis",
author = "Сергеев, {Сергей Львович} and Блеканов, {Иван Станиславович} and Ежов, {Федор Валерьевич} and Тарасов, {Никита Андреевич}",
year = "2024",
doi = "10.21638/spbu10.2024.307",
language = "English",
volume = "20",
pages = "391--403",
journal = "ВЕСТНИК САНКТ-ПЕТЕРБУРГСКОГО УНИВЕРСИТЕТА. ПРИКЛАДНАЯ МАТЕМАТИКА. ИНФОРМАТИКА. ПРОЦЕССЫ УПРАВЛЕНИЯ",
issn = "1811-9905",
publisher = "Издательство Санкт-Петербургского университета",
number = "3",

}

RIS

TY - JOUR

T1 - Extending the applicability of the Zipf’s laws to the sequences of byte data

AU - Сергеев, Сергей Львович

AU - Блеканов, Иван Станиславович

AU - Ежов, Федор Валерьевич

AU - Тарасов, Никита Андреевич

PY - 2024

Y1 - 2024

N2 - Zipf’s law has been shown to hold true in many places. From its first idea of a statistical phenomenon related to natural language to its later adaptations for economic, social and many other fields, it has been shown to work almost universally. In all of these cases authors discuss the applicability of Zipf’s law in terms of semantically complex structures. We take this notion a step further and show how this law can work for data analysis, in particular for sequences of byte data obtained from various sources. We show that, using the basic chunking methodology, Zipf’s law can be shown to hold true for many different types of raw sequences of byte data. In particular, the law holds true in all cases for the “middle point” of data, where it is present with a degree of certainty of more than 90%. We conclude by discussing the implications and potential use cases of these findings.

AB - Zipf’s law has been shown to hold true in many places. From its first idea of a statistical phenomenon related to natural language to its later adaptations for economic, social and many other fields, it has been shown to work almost universally. In all of these cases authors discuss the applicability of Zipf’s law in terms of semantically complex structures. We take this notion a step further and show how this law can work for data analysis, in particular for sequences of byte data obtained from various sources. We show that, using the basic chunking methodology, Zipf’s law can be shown to hold true for many different types of raw sequences of byte data. In particular, the law holds true in all cases for the “middle point” of data, where it is present with a degree of certainty of more than 90%. We conclude by discussing the implications and potential use cases of these findings.

KW - Zipf’s laws

KW - byte data

KW - chunking

KW - frequency analysis

UR - https://applmathjournal.spbu.ru/article/view/19038

UR - https://www.mendeley.com/catalogue/cf07abe2-23c0-3640-a2cc-36612fab6694/

U2 - 10.21638/spbu10.2024.307

DO - 10.21638/spbu10.2024.307

M3 - Article

VL - 20

SP - 391

EP - 403

JO - ВЕСТНИК САНКТ-ПЕТЕРБУРГСКОГО УНИВЕРСИТЕТА. ПРИКЛАДНАЯ МАТЕМАТИКА. ИНФОРМАТИКА. ПРОЦЕССЫ УПРАВЛЕНИЯ

JF - ВЕСТНИК САНКТ-ПЕТЕРБУРГСКОГО УНИВЕРСИТЕТА. ПРИКЛАДНАЯ МАТЕМАТИКА. ИНФОРМАТИКА. ПРОЦЕССЫ УПРАВЛЕНИЯ

SN - 1811-9905

IS - 3

ER -

ID: 126974789