It is widely accepted that nowadays a significant part of the content on the internet is generated by users of social media platforms which form the basis of Web 2.0. That is why modern media researchers use user-generated content to test their scientific hypotheses using automated data analysis methods and data mining tools. In this study we examine the main approaches to user opinion data crawling in modern social media platforms for subsequent analysis for scientific and research purposes. We propose a data collection approach based on reverse engineering of APK applications, which allows for data extraction from social networks that will not differ in completeness from data from mobile applications. A comparative analysis of the proposed methods in terms of completeness and execution speed is also carried out. According to our findings, implementing a custom REST API is the best approach as it is both reliable and computationally efficient.
Original languageEnglish
Title of host publicationSocial Computing and Social Media. HCII 2024.
PublisherSpringer Nature
Pages72-81
Number of pages10
ISBN (Electronic)978-3-031-61281-7
ISBN (Print)978-3-031-61280-0
DOIs
StatePublished - 2024
EventSocial Computing and Social Media: 16th International Conference: Held as Part of the 26th HCI International Conference - Washington, United States
Duration: 29 Jun 20244 Jul 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14703 LNCS

Conference

ConferenceSocial Computing and Social Media: 16th International Conference
Abbreviated titleSCSM 2024
Country/TerritoryUnited States
CityWashington
Period29/06/244/07/24

    Research areas

  • API, Data Collection, Data Crawling, SSL Pinning, User Opinion, Web Crawler, gRPC

ID: 125271477