Extracting useful and relevant information from the massive amount of data by web users becomes a challenging task. This work regarding applications based on the use of Web Usage Mining (WUM). Clickstream, transaction data, and user profile data represent various sources of web usage data. Preprocessing is an essential process in WUM for understanding the user's life as a whole within a session on a website. It signifies the aggregation of consequent series of page views executed by a singular user traversing through a website. Therefore, it represents a critical task that involves converting raw web log data to obtain a suitable pattern for efficient analysis. The importance of these processes relies on the complex nature of web architecture, and it takes almost 80% of the mining process. This work introduces a complete framework for data preprocessing with a comprehensive demonstration of a real dataset.

Original languageEnglish
Title of host publication2020 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728169514
DOIs
StatePublished - 6 Oct 2020
Event2020 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2020 - Vladivostok, Russian Federation
Duration: 6 Oct 20209 Oct 2020

Publication series

Name2020 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2020

Conference

Conference2020 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2020
Country/TerritoryRussian Federation
CityVladivostok
Period6/10/209/10/20

    Scopus subject areas

  • Control and Optimization
  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems and Management
  • Energy Engineering and Power Technology
  • Renewable Energy, Sustainability and the Environment
  • Electrical and Electronic Engineering
  • Industrial and Manufacturing Engineering

    Research areas

  • clickstream analysis, data preprocessing, link prediction, server log, web mining, web usage mining

ID: 72796541