This article addresses a special case of data preparation: vehicle movement data that arrives from its source in large volumes and must be prepared for the accelerated application of data mining methods. Data preparation proceeds through several stages, from selecting the necessary fields and records to saving them, with modified values, into a new data structure. The source data, which consists of 18 fields, contains a share of incorrect information, and some of its numerical values are stored in formats unsuitable for further processing. Because the source data is large, processing it in its original form takes a very long time. The article shows how to use the pthreads library to organize multi-threaded processing of this data. To confirm the applicability of this library, the article presents the results of numerical experiments.
Original language: English
Title of host publication: Computational Science and Its Applications – ICCSA 2017
Subtitle of host publication: 17th International Conference, Trieste, Italy, July 3-6, 2017, Proceedings, Part V
Publisher: Springer Nature
Pages: 463-472
ISBN (Electronic): 978-3-319-62404-4
ISBN (Print): 978-3-319-62403-7
State: Published - 2017
Event: 17th International Conference on Computational Science and Its Applications, ICCSA 2017 - Trieste, Italy
Duration: 2 Jul 2017 – 5 Jul 2017
Conference number: 17

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer Nature
Volume: 10408
ISSN (Print): 0302-9743

Conference

Conference: 17th International Conference on Computational Science and Its Applications, ICCSA 2017
Abbreviated title: ICCSA 2017
Country/Territory: Italy
City: Trieste
Period: 2/07/17 – 5/07/17

    Research areas

  • Data mining, Data cleaning, Data transformation, PHP, Pthreads

ID: 71328049