Standard

Technology of cleaning and transforming data using the knowledge discovery in databases (KDD) technology for fast application of data mining methods. / Shichkina, Y. A.; Degtyarev, A. B.; Koblov, A. A.

In: CEUR Workshop Proceedings, Vol. 1787, 01.01.2016, p. 428-434.

Research output: Contribution to journalConference articlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{ffd150c0932241a1b20fb259281b767b,
title = "Technology of cleaning and transforming data using the knowledge discovery in databases (KDD) technology for fast application of data mining methods",
abstract = "For large data amounts today brings a nu mber of reasons. The increase in databases makes new and very serious hardware requirements of data centers and in recent years, it requires greater investment in hardware, software, the corresponding work and management. The decision of the main problems associated with the actual har dware infrastructure of data center, it must be said, it is much cheaper than the improvement of the software. But this is only a temporary solution. We must look for a global solution to the problem of large data. It is necessary to improve the methods of data processing with the help of the equipment that is available. This article discusses methods of cleaning and transforming data within the Knowledge Discovery in Databases technology for fast applying data mining techniques. In particular, the article shows how the metho d can significantly reduce the data selection for query building in noSQL databases on the example of MongoDB.",
keywords = "Big data, Data cleaning, Database, MongoDB, Query",
author = "Shichkina, {Y. A.} and Degtyarev, {A. B.} and Koblov, {A. A.}",
year = "2016",
month = jan,
day = "1",
language = "English",
volume = "1787",
pages = "428--434",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "RWTH Aahen University",
note = "7th International Conference Distributed Computing and Gridtechnologies in Science and Education, GRID 2016 ; Conference date: 04-07-2016 Through 09-07-2016",

}

RIS

TY - JOUR

T1 - Technology of cleaning and transforming data using the knowledge discovery in databases (KDD) technology for fast application of data mining methods

AU - Shichkina, Y. A.

AU - Degtyarev, A. B.

AU - Koblov, A. A.

PY - 2016/1/1

Y1 - 2016/1/1

N2 - For large data amounts today brings a nu mber of reasons. The increase in databases makes new and very serious hardware requirements of data centers and in recent years, it requires greater investment in hardware, software, the corresponding work and management. The decision of the main problems associated with the actual har dware infrastructure of data center, it must be said, it is much cheaper than the improvement of the software. But this is only a temporary solution. We must look for a global solution to the problem of large data. It is necessary to improve the methods of data processing with the help of the equipment that is available. This article discusses methods of cleaning and transforming data within the Knowledge Discovery in Databases technology for fast applying data mining techniques. In particular, the article shows how the metho d can significantly reduce the data selection for query building in noSQL databases on the example of MongoDB.

AB - For large data amounts today brings a nu mber of reasons. The increase in databases makes new and very serious hardware requirements of data centers and in recent years, it requires greater investment in hardware, software, the corresponding work and management. The decision of the main problems associated with the actual har dware infrastructure of data center, it must be said, it is much cheaper than the improvement of the software. But this is only a temporary solution. We must look for a global solution to the problem of large data. It is necessary to improve the methods of data processing with the help of the equipment that is available. This article discusses methods of cleaning and transforming data within the Knowledge Discovery in Databases technology for fast applying data mining techniques. In particular, the article shows how the metho d can significantly reduce the data selection for query building in noSQL databases on the example of MongoDB.

KW - Big data

KW - Data cleaning

KW - Database

KW - MongoDB

KW - Query

UR - http://www.scopus.com/inward/record.url?scp=85016167388&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85016167388

VL - 1787

SP - 428

EP - 434

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

T2 - 7th International Conference Distributed Computing and Gridtechnologies in Science and Education, GRID 2016

Y2 - 4 July 2016 through 9 July 2016

ER -

ID: 33811613