Purity: a New Dimension for Measuring Data Centralization Quality

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data has become an asset for companies, originating from various sources, such as IoT paradigms. It is crucial to safeguard its life cycle using suitable, scalable, and effective technologies, like those enabled by cloud computing models. However, in order to extract value from this data, complementary processes of collection, refinement, cleaning, or modeling, among many others, are required. Furthermore, organizations greatly vary in their methodologies and approaches to handling data, which further emphasizes the need for standardized techniques. In this regard, data management methodologies promote the adoption of the various dimensions of data quality in order to ensure the reliability of data across different systems and processes. The main contribution of this manuscript is the proposal of a new data quality dimension, coined purity, to measure the importance of the data in a processing pipeline topology. As a result, organizations can better guarantee the quality of their datasets in order to raise the success of data-driven endeavors within organizations. The proposed methodology is validated in an urban mobility use case.

Original languageEnglish
Title of host publicationICCBDC 2024 - 2024 8th International Conference on Cloud and Big Data Computing
PublisherAssociation for Computing Machinery
Pages8-14
Number of pages7
ISBN (Electronic)9798400717253
DOIs
Publication statusPublished - 8 Nov 2024
Event8th International Conference on Cloud and Big Data Computing, ICCBDC 2024 - Oxford, United Kingdom
Duration: 15 Aug 202417 Aug 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference8th International Conference on Cloud and Big Data Computing, ICCBDC 2024
Country/TerritoryUnited Kingdom
CityOxford
Period15/08/2417/08/24

Keywords

  • Big Data
  • Centrality
  • Computing Continuum
  • Data Quality
  • DataOps

Fingerprint

Dive into the research topics of 'Purity: a New Dimension for Measuring Data Centralization Quality'. Together they form a unique fingerprint.

Cite this