Abstract
Data has become an asset for companies, originating from various sources, such as IoT paradigms. It is crucial to safeguard its life cycle using suitable, scalable, and effective technologies, like those enabled by cloud computing models. However, in order to extract value from this data, complementary processes of collection, refinement, cleaning, or modeling, among many others, are required. Furthermore, organizations greatly vary in their methodologies and approaches to handling data, which further emphasizes the need for standardized techniques. In this regard, data management methodologies promote the adoption of the various dimensions of data quality in order to ensure the reliability of data across different systems and processes. The main contribution of this manuscript is the proposal of a new data quality dimension, coined purity, to measure the importance of the data in a processing pipeline topology. As a result, organizations can better guarantee the quality of their datasets in order to raise the success of data-driven endeavors within organizations. The proposed methodology is validated in an urban mobility use case.
| Original language | English |
|---|---|
| Title of host publication | ICCBDC 2024 - 2024 8th International Conference on Cloud and Big Data Computing |
| Publisher | Association for Computing Machinery |
| Pages | 8-14 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798400717253 |
| DOIs | |
| Publication status | Published - 8 Nov 2024 |
| Event | 8th International Conference on Cloud and Big Data Computing, ICCBDC 2024 - Oxford, United Kingdom Duration: 15 Aug 2024 → 17 Aug 2024 |
Publication series
| Name | ACM International Conference Proceeding Series |
|---|
Conference
| Conference | 8th International Conference on Cloud and Big Data Computing, ICCBDC 2024 |
|---|---|
| Country/Territory | United Kingdom |
| City | Oxford |
| Period | 15/08/24 → 17/08/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
Keywords
- Big Data
- Centrality
- Computing Continuum
- Data Quality
- DataOps
Fingerprint
Dive into the research topics of 'Purity: a New Dimension for Measuring Data Centralization Quality'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver