K2E: Building MLOps Environments for Governing Data and Models Catalogues while Tracking Versions

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Nowadays, there are a variety of problems associated with the process of extracting value and information from data such as: Data heterogeneity, data distribution, model versioning, and the vast variety of techniques and approaches. Due to all this, the data management process becomes hard to implement in real world scenarios. In this context, the catalogue tools for data and Artificial Intelligence models alleviate the burden of dealing with versioning tasks. Thus, the automation of the data and models' management processes is facilitated, complying with DataOps and MLOps good practices. This work in progress enumerates key challenges to address when creating these types of catalogues: On the one hand, the management of the diversity of data and models' internal nature and their different versions, and on the other hand, the provision of adequate meta-information and Governance tools such as access control and auditing. In this paper, the Knowledge to Environment (K2E) platform is presented, whose architecture aims to define the necessary components for the creation of environments that allow working with data and model catalogues. By environment creation, we mean providing a workspace populated with the datasets and models of an organization, while tracking their distinct versions by using specialised catalogues. In addition, this workspace will incorporate added-value tools for governance and auditing. Finally, an approach for implementing K2E is detailed.

Original languageEnglish
Title of host publication2022 IEEE 19th International Conference on Software Architecture Companion, ICSA-C 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages206-209
Number of pages4
ISBN (Electronic)9781665494939
DOIs
Publication statusPublished - 2022
Event19th IEEE International Conference on Software Architecture Companion, ICSA-C 2022 - Honolulu, United States
Duration: 12 Mar 202215 Mar 2022

Publication series

Name2022 IEEE 19th International Conference on Software Architecture Companion, ICSA-C 2022

Conference

Conference19th IEEE International Conference on Software Architecture Companion, ICSA-C 2022
Country/TerritoryUnited States
CityHonolulu
Period12/03/2215/03/22

Funding

ACKNOWLEDGMENT The work presented in this paper has been partially supported by the SPRI Basque Government through their ELKA-RTEK program (DAEKIN project, ref.KK-2020/00035).

FundersFunder number
SPRI Basque Governmentref.KK-2020/00035

    Keywords

    • automation
    • catalogues
    • data
    • datalake
    • DataOps
    • dataset
    • management
    • metadata
    • MlOps
    • models
    • versioning

    Fingerprint

    Dive into the research topics of 'K2E: Building MLOps Environments for Governing Data and Models Catalogues while Tracking Versions'. Together they form a unique fingerprint.

    Cite this