Abstract
Data stream mining extracts information from large quantities of data flowing fast and continuously (data streams). They are usually affected by changes in the data distribution, giving rise to a phenomenon referred to as concept drift. Thus, learning models must detect and adapt to such changes, so as to exhibit a good predictive performance after a drift has occurred. In this regard, the development of effective drift detection algorithms becomes a key factor in data stream mining. In this work we propose CURIECURIE, a drift detector relying on cellular automata. Specifically, in CURIECURIE the distribution of the data stream is represented in the grid of a cellular automata, whose neighborhood rule can then be utilized to detect possible distribution changes over the stream. Computer simulations are presented and discussed to show that CURIECURIE, when hybridized with other base learners, renders a competitive behavior in terms of detection metrics and classification accuracy. CURIECURIE is compared with well-established drift detectors over synthetic datasets with varying drift characteristics.
Original language | English |
---|---|
Pages (from-to) | 2655-2678 |
Number of pages | 24 |
Journal | Data Mining and Knowledge Discovery |
Volume | 35 |
Issue number | 6 |
DOIs | |
Publication status | Published - Nov 2021 |
Keywords
- Concept drift
- Drift detection
- Data stream mining
- Cellular automata
Project and Funding Information
- Project ID
- info:eu-repo/grantAgreement/EC/H2020/783163/EU/Integrated Development 4.0/iDev40
- Funding Info
- This work has received funding support from the ECSEL Joint Undertaking (JU) under grant agreement No 783163 (iDev40 project). The JU receives support from the European Union’s Horizon 2020 research and innovation programme, national grants from Austria, Belgium, Germany, Italy, Spain and Romania, as well as the European Structural and Investment Funds. Authors would like to also thank the ELKARTEK and EMAITEK funding programmes of the Basque Government (Spain)
Fingerprint
Dive into the research topics of 'CURIE: a cellular automaton for concept drift detection: a cellular automaton for concept drift detection'. Together they form a unique fingerprint.Datasets
-
Synthetic datasets for concept drift detection purposes
Lopez Lobo, J. (Creator), Harvard Dataverse, 2020
DOI: 10.7910/DVN/5OWRGB, https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/5OWRGB
Dataset