TY - GEN
T1 - On the creation of diverse ensembles for nonstationary environments using bio-inspired heuristics
AU - Lobo, Jesus L.
AU - Del Ser, Javier
AU - Villar-Rodriguez, Esther
AU - Bilbao, Miren Nekane
AU - Salcedo-Sanz, Sancho
N1 - Publisher Copyright:
© Springer Nature Singapore Pte Ltd. 2017.
PY - 2017
Y1 - 2017
N2 - Recently the relevance of adaptive models for dynamic data environments has turned into a hot topic due to the vast number of scenarios generating nonstationary data streams. When a change (concept drift) in data distribution occurs, the ensembles of models trained over these data sources are obsolete and do not adapt suitably to the new distribution of the data. Although most of the research on the field is focused on the detection of this drift to re-train the ensemble, it is widely known the importance of the diversity in the ensemble shortly after the drift in order to reduce the initial drop in accuracy. In a Big Data scenario in which data can be huge (and also the number of past models), achieving the most diverse ensemble implies the calculus of all possible combinations of models, which is not an easy task to carry out quickly in the long term. This challenge can be formulated as an optimization problem, for which bio-inspired algorithms can play one of the key roles in these adaptive algorithms. Precisely this is the goal of this manuscript: to validate the relevance of the diversity right after drifts, and to unveil how to achieve a highly diverse ensemble by using a self-learning optimization technique.
AB - Recently the relevance of adaptive models for dynamic data environments has turned into a hot topic due to the vast number of scenarios generating nonstationary data streams. When a change (concept drift) in data distribution occurs, the ensembles of models trained over these data sources are obsolete and do not adapt suitably to the new distribution of the data. Although most of the research on the field is focused on the detection of this drift to re-train the ensemble, it is widely known the importance of the diversity in the ensemble shortly after the drift in order to reduce the initial drop in accuracy. In a Big Data scenario in which data can be huge (and also the number of past models), achieving the most diverse ensemble implies the calculus of all possible combinations of models, which is not an easy task to carry out quickly in the long term. This challenge can be formulated as an optimization problem, for which bio-inspired algorithms can play one of the key roles in these adaptive algorithms. Precisely this is the goal of this manuscript: to validate the relevance of the diversity right after drifts, and to unveil how to achieve a highly diverse ensemble by using a self-learning optimization technique.
KW - Bioinspired optimization
KW - Concept drift
KW - Diversity
UR - http://www.scopus.com/inward/record.url?scp=85012202120&partnerID=8YFLogxK
U2 - 10.1007/978-981-10-3728-3_8
DO - 10.1007/978-981-10-3728-3_8
M3 - Conference contribution
AN - SCOPUS:85012202120
SN - 9789811037276
T3 - Advances in Intelligent Systems and Computing
SP - 67
EP - 77
BT - Harmony Search Algorithm - Proceedings of the 3rd International Conference on Harmony Search Algorithm (ICHSA 2017)
A2 - Del Ser, Javier
PB - Springer Verlag
T2 - Proceedings of the 3rd International Conference on Harmony Search Algorithm, ICHSA 2017
Y2 - 22 February 2017 through 24 February 2017
ER -