TY - JOUR
T1 - A Self-Supervised Machine Learning Approach for the Estimation of Open-Circuit Voltage Degradation in Photovoltaic Systems
AU - Riaño, Sandra
AU - Santos, Jose Domingo
AU - Esteras, Miguel
AU - Abanda, Amaia
AU - Del Ser, Javier
N1 - Publisher Copyright:
© 2026 Tecnalia Research and Innovation. Progress in Photovoltaics: Research and Applications published by John Wiley & Sons Ltd.
PY - 2026
Y1 - 2026
N2 - In this work, we present a machine learning (ML) approach that integrates physics-based knowledge with data-driven techniques to estimate the open-circuit voltage ((Formula presented.)) of photovoltaic (PV) systems. This self-supervised approach allows the detection of anomalies in operating systems, enabling the identification of potential faults and degradation related to (Formula presented.) without requiring labelled data. Deviations in (Formula presented.) values are detected by analysing the measurements recorded in the supervisory control and data acquisition (SCADA) system. This analysis is performed by a combination of clustering and regression algorithms. The proposed approach is validated on three different PV installations, showcasing its ability to detect variations in open-circuit voltage and to predict performance degradation. The proposed method achieves an (Formula presented.) -squared (r2) value larger than 0.9 when trained on experimental (Formula presented.) data from three distinct PV systems. Moreover, it estimates (Formula presented.) from SCADA data with an average error below 5% compared with (Formula presented.) – (Formula presented.) curve measurements. The results of this study demonstrate that physics informed ML techniques can significantly enhance the performance and reliability of PV systems, enabling early fault detection and degradation forecasting.
AB - In this work, we present a machine learning (ML) approach that integrates physics-based knowledge with data-driven techniques to estimate the open-circuit voltage ((Formula presented.)) of photovoltaic (PV) systems. This self-supervised approach allows the detection of anomalies in operating systems, enabling the identification of potential faults and degradation related to (Formula presented.) without requiring labelled data. Deviations in (Formula presented.) values are detected by analysing the measurements recorded in the supervisory control and data acquisition (SCADA) system. This analysis is performed by a combination of clustering and regression algorithms. The proposed approach is validated on three different PV installations, showcasing its ability to detect variations in open-circuit voltage and to predict performance degradation. The proposed method achieves an (Formula presented.) -squared (r2) value larger than 0.9 when trained on experimental (Formula presented.) data from three distinct PV systems. Moreover, it estimates (Formula presented.) from SCADA data with an average error below 5% compared with (Formula presented.) – (Formula presented.) curve measurements. The results of this study demonstrate that physics informed ML techniques can significantly enhance the performance and reliability of PV systems, enabling early fault detection and degradation forecasting.
KW - failure detection diagnosis
KW - machine learning
KW - open-circuit voltage estimation
KW - operation and maintenance
KW - performance degradation
KW - photovoltaic systems
UR - https://www.scopus.com/pages/publications/105033289601
U2 - 10.1002/pip.70088
DO - 10.1002/pip.70088
M3 - Article
AN - SCOPUS:105033289601
SN - 1062-7995
JO - Progress in Photovoltaics: Research and Applications
JF - Progress in Photovoltaics: Research and Applications
ER -