The value of multiple data sources in machine learning models for power system event prediction
Hoffmann, Volker; Klemets, Jonatan Ralf Axel; Torsæter, Bendik Nybakk; Rosenlund, Gjert Hovland; Andresen, Christian Andre
Chapter, Peer reviewed
Accepted version
View/ Open
Date
2021Metadata
Show full item recordCollections
- Publikasjoner fra CRIStin - SINTEF Energi [1721]
- SINTEF Energi [1857]
Original version
2021 International Conference on Smart Energy Systems and Technologies - SESTAbstract
We describe a method for assessing the value of additional data sources used in the prediction of unwanted events (voltage dips, earth faults) in the power system. Using this method, machine learning models for event prediction using (combinations of) different data sources are developed. The value of each data source is the improvement in model performance it brings. In addition, feature importance is retrieved using SHapley Additive exPlanations (SHAP). The methodology is applied to models that predict faults based on power quality and weather data. We find that models that combine sources outperform models using either in isolation. They predict ground faults and voltage dips with AUCs (Area Under Curve) of 0.74 and 0.80, respectively. Meteorological data appears more valuable than power quality data and the most important features are dew point, month of the year, and the power spectral density at 4.7 Hz