IEEE Trans. Power Delivery | Predictive Maintenance of Power Transformers — ML-Based DGA Fault Classification | Amaning R.O. (2026) | DOI: 10.5281/zenodo.19322309 |
African Maintenance Engineering Predictive Systems & Asset Management | Submitted: February 2026 | Peer Review Edition |
Predictive Maintenance of Power Transformer Using Machine Learning - A Case Study
Amaning Richmond Ofori
Department of Electrical and Electronic Engineering, University of Mines and Technology (UMaT), Tarkwa, Ghana
Supervised by: Dr. Joseph C. Attachie
Email: oforirichmond209@gmail.com
Article Information Received: February 2026 Revised: March 2026 Accepted: March 2026 DOI: 10.5281/zenodo.19322309 | Keywords Predictive Maintenance; Power Transformer; Dissolved Gas Analysis; Support Vector Machine; K-Nearest Neighbour; Decision Tree; Fault Classification; Machine Learning; IEC TC10; DGA |
ABSTRACT Power transformers are critical assets in electrical power transmission and distribution infrastructure. Their unplanned failures result in substantial economic losses, service disruptions, and safety hazards. Traditional fault detection methods, chiefly gas-ratio techniques applied to Dissolved Gas Analysis (DGA) data, require complex diagnostic algorithms and specialist interpretation, limiting their scalability. This paper presents a comprehensive investigation into the application of three machine learning classifiers, Decision Tree (DT), Support Vector Machine (SVM), and K-Nearest Neighbour (KNN), for predictive maintenance of power transformers using DGA data sourced from the IEC TC10 dataset. A total of 123 transformer oil samples, labelled across six fault classes (PD, D1, D2, T1, T2, T3) as defined by the IEC 60599 and IEEE C57.104 standards, were used to train and evaluate the models. Data pre-processing included feature scaling, class balancing via resampling, and an 80/20 train-test split. The models were evaluated using accuracy, precision, recall, F1-score, Receiver Operating Characteristic (ROC) curves, and Precision-Recall (PR) curves. SVM and KNN both achieved a testing accuracy of 95.65%, outperforming the Decision Tree at 89.13%. SVM demonstrated superior performance in terms of ROC-AUC for critical fault classes (T1/T2: 0.94), and is recommended as the preferred diagnostic model for field deployment. These results validate the viability of machine learning as a robust, scalable alternative to conventional DGA interpretation methods in the context of smart power grid maintenance. |
1. Introduction
Power transformers are linchpin assets in electrical power transmission and distribution infrastructure. Operating across voltage levels from 33 kV to 400 kV, they facilitate the efficient long-distance transfer of electrical energy between generation facilities and load centres. In Ghana, the Electricity Company of Ghana (ECG) depends on a fleet of power transformers to maintain stable grid operations across domestic, commercial, and industrial sectors [1]. A single transformer failure can cascade into widespread outages, disrupting critical services, halting industrial production, and imposing substantial financial penalties on utilities and end-users alike.
Transformer failures manifest through several physical mechanisms: dielectric breakdown, thermally induced insulation degradation, winding deformations due to mechanical stress, bushing failure, tap changer malfunction, core and tank failures, and collapse of protection and cooling subsystems [2]. Each failure pathway leaves detectable chemical signatures in the form of dissolved gases in the insulating oil, a phenomenon exploited by Dissolved Gas Analysis (DGA). Gases including Hydrogen (H2), Methane (CH4), Ethane (C2H6), Ethylene (C2H4), and Ethyne (C2H2) are generated at rates and ratios that encode the type and severity of the underlying fault [3].
Classical DGA interpretation methods (Doernenburg ratios, Rogers' ratios, and IEC ratio codes) provide deterministic rule-based diagnoses. While useful, these methods suffer from diagnostic uncertainty in boundary cases, susceptibility to analyst error, and inability to leverage the full multivariate statistical structure of DGA data [4]. The application of machine learning (ML) to DGA-based fault classification addresses these limitations by learning complex, non-linear decision boundaries from historical labelled data [5].
Since 2010, ML-based predictive maintenance (PDM) has witnessed growing adoption in the power systems domain [6]. Techniques spanning supervised classifiers, neural networks, and ensemble methods have been applied to transformer health monitoring with promising results [7]. However, studies providing systematic comparative evaluation of multiple classifier paradigms on standardised benchmark datasets with rigorous multi-metric reporting remain limited, especially in the African utility context.
This paper addresses that gap. Specifically, it provides: (i) a systematic comparative evaluation of Decision Tree (DT), SVM, and KNN classifiers on the IEC TC10 DGA benchmark; (ii) rigorous pre-processing including class balancing and standardised feature scaling; (iii) multi-metric evaluation via accuracy, precision, recall, F1-score, ROC-AUC, and average precision; and (iv) actionable recommendations for utility operators seeking to transition from reactive to predictive maintenance strategies.
2. Background and Literature Review
2.1 Power Transformer Fundamentals
A power transformer operates on Faraday's principle of electromagnetic induction. An alternating current in the primary winding generates a time-varying magnetic flux, channelled through a ferromagnetic core to induce an electromotive force (emf) in the secondary winding. The turns ratio (Np/Ns) determines whether the device steps voltage up or down, enabling both economical long-distance transmission and safe distribution to consumers. Power transformers (>200 MVA, 33–400 kV) are distinguished from distribution transformers (230 V – 11 kV) and instrument transformers by their ratings and applications [8].
Transformer designs are classified by turns ratio (step-up, step-down, isolation), phase configuration (single-phase, three-phase, autotransformer), core material (air-core, ferrite, iron, toroidal), and winding construction (core-type, shell-type, Berry-type). Each design exhibits different failure mode susceptibilities, though DGA-based monitoring applies generically across types. The insulating oil serves both as a dielectric medium and thermal coolant; its chemical degradation under electrical and thermal stress is the physical basis for DGA monitoring [9].
2.2 Transformer Fault Taxonomy (IEC 60599 / IEEE C57.104)
Fault classification follows the IEC 60599 and IEEE C57.104 standards, which define six primary fault categories based on characteristic gas generation patterns, as shown in Table 1. Partial Discharge (PD) produces predominantly H2 and CH4, while thermal faults at progressively higher temperatures shift the dominant gas signature from ethane through ethylene and acetylene. This temperature-dependent gas evolution underpins all DGA-based diagnostic approaches [3], [10].
Table 1. Transformer fault classification per IEC 60599 and IEEE C57.104 standards with temperature ranges, severity, and characteristic dissolved gas markers.
Code | Fault Type | Temperature | Severity | Key Gases |
PD | Partial Discharge | < 150 °C | Low | H₂, CH₄ |
D1 | Low Energy Discharge | < 300 °C | Low–Med | H₂, C₂H₂ |
D2 | High Energy Discharge | < 700 °C | High | H₂, C₂H₂, C₂H₄ |
T1 | Low-Temp Thermal | < 300 °C | Low | CH₄, C₂H₆ |
T2 | Med-Temp Thermal | 300–700 °C | Medium | C₂H₄, C₂H₆ |
T3 | High-Temp Thermal | > 700 °C | Critical | C₂H₄, C₂H₂, CO₂ |
Source: Nanfak et al. [3]; IEC 60599 [10]; IEEE C57.104 [11].
2.3 Dissolved Gas Analysis Methods
DGA quantifies the concentration of key fault gases dissolved in transformer insulating oil using gas chromatography. Traditional ratio-based interpretation methods including Doernenburg, Rogers, and IEC three-ratio techniques map gas concentration ratios to fault codes. While standardised and widely deployed, they exhibit known limitations: diagnostic ambiguity when ratios fall outside defined code boundaries, failure to account for inter-gas covariance, and the requirement for trained human interpretation [4]. ML-based classifiers overcome these limitations by treating DGA as a multivariate pattern recognition problem, learning fault boundaries directly from labelled historical data.
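As a concrete illustration of the ratio-based approach, the sketch below computes the three IEC 60599 gas ratios from ppm concentrations. The diagnostic code table that maps ratio bands to fault codes is deliberately omitted, and the sample values are hypothetical:

```python
# Minimal sketch: the three IEC 60599 gas ratios computed from ppm
# concentrations. The code table mapping ratio bands to fault codes is
# omitted; the sample values below are hypothetical.
def iec_ratios(h2, ch4, c2h6, c2h4, c2h2):
    """Return the IEC 60599 ratios (C2H2/C2H4, CH4/H2, C2H4/C2H6)."""
    div = lambda a, b: a / b if b else float("inf")
    return div(c2h2, c2h4), div(ch4, h2), div(c2h4, c2h6)

# Hypothetical sample with H2/CH4-dominant gassing (a PD-like signature)
r1, r2, r3 = iec_ratios(h2=300.0, ch4=60.0, c2h6=10.0, c2h4=5.0, c2h2=0.5)
print(f"C2H2/C2H4={r1:.2f}  CH4/H2={r2:.2f}  C2H4/C2H6={r3:.2f}")
```

The ambiguity noted above arises when one or more of these ratios falls outside every band defined in the code table, leaving the sample without a diagnosis.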
2.4 Review of Related Work
A growing body of literature applies ML to transformer fault detection. Venkataswamy et al. [12] integrated IoT sensor networks with metaheuristic optimisation for reliability-centred maintenance, demonstrating reduced downtime but requiring costly infrastructure. Laayati et al. [13] developed a self-diagnostic smart energy management system for oil-immersed transformers, achieving real-time fault detection though with significant integration complexity. Yu et al. [14] applied an information-granulated SVM to nuclear power plant transformer anomaly detection, obtaining accurate results but with sensitivity to data granularity.
Raghuraman and Darvishi [5] specifically addressed multi-class DGA fault classification using ML, reporting accurate fault-type identification while noting DGA data quality as a key constraint. Vallim Filho et al. [15] proposed a predictive analytics framework grounded in equipment load cycles, validated on real-world case studies. Bai et al. [16] introduced an in-context learning framework for algorithm selection in predictive maintenance, enhancing operational precision at elevated computational cost. Table 2 synthesises key contributions.
Table 2. Summary of related works on machine learning-based transformer fault detection.
Study | Method | Key Finding | Limitation | Application |
Venkataswamy et al. [12] | IoT + Metaheuristic Optimisation | Reduced downtime | High IoT cost | Distribution TF |
Laayati et al. [13] | Smart Self-Diagnostic EMS | Real-time detection | Complex integration | Oil-immersed TF |
Yu et al. [14] | Information-granulated SVM | Anomaly detection | Data quality sensitive | Nuclear TF |
Raghuraman & Darvishi [5] | ML on DGA data | Multi-fault classification | DGA variability | Power TF |
Vallim Filho et al. [15] | Predictive analytics + load cycles | Maintenance scheduling | Needs load-cycle data | Power TF |
Present study | DT, SVM, KNN on IEC TC10 | 95.65% accuracy | Small dataset | Power TF |
Note: Present study denotes the work described in this paper. TF = transformer.
Despite these advances, a gap persists in the systematic comparative evaluation of multiple ML classifier paradigms on the standardised IEC TC10 DGA benchmark using unified pre-processing and multi-metric evaluation, a gap this study addresses directly.
3. Methodology
3.1 Overview and Pipeline
The study adopts a supervised machine learning workflow comprising data acquisition, pre-processing, model training, and multi-metric evaluation. The complete pipeline is illustrated in Figure 1.
Figure 1. Predictive maintenance methodology pipeline from IEC TC10 DGA data acquisition through classifier training and fault diagnosis evaluation.
3.2 Dataset Description
The dataset comprises 123 DGA samples from the IEC TC10 standard reference database. Each sample records the concentration (ppm) of five dissolved fault gases: H2, CH4, C2H6, C2H4, and C2H2. Each sample is labelled with one of five fault categories (T1 and T2 are grouped as T1/T2): PD, D1, D2, T1/T2, and T3, as defined by IEC 60599. Gas concentrations span several orders of magnitude (0.001–92,600 ppm), necessitating standardisation prior to model training. The fault class distribution is markedly imbalanced, with D2 at 42.3% and PD at only 7.3% (Table 3 and Figure 2).
Table 3. IEC TC10 dataset fault class distribution with gas pattern summary and thermal indicators.
Code | Fault Type | N | Share | Gas Pattern | Indicator |
PD | Partial Discharge | 9 | 7.3% | H₂, CH₄ dominant | High H₂ ppm |
D1 | Low Energy Discharge | 28 | 22.8% | H₂, C₂H₂ elevated | Moderate C₂H₂ |
D2 | High Energy Discharge | 52 | 42.3% | H₂, C₂H₂, C₂H₄ high | Very high ppm |
T1/T2 | Thermal Low/Med | 16 | 13.0% | CH₄, C₂H₆ dominant | Low C₂H₂ |
T3 | Thermal High | 18 | 14.6% | C₂H₄ dominant | Temp > 700 °C |
Total | — | 123 | 100% | 5 dissolved gas features | IEC TC10 standard |
Source: IEC TC10 standard database [3], [10]. N = total samples per class.
Figure 2. Fault class distribution in the IEC TC10 DGA dataset. D2 (High Energy Discharge) constitutes the majority class (42.3%), while PD (Partial Discharge) is the minority class (7.3%). Class imbalance was addressed through random oversampling prior to model training.
3.3 Data Pre-Processing
Pre-processing followed a four-stage pipeline. First, Feature Selection retained all five gas concentration features and excluded the equipment identifier, which is not a reliable generalised fault predictor. Second, Feature Engineering addressed the right-skewed, multi-scale nature of gas concentration data through Z-score standardisation via a StandardScaler pipeline, ensuring zero-mean, unit-variance features for scale-sensitive classifiers (SVM, KNN). Third, Class Balancing employed random oversampling with replacement on minority classes (principally PD) to match the majority class count, preventing classifier bias. Fourth, Data Splitting partitioned the balanced dataset into training (80%) and testing (20%) subsets using stratified random sampling (random_state=42), preserving class proportions in both partitions.
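The four-stage pipeline can be sketched as follows. This is a minimal illustration with synthetic stand-in data, since the IEC TC10 file layout is not reproduced here; the random oversampling is implemented directly with NumPy rather than a dedicated library:

```python
# Sketch of the four-stage pre-processing pipeline: feature selection is
# implicit (only the five gas columns are generated), then oversampling,
# stratified 80/20 split, and Z-score scaling fit on training data only.
# The synthetic matrix stands in for the 123-sample, 5-gas IEC TC10 data.
import numpy as np
from collections import Counter
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(42)
X = rng.lognormal(mean=3.0, sigma=2.0, size=(123, 5))  # H2, CH4, C2H6, C2H4, C2H2 (ppm)
y = rng.choice(["PD", "D1", "D2", "T1/T2", "T3"], size=123,
               p=[0.073, 0.228, 0.423, 0.130, 0.146])

# Class balancing: random oversampling with replacement to the majority count
counts = Counter(y)
n_max = max(counts.values())
idx = np.concatenate([rng.choice(np.flatnonzero(y == cls), size=n_max, replace=True)
                      for cls in counts])
X_bal, y_bal = X[idx], y[idx]

# Stratified 80/20 split, then standardisation fit on the training partition
X_tr, X_te, y_tr, y_te = train_test_split(
    X_bal, y_bal, test_size=0.20, stratify=y_bal, random_state=42)
scaler = StandardScaler().fit(X_tr)
X_tr, X_te = scaler.transform(X_tr), scaler.transform(X_te)
print(Counter(y_bal))  # every class now matches the majority count
```

Fitting the scaler on the training partition only, as above, avoids leaking test-set statistics into the scaled features.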
3.4 Classifier Selection and Implementation
Decision Tree (DT): A non-parametric supervised classifier that recursively partitions the feature space using Gini impurity criteria. DTs are valued for their interpretability (decision rules can be directly inspected by maintenance engineers) and capacity to handle non-linear class boundaries without feature scaling. Implemented with default Scikit-learn hyperparameters [17].
Support Vector Machine (SVM): A maximum-margin classifier that identifies the optimal separating hyperplane by maximising the margin between support vectors. A linear kernel was employed with probability calibration enabled to support ROC and PR curve computation. SVM is known for robust generalisation in high-dimensional spaces and resistance to overfitting [18].
K-Nearest Neighbour (KNN): A non-parametric, instance-based classifier that assigns class labels by majority vote among k=5 nearest training samples under Euclidean distance. KNN requires no explicit training phase but demands feature standardisation to ensure distance meaningfulness [19].
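The three classifiers, as configured above, can be instantiated in Scikit-learn roughly as follows; the synthetic dataset is a stand-in for the standardised DGA features, not the study's data:

```python
# Sketch of the three classifiers with the configurations described in the
# text: default (Gini, unpruned) DT, linear-kernel SVM with probability
# calibration, and KNN with k=5 under Euclidean distance.
from sklearn.datasets import make_classification
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

models = {
    "DT": DecisionTreeClassifier(random_state=42),
    "SVM": SVC(kernel="linear", probability=True, random_state=42),
    "KNN": KNeighborsClassifier(n_neighbors=5, metric="euclidean"),
}

# Synthetic 5-feature, 5-class stand-in for the scaled DGA data
X, y = make_classification(n_samples=120, n_features=5, n_informative=4,
                           n_redundant=0, n_classes=5,
                           n_clusters_per_class=1, random_state=0)
for name, clf in models.items():
    clf.fit(X, y)
    print(name, "train accuracy:", round(clf.score(X, y), 3))
```

Note that an unpruned DT will typically reach 100% training accuracy on continuous features, which is why training versus testing accuracy is reported separately in Section 4.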
3.5 Performance Metrics
Model performance was assessed using the following metrics, computed on the held-out test set (n=46):
Precision = TP / (TP + FP) | Eq. (1) |
Recall = TP / (TP + FN) | Eq. (2) |
F1 Score = 2 * (Precision * Recall) / (Precision + Recall) | Eq. (3) |
Accuracy = (TP + TN) / (TP + TN + FP + FN) | Eq. (4) |
Where TP = True Positives, TN = True Negatives, FP = False Positives, FN = False Negatives. Receiver Operating Characteristic (ROC) curves and Area Under Curve (AUC) values were computed per class using one-vs-rest binarisation. Precision-Recall (PR) curves and Average Precision (AP) scores were also computed to assess performance under class imbalance, where ROC-AUC can be overly optimistic. Macro-averaged F1-score serves as the primary composite metric, equally weighting all fault classes including the minority PD class.
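Assuming Scikit-learn throughout, Eqs. (1)–(4) and the macro-averaged composite can be computed as below; the label vectors are illustrative, not the study's actual test-set predictions:

```python
# Sketch of the metric computation for Eqs. (1)-(4) plus the macro-averaged
# F1 composite. y_true/y_pred are illustrative, not the actual test set.
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

classes = ["PD", "D1", "D2", "T1/T2", "T3"]
y_true = np.array(["D2", "D2", "D1", "PD", "T3", "T1/T2", "D2", "D1"])
y_pred = np.array(["D2", "D1", "D1", "PD", "T3", "PD",    "D2", "D1"])

acc = accuracy_score(y_true, y_pred)                 # Eq. (4)
prec, rec, f1, _ = precision_recall_fscore_support(  # Eqs. (1)-(3), macro-averaged
    y_true, y_pred, labels=classes, average="macro", zero_division=0)
print(f"accuracy={acc:.3f} macro-P={prec:.3f} macro-R={rec:.3f} macro-F1={f1:.3f}")
```

Macro averaging weights each of the five fault classes equally, so a poor score on the minority PD class cannot be hidden by strong majority-class performance.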
4. Results and Discussion
4.1 Overall Classification Accuracy
Table 4 presents training and testing accuracy scores for all three classifiers. SVM and KNN both achieved the highest testing accuracy of 95.65%, correctly classifying 44 of 46 test samples. The Decision Tree achieved 89.13% (41/46 correct). Notably, the Decision Tree exhibited a pronounced overfitting signature: 100% training versus 89.13% testing accuracy, a gap of 10.87 percentage points, indicating the fully unpruned tree memorised training patterns rather than learning generalisable features. SVM and KNN maintained training-testing accuracy gaps of only 3.2 and 4.4 percentage points respectively, demonstrating superior generalisation (Figure 3).
Table 4. Classification accuracy and per-class precision for DT, SVM, and KNN classifiers on the IEC TC10 test set (n=46).
Classifier | PD-P | D1-P | D2-P | T3-P | Correct/Total | Test / Train Acc. |
Decision Tree | 0.96 | 0.62 | 0.89 | 1.00 | 41 / 46 | 89.13% / 100% |
SVM (Linear) | 0.96 | 0.86 | 1.00 | 1.00 | 44 / 46 | 95.65% / 92.86% |
KNN (k=5) | 0.92 | 1.00 | 1.00 | 1.00 | 44 / 46 | 95.65% / 91.21% |
P = Precision per class. Test Acc. = Testing Accuracy. Train Acc. = Training Accuracy.
Figure 3. Grouped bar chart comparing training and testing accuracy across classifiers. The Decision Tree's large train-test gap confirms overfitting, while SVM and KNN demonstrate consistent generalisation performance.
4.2 Confusion Matrix Analysis: Decision Tree
Figure 4 presents the confusion matrix for the Decision Tree classifier. The model misclassified 5 of 46 test samples: one D1 as D2, two D2 as D1, one T1/T2 as D1, and one T3 as PD. Misclassifications cluster around the low-to-medium energy discharge boundary (D1/D2), reflecting overlapping gas signatures of these neighbouring fault classes. The T1/T2 misclassification is particularly consequential in practice, as thermal faults require immediate thermal management intervention.
Figure 4. Confusion matrix for the Decision Tree classifier (test set, n=46). Diagonal elements (burgundy) represent correctly classified samples; off-diagonal values indicate misclassifications. Five errors occur, primarily at the D1/D2 and T1/T2 boundaries.
4.3 Confusion Matrix Analysis: SVM and KNN
Figure 5 presents the SVM confusion matrix. The SVM misclassified only 2 samples, both within the T1/T2 class (1 as PD, 1 as D1), achieving perfect classification of PD, D1, D2, and T3. This is a significant practical improvement: high-energy discharge faults (D2), carrying the highest risk of catastrophic insulation failure, are identified with 100% precision and recall under SVM, enabling confident maintenance interventions. KNN achieved an identical 95.65% accuracy with slightly different T1/T2 confusion (2 misclassified as PD).
Figure 5. Confusion matrices for SVM (left) and KNN (right). Both achieve 44/46 correct predictions. SVM and KNN both struggle only with T1/T2, the thermally ambiguous mid-range fault class.
4.4 Per-Class Precision, Recall, and F1-Score
Table 5 disaggregates classification performance by fault class and metric. All classifiers achieve F1=1.00 for T3 (high-temperature thermal fault), confirming that extreme gas signatures of T3 are readily separable. PD also achieves high F1 scores (0.96–0.98) across all classifiers. The most challenging class remains T1/T2 (F1=0.50 for all classifiers), attributable to its small sample count and gas signature overlap with discharge faults. Figure 6 visualises precision and recall per class for each classifier.
Table 5. Per-class precision (P), recall (R), and F1-score for Decision Tree, SVM, and KNN classifiers.
Fault | DT-P | DT-R | DT-F1 | SVM-P | SVM-R | SVM-F1 | KNN-F1 |
PD | 0.96 | 1.00 | 0.98 | 0.96 | 1.00 | 0.98 | 0.96 |
D1 | 0.62 | 0.83 | 0.71 | 0.86 | 1.00 | 0.92 | 1.00 |
D2 | 0.89 | 0.80 | 0.84 | 1.00 | 1.00 | 1.00 | 1.00 |
T1/T2 | 1.00 | 0.33 | 0.50 | 1.00 | 0.33 | 0.50 | 0.50 |
T3 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Macro Avg | 0.89 | 0.79 | 0.81 | 0.96 | 0.87 | 0.88 | 0.89 |
Macro Avg = unweighted macro-average across all five fault classes.
Figure 6. Precision and recall per fault class for all three classifiers. Hatched bars represent recall; solid bars represent precision. SVM achieves the most balanced precision-recall performance, particularly for D1 and D2.
4.5 F1-Score Radar Analysis
Figure 7 presents a radar chart comparing per-class F1-scores across classifiers. SVM and KNN both form larger polygons than the Decision Tree, particularly in the D1 and D2 dimensions, confirming their superior overall diagnostic capability. All three classifiers converge at T3 (F1=1.00), while the visible gap at D1 and D2 between the Decision Tree and the ML alternatives highlights the practical value of choosing SVM or KNN for high-energy discharge fault detection.
Figure 7. F1-score radar chart comparing Decision Tree, SVM, and KNN per fault class. Larger polygon area indicates better overall multi-class performance. SVM and KNN outperform the Decision Tree especially for D1 and D2.
4.6 ROC-AUC and Precision-Recall Analysis
Figure 8 presents ROC-AUC and Average Precision (AP) scores as heatmaps by fault class and classifier. All classifiers achieve AUC=1.00 for T3, confirming its uniquely identifiable gas signature. SVM achieves the highest AUC for T1/T2 (0.94 versus 0.67 for DT and 0.64 for KNN), confirming its superior sensitivity for thermally ambiguous fault classification. The Decision Tree underperforms notably for PD (AUC=0.33) and D2 (AUC=0.17), a serious concern given that D2 constitutes the most frequent and operationally critical fault class.
In the AP analysis (right panel), SVM achieves the highest score for T1/T2 (AP=0.59) while KNN achieves the highest mean AP (0.48) across all classes. For PD, the most imbalanced minority class, KNN and DT both achieve AP=0.50, while SVM achieves 0.36. This suggests KNN may offer marginal advantages for PD-specific applications, despite SVM's overall superiority.
Figure 8. ROC-AUC (left) and Average Precision (right) heatmaps by fault class and classifier. Darker cells indicate higher discriminative performance. SVM dominates T1/T2 and D1 classification; all classifiers achieve perfect T3 detection.
5. Discussion
5.1 Implications for Power Utility Operations
The validated predictive maintenance framework offers several immediate operational benefits for utilities such as ECG. First, SVM-based fault classification on routine DGA oil samples can provide automated preliminary diagnosis, reducing reliance on specialist DGA analysts and enabling faster maintenance scheduling. Second, the 95.65% testing accuracy, compared to the 60–80% diagnostic accuracy commonly reported for traditional ratio methods under boundary conditions [3], represents a practically significant improvement in early fault detection capability.
Third, the perfect identification of D2 (High Energy Discharge) faults by SVM is particularly valuable: D2 faults represent imminent risk of catastrophic insulation failure and, if missed, can lead to transformer explosion, fire, and extended outages. A false-negative rate of zero for D2 under SVM, compared to a 20% false-negative rate (2/10 D2 samples) under the Decision Tree, could prevent major incidents in field deployment.
Fourth, the framework's computational lightness (SVM on a 123-sample dataset trains in milliseconds on commodity hardware) makes it suitable for integration into existing SCADA systems or cloud-based asset management platforms without significant infrastructure investment. This is especially pertinent in resource-constrained utility environments such as those common in sub-Saharan Africa.
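The D2 false-negative comparison above can be reproduced from any confusion matrix. The sketch below uses a hypothetical matrix that matches the 2-of-10 D2 miss rate, not the exact values of Figure 4:

```python
# Sketch: per-class false-negative rates from a confusion matrix. The matrix
# is a hypothetical stand-in reproducing the 2-of-10 D2 miss rate discussed
# above; it is not the exact matrix of Figure 4.
import numpy as np

classes = ["PD", "D1", "D2", "T1/T2", "T3"]
cm = np.array([            # rows = true class, columns = predicted class
    [5,  0, 0, 0, 0],
    [0, 10, 2, 0, 0],
    [0,  2, 8, 0, 0],      # D2: 8 correct, 2 missed -> FN rate 0.20
    [1,  1, 0, 1, 0],
    [0,  0, 0, 0, 6],
])
fn_rate = 1.0 - np.diag(cm) / cm.sum(axis=1)  # 1 - recall, per class
print(dict(zip(classes, np.round(fn_rate, 2))))
```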
5.2 Limitations
Several limitations circumscribe the generalisability of these findings. The IEC TC10 dataset, while a recognised benchmark, comprises only 123 samples, a relatively small corpus by contemporary ML standards. Class imbalance required artificial resampling, which may introduce distributional artefacts. The combined T1/T2 class may mask intra-class distinctions relevant to maintenance precision. Furthermore, the study does not account for temporal trends in gas accumulation rates; Remaining Useful Life (RUL) estimation would require time-series DGA data, which is not present in IEC TC10 [6].
The generalisability of trained models to transformer fleets operating under different thermal conditions, insulating oil compositions, or geographic climates has not been assessed. Transfer learning or domain adaptation techniques may be required for deployment in materially different operational contexts, including the tropical operating environment typical of Ghanaian installations.
5.3 Comparison with Prior Work
The SVM testing accuracy of 95.65% achieved in this study compares favourably with comparable studies. Raghuraman and Darvishi [5] reported ML-based DGA classification accuracies in the range of 88–94% on similar datasets. Yu et al. [14] achieved high anomaly detection rates but did not report comparable multi-class F1 metrics. Vallim Filho et al. [15] achieved strong predictive maintenance performance using load-cycle data but did not address DGA-based fault classification. The present study's unique contribution is its systematic comparative evaluation with unified pre-processing and comprehensive multi-metric reporting, providing a reproducible baseline for future studies.
6. Conclusions and Recommendations
6.1 Conclusions
This paper has demonstrated the effectiveness of three machine learning classifiers (Decision Tree, SVM, and KNN) for predictive maintenance of power transformers using DGA-based fault classification on the IEC TC10 dataset. The principal findings are:
(i) SVM and KNN both achieve 95.65% testing accuracy, outperforming the Decision Tree (89.13%) and representing a significant improvement over traditional ratio-based DGA methods.
(ii) SVM demonstrates superior discrimination for operationally critical fault classes: perfect precision and recall for D2 (High Energy Discharge) and the highest ROC-AUC for T1/T2 (0.94) among all classifiers evaluated.
(iii) The Decision Tree exhibits overfitting (100% training versus 89.13% testing accuracy), limiting its reliability. Tree depth constraints are recommended if DT is to be deployed.
(iv) T1/T2 classification remains challenging across all models (F1=0.50), suggesting this class boundary requires additional feature engineering or integration of complementary diagnostic data.
(v) SVM is recommended as the primary diagnostic classifier for operational deployment, with KNN as a computationally simpler alternative.
6.2 Recommendations for Practice
Power utilities are strongly encouraged to integrate ML-based DGA diagnostic tools as a complement to, not a replacement for, periodic physical inspections and traditional ratio analysis. A hybrid diagnostic protocol that triggers ML classification upon detection of elevated gas concentrations during routine sampling would optimise the cost-benefit trade-off. Power stations should establish structured DGA data collection programs with standardised sampling intervals, gas measurement procedures, and fault labelling protocols, enabling the development of increasingly accurate, utility-specific predictive models over time.
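One way to realise the hybrid trigger is a simple threshold gate ahead of the classifier. The sketch below uses placeholder ppm thresholds that a utility would replace with its own alert limits; they are not IEEE C57.104 values:

```python
# Hypothetical threshold gate for the hybrid protocol: invoke the ML
# classifier only when any dissolved gas exceeds a utility-defined alert
# level. The ppm values below are placeholders, not IEEE C57.104 limits.
ALERT_PPM = {"H2": 100.0, "CH4": 120.0, "C2H6": 65.0, "C2H4": 50.0, "C2H2": 1.0}

def needs_ml_diagnosis(sample_ppm: dict) -> bool:
    """True if any gas in the oil sample exceeds its alert threshold."""
    return any(sample_ppm.get(gas, 0.0) > limit for gas, limit in ALERT_PPM.items())

print(needs_ml_diagnosis({"H2": 300.0, "CH4": 60.0}))  # H2 exceeds its limit
```

Samples that pass the gate proceed through normal ratio analysis only, preserving the cost profile of existing sampling programmes.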
6.3 Future Research Directions
Future work should explore: (i) Artificial Neural Networks (ANN) and LSTM architectures for time-series DGA trend modelling and Remaining Useful Life (RUL) estimation; (ii) ensemble learning methods (Random Forest, Gradient Boosting) for improved accuracy and uncertainty quantification; (iii) multi-modal diagnostic frameworks integrating DGA with vibration, temperature, and partial discharge sensor data; and (iv) development and validation of transformer-specific DGA datasets from the Ghanaian and West African power grid context to address geographic transferability.
Nomenclature
ANN | Artificial Neural Network |
AUC | Area Under Curve |
DGA | Dissolved Gas Analysis |
DT | Decision Tree |
ECG | Electricity Company of Ghana |
F1 | F1-Score (harmonic mean of Precision and Recall) |
FN | False Negative |
FP | False Positive |
IEC | International Electrotechnical Commission |
KNN | K-Nearest Neighbour |
ML | Machine Learning |
PD | Partial Discharge |
PDM | Predictive Maintenance |
PR | Precision-Recall |
ROC | Receiver Operating Characteristic |
RUL | Remaining Useful Life |
SVM | Support Vector Machine |
TN | True Negative |
TP | True Positive |