Publication: Abnormalities and fraud electric meter detection using hybrid support vector machine & genetic algorithm
Date
2007
Authors
Yap K.S.
Abidin I.Z.
Ahmad A.R.
Hussien Z.F.
Pok H.L.
Ismail F.I.
Mohamad A.M.
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
This paper presents an intelligent system to reduce Non Technical Loss (NTL) using hybrid Support Vector Machine (SVM) and Genetic Algorithm (GA). The main motivation for this research is to assist Sabah Electricity Sdn. Bhd. (SESB) to reduce their distribution loss, estimated around 15% at present in Sabah State, Malaysia. The hybrid algorithm is able to preselect customers to be inspected on-site for abnormalities or potential fraud according to their consumption patterns. SVM is a classification technique developed by Vapnik [1] but a practical difficulty of using SVM is the selection of parameters such as C and kernel parameter, � in Gaussian RBF kernel. The purpose of choosing parameters is to get the best generalization performance. Genetic Algorithm (GA) is used to search for the best parameter of SVM classification by using combination of random and pre-populated genomes from Pre-Populated Database (PPD). It provides an increased convergence and globally optimized solutions. The algorithm has been tested using actual customer consumption data from SESB. 10 fold cross validation method is used to confirm the consistency of the detection accuracy. The paper also highlights comparison results between typical SVM and SVM-GA. The highest fraud detection accuracy for SVMGA is 94%.
Description
Keywords
Dual lagrangian optimization , Dynamic crossover point , Genetic algorithm , Pre-populated database , Support vector machine , Algorithms , Computer science , Computers , Database systems , Diesel engines , Genetic algorithms , Image retrieval , Intelligent systems , Learning systems , Multilayer neural networks , Neural networks , Vectors , 10 fold cross validations , And genetic algorithms , Classification techniques , Comparison results , Consumption patterns , Customer consumption datums , Detection accuracies , Distribution losses , Dual lagrangian optimization , Dynamic crossover point , Fraud detections , Gaussian , Generalization performances , Hybrid algorithms , Kernel parameters , Malaysia , Optimized solutions , Pre-populated database , RBF kernels , Support vector machine , Support vectors , SVM classifications , Technical losses , Support vector machines