Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm

Al-Jumaili A.H.A.; Muniyandi R.C.; Hasan M.K.; Singh M.J.; Paw J.K.S.; Al-Jumaily A.

doi:10.3233/IDA-230573

Publication:
Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm

dc.citedby	0
dc.contributor.author	Al-Jumaili A.H.A.	en_US
dc.contributor.author	Muniyandi R.C.	en_US
dc.contributor.author	Hasan M.K.	en_US
dc.contributor.author	Singh M.J.	en_US
dc.contributor.author	Paw J.K.S.	en_US
dc.contributor.author	Al-Jumaily A.	en_US
dc.contributor.authorid	57212194331	en_US
dc.contributor.authorid	14030355800	en_US
dc.contributor.authorid	55057479600	en_US
dc.contributor.authorid	58765817900	en_US
dc.contributor.authorid	58168727000	en_US
dc.contributor.authorid	57208087596	en_US
dc.date.accessioned	2025-03-03T07:45:58Z
dc.date.available	2025-03-03T07:45:58Z
dc.date.issued	2024
dc.description.abstract	Parallel power loads anomalies are processed by a fast-density peak clustering technique that capitalizes on the hybrid strengths of Canopy and K-means algorithms all within Apache Mahout's distributed machine-learning environment. The study taps into Apache Hadoop's robust tools for data storage and processing, including HDFS and MapReduce, to effectively manage and analyze big data challenges. The preprocessing phase utilizes Canopy clustering to expedite the initial partitioning of data points, which are subsequently refined by K-means to enhance clustering performance. Experimental results confirm that incorporating the Canopy as an initial step markedly reduces the computational effort to process the vast quantity of parallel power load abnormalities. The Canopy clustering approach, enabled by distributed machine learning through Apache Mahout, is utilized as a preprocessing step within the K-means clustering technique. The hybrid algorithm was implemented to minimise the length of time needed to address the massive scale of the detected parallel power load abnormalities. Data vectors are generated based on the time needed, sequential and parallel candidate feature data are obtained, and the data rate is combined. After classifying the time set using the canopy with the K-means algorithm and the vector representation weighted by factors, the clustering impact is assessed using purity, precision, recall, and F value. The results showed that using canopy as a preprocessing step cut the time it proceeds to deal with the significant number of power load abnormalities found in parallel using a fast density peak dataset and the time it proceeds for the k-means algorithm to run. Additionally, tests demonstrate that combining canopy and the K-means algorithm to analyze data performs consistently and dependably on the Hadoop platform and has a clustering result that offers a scalable and effective solution for power system monitoring. ? 2024 - IOS Press. All rights reserved.	en_US
dc.description.nature	Final	en_US
dc.identifier.doi	10.3233/IDA-230573
dc.identifier.epage	1346
dc.identifier.issue	5
dc.identifier.scopus	2-s2.0-85215378155
dc.identifier.spage	1321
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85215378155&doi=10.3233%2fIDA-230573&partnerID=40&md5=16f6c6c7217638414360d4fe7b424d1a
dc.identifier.uri	https://irepository.uniten.edu.my/handle/123456789/36943
dc.identifier.volume	28
dc.pagecount	25
dc.publisher	IOS Press BV	en_US
dc.source	Scopus
dc.sourcetitle	Intelligent Data Analysis
dc.subject	Anomaly detection
dc.subject	Cluster analysis
dc.subject	Abnormality detection
dc.subject	Abnormality detection and adjustment
dc.subject	Apache mahout
dc.subject	Canopy algorithm
dc.subject	Hybrid (CKMA)
dc.subject	K-mean algorithm
dc.subject	K-mean algorithms
dc.subject	Load data
dc.subject	Power load
dc.subject	Power load data
dc.subject	K-means clustering
dc.title	Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm	en_US
dc.type	Article	en_US
dspace.entity.type	Publication

Collections

SCOPUS

Publication: Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm

Options

Files

Collections

Publication:
Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm