Result: A hybrid divisive K-means framework for big data–driven poverty analysis in Central Java Province.

Title:
A hybrid divisive K-means framework for big data–driven poverty analysis in Central Java Province.
Source:
Indonesian Journal of Electrical Engineering & Computer Science; Jan2026, Vol. 41 Issue 1, p258-269, 12p
Geographic Terms:
Database:
Complementary Index

Further Information

Clustering is essential in big data analytics, especially for partitioning high-dimensional socioeconomic datasets to support interpretation and policy decisions. While K-Means is widely used for its simplicity and scalability, its strong sensitivity to initial centroid selection often leads to unstable results and slower convergence. Previous hybrid approaches, such as Agglomerative–K-Means, attempted to address this issue by using hierarchical clustering for centroid initialization; however, these methods rely on bottom-up merging, which can produce suboptimal initial partitions and increase computational overhead for larger datasets. To overcome these limitations, this study proposes a hybrid divisive–K-Means (DHC) model that employs top-down hierarchical splitting to generate more coherent initial centroids before refinement with K-Means. Using a multidimensional poverty dataset from Central Java Province provided by the Indonesian Central Bureau of Statistics (BPS), the performance of DHC was evaluated against standard K-Means and Agglomerative–K-Means. The assessment included execution time, convergence iterations, and cluster validity indices (Silhouette, Davies–Bouldin, and Calinski–Harabasz). Experimental results demonstrate that DHC reduces execution time by up to 97% and requires 40% fewer iterations than standard K-Means, while achieving comparable or improved cluster quality (e.g., the Calinski–Harabasz (CH) index increasing from 14.3 to 15.8). These findings indicate that the DHC model offers a more efficient and stable clustering solution, addressing the shortcomings of standard K-Means and improving performance for large-scale socioeconomic data analysis. [ABSTRACT FROM AUTHOR]
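
The abstract does not specify how the top-down splitting is performed, so the following is only a minimal sketch of the general idea, not the authors' DHC implementation. It assumes a bisecting-style split: the cluster with the largest within-cluster sum of squares is repeatedly divided by a deterministic 2-means step, and the resulting cluster means seed a plain Lloyd-style K-Means refinement. The function names (`divisive_init`, `split_in_two`, `kmeans`) and the toy points are hypothetical and unrelated to the BPS dataset.

```python
from math import dist


def mean(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def sse(points):
    """Sum of squared distances from each point to the cluster mean."""
    c = mean(points)
    return sum(dist(p, c) ** 2 for p in points)


def split_in_two(points, max_iter=20):
    """Split one cluster with 2-means (assumed splitting rule).

    Seeds are chosen deterministically: the point farthest from the
    cluster mean, and the point farthest from that first seed.
    """
    c = mean(points)
    a = max(points, key=lambda p: dist(p, c))
    b = max(points, key=lambda p: dist(p, a))
    left, right = [], []
    for _ in range(max_iter):
        new_left = [p for p in points if dist(p, a) <= dist(p, b)]
        new_right = [p for p in points if dist(p, a) > dist(p, b)]
        # Stop if a side would become empty or the assignment is stable.
        if not new_left or not new_right or new_left == left:
            break
        left, right = new_left, new_right
        a, b = mean(left), mean(right)
    return left, right


def divisive_init(points, k):
    """Top-down splitting until k clusters exist; returns their means.

    May return fewer than k centroids if no remaining cluster can be split.
    """
    clusters = [list(points)]
    while len(clusters) < k:
        worst = max(clusters, key=sse)
        if sse(worst) == 0:
            break  # all remaining clusters hold identical points
        clusters.remove(worst)
        clusters.extend(split_in_two(worst))
    return [mean(c) for c in clusters]


def kmeans(points, init_centroids, max_iter=100):
    """Lloyd-style K-Means; returns (centroids, labels, passes used)."""
    centroids = list(init_centroids)
    labels = None
    for iteration in range(1, max_iter + 1):
        new_labels = [
            min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            return centroids, labels, iteration
        labels = new_labels
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = mean(members)
    return centroids, labels, max_iter


# Made-up 2-D toy data with three visible groups (illustration only).
points = [
    (1.0, 1.0), (1.2, 0.8), (0.8, 1.1),
    (5.0, 5.0), (5.2, 4.9), (4.8, 5.1),
    (9.0, 1.0), (9.1, 1.2), (8.9, 0.9),
]
init = divisive_init(points, k=3)
centroids, labels, n_iter = kmeans(points, init)
print(labels, n_iter)
```

Because the seeds already sit close to the final cluster means, the refinement step has little left to do, which is the intuition behind the reported reduction in iterations. On real data, a library routine with a custom initial centroid array would normally replace the hand-written loop.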

Copyright of Indonesian Journal of Electrical Engineering & Computer Science is the property of Institute of Advanced Engineering & Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)