Industry Class Clustering of Software Expertise Competency at SMKN 2 Kraksaan Using Constrained K-Means Clustering Algorithm
DOI:
10.47709/cnahpc.v6i3.4214Keywords:
Constrained K-Means Clustering, Linear Programming Algorithm, Academic Proficiency Test, Software Engineering CompetencyDimension Badge Record
Abstract
Addressing the gap between school education and industry needs is a recurring concern, as many graduates struggle to enter the workforce due to lacking practical skills. Industry Classes aim to bridge this gap by preparing students with relevant skills and knowledge aligned with real-world industry demands. This study focuses on the application of Constrained K-Means Clustering to categorize students in the software engineering competency classes at SMKN 2 Kraksaan. This algorithm modifies traditional K-Means by utilizing Linear Programming Algorithm (LPA), ensuring each cluster meets predefined subject requirements. The research involves analyzing academic proficiency test data (TKDA) from 96 X-grade students, evaluating their abilities in analogy, series, figural, mathematical, and recall skills. Using a 3-cluster approach, each with 32 to 60 student capacity constraints, the study aims to optimize student distribution for effective learning outcomes. Evaluation through silhouette method yielded a score of 0.3199, indicating satisfactory separation between clusters with overlap to address. Cluster analysis revealed Cluster 2 as the most proficient, showcasing strengths in recall and series attributes critical for software engineering. These findings suggest that Constrained K-Means Clustering is effective in classifying students, highlighting Cluster 2 as optimal for software engineering competencies at SMKN 2 Kraksaan. Future research should focus on enhancing data quality, expanding sample size, and refining algorithms for improved clustering accuracy and effectiveness.
Downloads
Abstract viewed = 74 times
References
Alboaouh, K. (2018). The Gap Between Engineering Schools and Industry: A Strategic Initiative. 2018 IEEE Frontiers in Education Conference (FIE), 1-6. https://doi.org/10.1109/FIE.2018.8659314.
Alesiani, F., Ermis, G., & Gkiotsalitis, K. (2022). Constrained clustering for the capacitated vehicle routing problem (cc-cvrp). Applied artificial intelligence, 36(1), 1995658.
Baumann, P. (2019). A Binary Linear Programming-Based K-Means Approach for the Capacitated Centered Clustering Problem. 2019 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), 335-339. https://doi.org/10.1109/IEEM44572.2019.8978840.
Bibi, A., Alqahtani, A., & Ghanem, B. (2023). Constrained clustering: General pairwise and cardinality constraints. IEEE Access, 11, 5824-5836.
Buatoom, U., Kongprawechnon, W., & Theeramunkong, T. (2020). Document clustering using K-means with term weighting as similarity-based constraints. Symmetry, 12(6), 967.
Elfrianto, H., Nasrun, M. S., & Arifin, M. (2023). Buku Ajar Manajemen Pendidikan. umsu press.
Forestyanto, Y., Syamwil, R., & Wijaya, M. (2019). School Constraints in Recruitment and Implementation of Industrial Classes. , 4. https://doi.org/10.15294/JVCE.V4I1.21766.
Huang, P., Yao, P., Hao, Z., Peng, H., & Guo, L. (2021). Improved constrained k-means algorithm for clustering with domain knowledge. Mathematics, 9(19), 2390.
Judijanto, L., Mayasari, N., Baruno, Y. H. E., Tasrip, T., & Rusdi, M. (2024). Analisis Pengaruh Kemitraan Sekolah-Industri dan Program Magang terhadap Keterampilan Kerja dan Kesiapan Karier Siswa SMK di Jawa Tengah. Jurnal Multidisiplin West Science, 3(03), 378-388.
Lampert, T., Dao, T., Lafabregue, B., Serrette, N., Forestier, G., Crémilleux, B., Vrain, C., & Gançarski, P. (2018). Constrained distance based clustering for time-series: a comparative and experimental study. Data Mining and Knowledge Discovery, 32, 1663 - 1707. https://doi.org/10.1007/s10618-018-0573-y.
Melnykov, I., & Melnykov, V. (2020). A Note on the Formal Implementation of the K-means Algorithm with Hard Positive and Negative Constraints. Journal of Classification, 37, 789-809. https://doi.org/10.1007/s00357-019-09349-x.
Mengash, H. (2020). Using Data Mining Techniques to Predict Student Performance to Support Decision Making in University Admission Systems. IEEE Access, 8, 55462-55470. https://doi.org/10.1109/ACCESS.2020.2981905.
Mulyani, H., Setiawan, R., & Fathi, H. (2023). Optimization of K Value in Clustering Using Silhouette Score (Case Study: Mall Customers Data). Journal of Information Technology and Its Utilization. https://doi.org/10.56873/jitu.6.2.5243.
Noor, S., Kusumasari, T., & Hasibuan, M. (2019). Data Cleansing with PDI for Improving Data Quality. Proceedings of the International Conference on Creative Economics, Tourism and Information Management. https://doi.org/10.5220/0009868102560261.
Risal, A. A. N., Zainuddin, Z., & Niswar, M. (2022). School Zoning System for Student Admission using Constrained K-Means Algorithms. In 2022 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT) (pp. 174-178). IEEE.
Downloads
ARTICLE Published HISTORY
How to Cite
Issue
Section
License
Copyright (c) 2024 Matlubul Khairi, M. Syafiih, Ahmad Khairi
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.