Performance Comparison K-Nearest Neighbors and Random Forest on Predicting The Performance New Polimedia Student Admissions

Dwi Riyono, Cholid Mawardi

Abstract


New student admissions are at the forefront of the school's operational process. the success of each college's input stems from this. Polimedia always conducts new student admissions every year with various strategies used. Polimedia has 23 study programmes that can enable it to move in the creative industry that can be utilised by the community. in this study, a strategy using a prediction algorithm is used to be able to see the possible opportunities that occur if implemented in the coming year. with a dataset of 3738 data received by new students, an analysis will be carried out on prospective students who have re-registered or who have not re-registered. The classification model with 2 classes will be used. by conducting a data analysis process using exploratory data analysis (EDA) and also performing data cleansing so that the data modelling process runs well. The method used uses the main model of K-Nearest Neighbors by comparing with other machine learning models such as decision tree and random forest. It is expected that this research can produce high accuracy values 86.90% with powerful machine learning model comparisons. This research is also expected to be a reference for other studies that also conduct performance testing processes with machine learning models using various objects.

Full Text:

PDF

References


Mawardi C, Buono A, Priandana K, Herianto. Performance Analysis of ResNet50 and Inception-V3 Image Classification for Defect Detection in 3D Food Printing. Int J Adv Sci Eng Inf Technol. 2024;14(2):798-804. doi:10.18517/ijaseit.14.2.19863

Marsuciati R, Gumelar G, Prietno R. Klasifikasi Metode Naïve Bayes dan K-Nearest Neighbor untuk Menentukan Keluarga Tidak Mampu. Pros SISFOTEK. 2021;5(1):246-249.

Aisyah A, Anraeni S. Analisis Penerapan Metode K-Nearest Neighbor (K-NN) pada Dataset Citra Penyakit Malaria. Indones J Data Sci. 2022;3(1):17-29. doi:10.56705/ijodas.v3i1.22

Chai BX, Eisenbart B, Nikzad M, et al. Process Prediction. Published online 2023.

Zhang S, Li J. KNN Classification With One-Step Computation. IEEE Trans Knowl Data Eng. 2023;35(3):2711-2723. doi:10.1109/TKDE.2021.3119140

Mienye ID, Sun Y. A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects. IEEE Access. 2022;10(September):99129-99149. doi:10.1109/ACCESS.2022.3207287

Han S, Kim H, Lee YS. Double random forest. Mach Learn. 2020;109(8):1569-1586. doi:10.1007/s10994-020-05889-1

Performance A, Alshdaifat E, Alshdaifat D, Alsarhan A, Hussein F, Moh S. The Effect of Preprocessing Techniques , Applied to Numeric. Data. 2021;6(11).

Fitriyadi F, Muqorobin M. Prediction System for the Spread of Corona Virus in Central Java with K-Nearest Neighbor (KNN) Method. Int J Comput Inf Syst. 2021;2(3):80-85. doi:10.29040/ijcis.v2i3.41

Koziarski M, Woźniak M, Krawczyk B. Combined Cleaning and Resampling algorithm for multi-class imbalanced data with label noise. Knowledge-Based Syst. 2020;204. doi:10.1016/j.knosys.2020.106223

Cui L, Zhang Y, Zhang R, Liu QH. A Modified Efficient KNN Method for Antenna Optimization and Design. IEEE Trans Antennas Propag. 2020;68(10):6858-6866. doi:10.1109/TAP.2020.3001743

Krstinić D, Braović M, Šerić L, Božić-Štulić D. Multi-label Classifier Performance Evaluation with Confusion Matrix. Published online 2020:01-14. doi:10.5121/csit.2020.100801




DOI: https://doi.org/10.29040/ijcis.v6i2.237

Article Metrics

Abstract view : 9 times
PDF - 3 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License