Used Car Price Prediction Model: A Machine Learning Approach
Abstract
The impact of the Covid-19 pandemic over the past two years has slowed down the economy, including the market of used cars. However, the recent decline in the number of cases infected with Covid-19 has reignited interest in the used car market. One of many persisting issues found in the used car market is that sellers want the highest price possible; but, buyers and used car dealers bid the lowest price due to economic stability uncertainty. To accelerate the recovery of the used car industry, various innovations are required. This study proposes the use of the K-Nearest Neighbors (KNN) regression model to predict used car prices to address this issue. The proposed KNN model is a machine learning algorithm which is capable of handling multi-dimensional data and its robustness to noisy data, making it suitable for predicting used car prices based on multiple factors. By analyzing collected data on used car prices, a machine learning-based regression model can be developed to predict used car prices based on factors commonly used in the used car industry, such as year of production, car type, car condition, and others. This study makes use of 504 used car data collected through web scraping as a secondary data collection method. With a relatively small error rate of 8.3% and an R2 value of 98.8%, the results of this analysis can provide insight for used car buyers and sellers, to better gauge the price of used cars in the market.
Full Text:
PDFReferences
I. Infootomotif, “Manfaat Mobil untuk Aktivitas Manusia,” Kumparan, Nov. 05, 2021. Accessed: Feb. 25, 2023. [Online]. Available: https://kumparan.com/info-otomotif/manfaat-mobil-untuk-aktivitas-manusia-1wqysMbGRz0
A. Ferdian, “Harga BBM Naik, Penjualan Mobil Bekas Meningkat 10 Persen,” KOMPAS.com, Oct. 04, 2022. Accessed: Feb. 25, 2023. [Online]. Available: https://otomotif.kompas.com/read/2022/10/04/144100015/harga-bbm-naik-penjualan-mobil-bekas-meningkat-10-persen
A. Admin, “Pilih Membeli Mobil Bekas atau Mobil Baru?” Accessed: Feb. 25, 2023. [Online]. Available: https://www.otonesia.co.id/blog/post/pilih-membeli-mobil-bekas-atau-mobil-baru
S. Y. Ahmed, B. J. Ali, and C. Top, “Understanding the Impact of Trust, Perceived Risk, and Perceived Technology on the Online Shopping Intentions: Case Study in Kurdistan Region of Iraq,” Journal of Contemporary Issues in Business and Government, vol. 27, no. 3, Apr. 2021, doi: 10.47750/cibg.2021.27.03.264.
M. Thamarai and S. P. Malarvizhi, “House Price Prediction Modeling Using Machine Learning,” International Journal of Information Engineering and Electronic Business, vol. 12, no. 2, pp. 15–20, Apr. 2020, doi: 10.5815/ijieeb.2020.02.03.
A. H. M. Hassan, A. A. M. Qasem, W. F. M. Abdalla, and O. H. Elhassan, “Visualization & Prediction of COVID-19 Future Outbreak by Using Machine Learning,” International Journal of Information Technology and Computer Science, vol. 13, no. 3, pp. 16–32, Jun. 2021, doi: 10.5815/ijitcs.2021.03.02.
T. Wiradinata, “Folding Bicycle Prospective Buyer Prediction Model,” International Journal of Information Engineering and Electronic Business, vol. 13, no. 5, pp. 1–8, Oct. 2021, doi: 10.5815/ijieeb.2021.05.01.
P. Chapman et al., “CRISP-DM 1.0: Step-by-step data mining guide,” 2000.
Y. Suhanda, I. Kurniati, and S. Norma, “Penerapan Metode Crisp-DM Dengan Algoritma K-Means Clustering Untuk Segmentasi Mahasiswa Berdasarkan Kualitas Akademik,” Jurnal Teknologi Informatika dan Komputer, vol. 6, no. 2, pp. 12–20, Sep. 2020, doi: 10.37012/jtik.v6i2.299.
C. Soares, Y. Peng, J. Meng, T. Washio, and Z. H. Zhou, “Applications of data mining in E-business finance: Introduction,” Frontiers in Artificial Intelligence and Applications, vol. 177, no. 1, pp. 1–9, 2008, doi: 10.3233/978-1-58603-890-8-1.
M. Guha Majumder, S. Dutta Gupta, and J. Paul, “Perceived usefulness of online customer reviews: A review mining approach using machine learning & exploratory data analysis,” J Bus Res, vol. 150, pp. 147–164, Nov. 2022, doi: 10.1016/j.jbusres.2022.06.012.
D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput Sci, vol. 7, p. e623, Jul. 2021, doi: 10.7717/peerj-cs.623.
A. Singh, M. N., and R. Lakshmiganthan, “Impact of Different Data Types on Classifier Performance of Random Forest, Naïve Bayes, and K-Nearest Neighbors Algorithms,” International Journal of Advanced Computer Science and Applications, vol. 8, no. 12, 2017, doi: 10.14569/IJACSA.2017.081201.
V. Plevris, G. Solorzano, N. Bakas, and M. Ben Seghier, “Investigation of performance metrics in regression analysis and machine learning-based prediction models,” in 8th European Congress on Computational Methods in Applied Sciences and Engineering, CIMNE, 2022. doi: 10.23967/eccomas.2022.155.
Longani, C., Potharaju, S. P., & Deore, S. (2021). Price prediction for pre-owned cars using ensemble machine learning techniques. Advances in Parallel Computing, 39, 178–187. https://doi.org/10.3233/APC210194
F. R. Amik, A. Lanard, A. Ismat, and S. Momen, “Application of Machine Learning Techniques to Predict the Price of Pre-Owned Cars in Bangladesh,” Information (Switzerland), vol. 12, no. 12, Dec. 2021, doi: 10.3390/info12120514.
L. Bukvić, J. Pašagić Škrinjar, T. Fratrović, and B. Abramović, “Price Prediction and Classification of Used-Vehicles Using Supervised Machine Learning,” Sustainability (Switzerland), vol. 14, no. 24, Dec. 2022, doi: 10.3390/su142417034.
F. Abdullah, Md. A. Rahman, M. Shidujaman, M. Hasan, and Md. T. Habib, “Machine learning modeling for reconditioned car selling price prediction,” SPIE-Intl Soc Optical Eng, Sep. 2023, p. 100. doi: 10.1117/12.2689745.
Amit Kumar Tyagi and P. Chahal, “Artificial Intelligence and Machine Learning Algorithms,” IGI Global eBooks, pp. 421–446, May 2022, doi: https://doi.org/10.4018/978-1-6684-6291-1.ch024.
J. Alzubi, A. Nayyar, and A. Kumar, “Machine Learning from Theory to Algorithms: An Overview,” Journal of Physics: Conference Series, vol. 1142, no. 012012, Nov. 2018, doi: https://doi.org/10.1088/1742-6596/1142/1/012012.
E. F. Morales and H. J. Escalante, “A brief introduction to supervised, unsupervised, and reinforcement learning,” Biosignal Processing and Classification Using Computational Learning and Intelligence, pp. 111–129, 2022. doi:10.1016/b978-0-12-820125-1.00017-8.
D. C. Montgomery, E. A. Peck, and G. G. Vining, Introduction to Linear Regression Analysis. Estats Units d’Amèrica: Wiley, 2021.
E. Ostertagová, “Modelling using polynomial regression,” Procedia Engineering, vol. 48, pp. 500–506, 2012. doi:10.1016/j.proeng.2012.09.545
Z. Zhang, “Introduction to machine learning: k-nearest neighbors,” Annals of Translational Medicine, vol. 4, no. 11, pp. 218–218, Jun. 2016, doi: https://doi.org/10.21037/atm.2016.03.37.
DOI: https://doi.org/10.29040/ijcis.v5i1.147
Article Metrics
Abstract view : 701 timesPDF - 336 times
Refbacks
- There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 4.0 International License
















