A Query Expansion Using Support Vector Machine (SVM) and Best Matching 25 (BM25)

Muhadi Ramelan

Abstract


The information retrieval system needs to be upgraded constantly to improving retrieval performance. Expansion Query is one of the information retrieval methods for retrieve more relevant documents. Several technique expansion queries are synonym expansion, Thesaurus Expansion, and Word Embedding Expansion. This research uses a Support Vector Machine ( SVM ) to get new terms from the corpus. Some people are sometimes confused about what must to write to get the desired document or are too lazy to write a lot of words. In this research, Based on the input query data SVM will search for condition data from the existing corpus so that the data obtained will later become an expansion for the query. Adding additional terms to capture a broader range of relevant documents will retrieve more documents and improve the relevance of search results.

Full Text:

PDF

References


. Silva, Sergio, Adrián Seara Vieira, Pedro Celard, Eva Lorenzo Iglesias, and Lourdes Borrajo. 2021. “A Query Expansion Method Using Multinomial Naive Bayes.” Applied Sciences (Switzerland)

. Afuan, Lasmedi, Ahmad Ashari, and Yohanes Suyanto. 2019. “A Study: Query Expansion Methods in Information Retrieval.” Journal of Physics: Conference Series 1367(1).

. Azad, Hiteshwar Kumar, and Akshay Deepak. 2019. “Query Expansion Techniques for Information Retrieval: A Survey.” Information Processing and Management 56(5): 1698–1735.

. Blanco, Roi, and Paolo Boldi. 2012. “Extending BM25 with Multiple Query Operators.” SIGIR’12 - Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval: 921–30.

. Hou, Fenglei, and Bingxi Wang. 2001. “Text-Independent Speaker Recognition Using Support Vector Machine.” 2001 International Conferences on Info-Tech and Info-Net: A Key to Better Life, ICII 2001 - Proceedings 3(306): 402–7.

. Hwang, Young Sup. 2014. “Wrapper-Based Feature Selection Using Support Vector Machine.” Life Science Journal 11(7): 632–36.

. Maier, J., and K. Ferens. 2009. “Classification of English Phrases and SMS Text Messages Using Bayes and Support Vector Machine Classifiers.” Canadian Conference on Electrical and Computer Engineering: 415–18.

. Ramadhani, P. P., and S. Hadi. 2021. “Text Classification on the Instagram Caption Using Support Vector Machine.” Journal of Physics: Conference Series 1722(1).

. Svore, Krysta M., and Christopher J.C. Burges. 2009. “A Machine Learning Approach for Improved BM25 Retrieval.” International Conference on Information and Knowledge Management, Proceedings: 1811–14.

. Whissell, John S., and Charles L.A. Clarke. 2011. “Improving Document Clustering Using Okapi BM25 Feature Weighting.” Information Retrieval 14(5): 466–87.




DOI: https://doi.org/10.29040/ijcis.v5i2.162

Article Metrics

Abstract view : 302 times
PDF - 124 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License