Development of Semantic-Based Voicebots and Natural Language Processing for E-Commerce Product Searches
Main Article Content
Maskur*
Yosi Afandi
Wahyu Widyananda
Ahmad Fauzi
Zhulvardyan Armayrishtya
Searching for products online is often an inefficient and confusing process, especially when users do not know the exact name of the product or use terms that differ from the search system. Keyword-based searches tend to produce irrelevant results because the system only matches text literally without understanding the meaning. As users increasingly talk to digital devices, voice-based search technology has become a more natural and intuitive alternative. This research aims to develop a semantic-based voicebot supported by Natural Language Processing (NLP) to improve the effectiveness of product searches on e-commerce platforms. The designed system not only recognizes user speech but also understands the context, intent, and semantic meaning of the given commands. The research stages include collecting user voice data, training the Automatic Speech Recognition (ASR) model for voice-to-text conversion, and applying the semantic NLP model for interpreting the context of product searches. The testing was conducted using Indonesian voice commands in a simulated e-commerce scenario. The results showed that the system achieved an average Word Error Rate (WER) of 1.29%, indicating a high level of accuracy in recognizing speech and understanding user intent. The integration between ASR and semantic NLP proved capable of creating a more natural, responsive search experience that resembles the way humans think and communicate when interacting with online search systems.
Alammar, M., El-Hindi, K., & Al-Khalifa, H. (2025). English-Arabic Hybrid Semantic Text Chunking Based on Fine-Tuning BERT. Computation, 13(6). Scopus. https://doi.org/10.3390/computation13060151
Alotaibi, R. S., Alotaibi, F. M., Nooh, S. A., & Alsulami, A. A. (2025). AI-Driven Textual Feedback Analysis in E-Training Using Enhanced RoBERTa. International Journal of Advanced Computer Science and Applications, 16(7), 255–266. Scopus. https://doi.org/10.14569/IJACSA.2025.0160727
Anidjar, O. H., Yozevitch, R., Bigon, N., Abdalla, N., Myara, B., & Marbel, R. (2023). Crossing language identification: Multilingual ASR framework based on semantic dataset creation & Wav2Vec 2.0. Machine Learning with Applications, 13, 100489. https://doi.org/10.1016/j.mlwa.2023.100489
Canchila, S., Meneses-Eraso, C., Casanoves-Boix, J., Cortés-Pellicer, P., & Castelló-Sirvent, F. (2024). Natural Language Processing: An Overview of Models, Transformers and Applied Practices. Computer Science and Information Systems, 21(3), 1097–1145. Scopus. https://doi.org/10.2298/CSIS230217031C
Chahal, H., & Mahajan, M. (2025). Voice Unbound: The Impact of Localization and Experience on Continuous Personal Voice Assistant Usage and Its Drivers. International Journal of Human–Computer Interaction, 41(14), 8606–8623. https://doi.org/10.1080/10447318.2024.2413282
Chao, M.-H., Trappey, A. J. C., & Wu, C.-T. (2021). Emerging Technologies of Natural Language-Enabled Chatbots: A Review and Trend Forecast Using Intelligent Ontology Extraction and Patent Analytics. Complexity, 2021(1), 5511866. https://doi.org/10.1155/2021/5511866
Chen, G., Xiao, S., Zhang, C., & Zhao, H. (2023). A Theory-Driven Deep Learning Method for Voice Chat–Based Customer Response Prediction. Information Systems Research, 34(4), 1513–1532. https://doi.org/10.1287/isre.2022.1196
Chuang, H.-M., & Cheng, D.-W. (2022). Conversational AI over Military Scenarios Using Intent Detection and Response Generation. Applied Sciences, 12(5), 2494. https://doi.org/10.3390/app12052494
Daniel, G., & Cabot, J. (2024). Applying model-driven engineering to the domain of chatbots: The Xatkit experience. Science of Computer Programming, 232, 103032. https://doi.org/10.1016/j.scico.2023.103032
Deng, Y., Li, Y., Zhang, W., Ding, B., & Lam, W. (2022). Toward Personalized Answer Generation in E-Commerce via Multi-perspective Preference Modeling. ACM Transactions on Information Systems, 40(4). Scopus. https://doi.org/10.1145/3507782
Edén, A. S., Sandlund, P., Faraon, M., & Rönkkö, K. (2024). VoiceBack: Design of Artificial Intelligence-Driven Voice-Based Feedback System for Customer-Agency Communication in Online Travel Services. Information, 15(8), 468. https://doi.org/10.3390/info15080468
Firmansyah, B. A., & Bachtiar, F. A. (2021). Automatic Speech Recognition Bahasa Indonesia menggunakan Unidirectional Gated Recurrent Unit. Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, 5(12), 5180–5187. https://j-ptiik.ub.ac.id/index.php/j-ptiik/article/view/10200
Gao, Y., & Liu, H. (2022). Artificial intelligence-enabled personalization in interactive marketing: A customer journey perspective. Journal of Research in Interactive Marketing, 17(5), 663–680. https://doi.org/10.1108/JRIM-01-2022-0023
Grau, M., Sieber, D., Zierau, N., & Blohm, I. (2025). Vocalizing User Feedback: The Impact of Input Modality on Self-Disclosure. Proc. ACM Hum.-Comput. Interact., 9(7), CSCW520:1-CSCW520:25. https://doi.org/10.1145/3757701
Jarin, A., Santosa, A., Uliniansyah, M. T., Aini, L. R., Nurfadhilah, E., & Gunarso, G. (2024). Automatic speech recognition for Indonesian medical dictation in cloud environment. IAES International Journal of Artificial Intelligence (IJ-AI), 13(2), 1762–1772. https://doi.org/10.11591/ijai.v13.i2.pp1762-1772
Kamoonpuri, S. Z., & Sengar, A. (2024). Love it or hate it? Deconstructing consumers’ attitudes towards AI enabled voice shopping. Journal of Consumer Behaviour, 23(5), 2395–2412. https://doi.org/10.1002/cb.2347
Karimova, G. Z. (2025). Not in our image: Rethinking anthropomorphism in expert chatbot design. AI & SOCIETY. https://doi.org/10.1007/s00146-025-02438-z
Kim, R. Y. (2024). What makes things catch on? Understanding consumer engagement with video content on social media. Electronic Commerce Research. https://doi.org/10.1007/s10660-024-09926-2
King, D., Auschaitrakul, S., & Lin, C.-W. J. (2022). Search modality effects: Merely changing product search modality alters purchase intentions. Journal of the Academy of Marketing Science, 50(6), 1236–1256. https://doi.org/10.1007/s11747-021-00820-z
Kıyak, F. M., & Oflazoğlu Dora, S. (2025). Virtual Assistant Usage Habit: The Mediating Role of Emotional and Cognitive Trust in the Context of Anthropomorphism, Enjoyment, Intelligence. Services Marketing Quarterly, 0(0), 1–29. https://doi.org/10.1080/15332969.2025.2531683
Li, D., Gao, Y., Zhu, C., Wang, Q., & Wang, R. (2023). Improving Speech Recognition Performance in Noisy Environments by Enhancing Lip Reading Accuracy. Sensors, 23(4), 2053. https://doi.org/10.3390/s23042053
Mari, A., Mandelli, A., & Algesheimer, R. (2024). Empathic voice assistants: Enhancing consumer responses in voice commerce. Journal of Business Research, 175, 114566. https://doi.org/10.1016/j.jbusres.2024.114566
Nalluri, V., Wang, Y.-Y., Jeng, W.-D., & Chen, L.-S. (2025). Extracting Advertising Elements and the Voice of Customers in Online Game Reviews. Journal of Theoretical and Applied Electronic Commerce Research, 20(4), 321. https://doi.org/10.3390/jtaer20040321
Planas, E., Daniel, G., Brambilla, M., & Cabot, J. (2021). Towards a model-driven approach for multiexperience AI-based user interfaces. Software and Systems Modeling, 20(4), 997–1009. https://doi.org/10.1007/s10270-021-00904-y
Rzepka, C., Berger, B., & Hess, T. (2022). Voice Assistant vs. Chatbot – Examining the Fit Between Conversational Agents’ Interaction Modalities and Information Search Tasks. Information Systems Frontiers, 24(3), 839–856. https://doi.org/10.1007/s10796-021-10226-5
Sel, I., & Hanbay, D. (2024). Efficient Adaptation: Enhancing Multilingual Models for Low-Resource Language Translation. Mathematics, 12(19), 3149. https://doi.org/10.3390/math12193149
Xu, K., Chen, X., Liu, F., & Huang, L. (2024). What did you hear and what did you see? Understanding the transparency of facial recognition and speech recognition systems during human–robot interaction. New Media and Society. Scopus. https://doi.org/10.1177/14614448241256899
Zhou, Z., Ding, N., Fan, X., Shang, Y., Qiu, Y., Zhuo, J., Ge, Z., Wang, S., Liu, L., Xu, S., & Zhang, H. (2023). Semantic-enhanced Modality-asymmetric Retrieval for Online E-commerce Search. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 3405–3409. https://doi.org/10.1145/3539618.3591863









