Research Article

CLASSIFICATION OF CUSTOMER SENTIMENTS BASED ON ONLINE REVIEWS: COMPARATIVE ANALYSIS OF MACHINE LEARNING AND DEEP LEARNING ALGORITHMS

Volume: 27 Number: 3 September 3, 2024
TR EN

CLASSIFICATION OF CUSTOMER SENTIMENTS BASED ON ONLINE REVIEWS: COMPARATIVE ANALYSIS OF MACHINE LEARNING AND DEEP LEARNING ALGORITHMS

Abstract

E-commerce's transformation of consumer behavior has increased the importance of understanding customer emotions, especially in the transition from traditional retail models to online platforms. The proliferation of online shopping has fundamentally changed not only shopping habits but also consumer interactions and purchase decisions. This research aims to compare and analyze the performance of various text mining and machine learning algorithms in the context of sentiment analysis and online review data. For this purpose, analyses were performed with a total of five supervised classification algorithms including Logistic Regression, Naive Bayes, Support Vector Machine, Random Forest, AdaBoost, and a deep learning model, CNN Model. The dataset used in the study includes customer reviews obtained from a women's clothing e-commerce platform. The missing data were completed by pre-processing the dataset. Count Vectorizer and TF-IDF vectorization were performed to transform the textual data. In addition, various text preprocessing steps were applied. According to the findings obtained from the research, AdaBoost and Naive Bayes algorithms were the most effective algorithms in terms of classifying customer sentiments. No significant difference was detected in terms of the vectorization method used. Although the CNN Model showed high performance, the generalizability of the model was considered low because overfitting was detected during the training of the model.

Keywords

References

  1. Agarap, A. F. (2018). Statistical analysis on E-commerce reviews, with sentiment classification using bidirectional recurrent neural network (RNN). arXiv preprint arXiv:1805.03687.
  2. Aizawa, A. (2003). An information-theoretic perspective of TF-IDF measures. Information Processing & Management, 39(1), 45-65.
  3. Alantari, H. J., Currim, I. S., Deng, Y., & Singh, S. (2022). An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews. International Journal of Research in Marketing, 39(1), 1-19.
  4. Alexopoulou, T., Michel, M., Murakami, A., & Meurers, D. (2017). Task effects on linguistic complexity and accuracy: A large‐scale learner corpus analysis employing natural language processing techniques. Language Learning, 67(S1), 180-208.
  5. Angulakshmi, G., & ManickaChezian, R. (2014). An analysis on opinion mining: techniques and tools. International Journal of Advanced Research in Computer and Communication Engineering, 3(7), 2319-5940.
  6. Badaro, G., Baly, R., Hajj, H., Habash, N., & El-Hajj, W. (2014, October). A large scale Arabic sentiment lexicon for Arabic opinion mining. In Proceedings of the EMNLP 2014 workshop on arabic natural language processing (ANLP) (pp. 165-173).
  7. Bafna, P., Pramod, D., & Vaidya, A. (2016, March). Document clustering: TF-IDF approach. In 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) (pp. 61-66). IEEE.
  8. Barik, K., Misra, S., Ray, A. K., & Bokolo, A. (2023). LSTM-DGWO-Based sentiment analysis framework for analyzing online customer reviews. Computational Intelligence and Neuroscience, 2023.

Details

Primary Language

English

Subjects

Data Mining and Knowledge Discovery , Natural Language Processing

Journal Section

Research Article

Publication Date

September 3, 2024

Submission Date

January 16, 2024

Acceptance Date

July 31, 2024

Published in Issue

Year 2024 Volume: 27 Number: 3

APA
Sinap, V. (2024). CLASSIFICATION OF CUSTOMER SENTIMENTS BASED ON ONLINE REVIEWS: COMPARATIVE ANALYSIS OF MACHINE LEARNING AND DEEP LEARNING ALGORITHMS. Kahramanmaraş Sütçü İmam Üniversitesi Mühendislik Bilimleri Dergisi, 27(3), 779-799. https://doi.org/10.17780/ksujes.1420530

Cited By