Evaluating machine learning models efficacy in sentiment analysis for Moroccan Darija: An exploration with MAC dataset

2024;
: pp. 539–545
https://doi.org/10.23939/mmc2024.02.539
Received: December 18, 2023
Accepted: June 25, 2024

Sakhi H., Elfilali S.  Evaluating machine learning models efficacy in sentiment analysis for Moroccan Darija: An exploration with MAC dataset.  Mathematical Modeling and Computing. Vol. 11, No. 2, pp. 539–545 (2024)

1
Faculty of Sciences Ben M'Sik, Hassan II University
2
Faculty of Sciences Ben M'Sik, Hassan II University

Sentiment analysis is an essential technique for classifying and extracting emotions from several data sets.  While many basic methods distinguish between negative and positive emotions, advanced approaches may consider additional categories, such as neutral emotions.  This becomes very important and difficult when we need to deal with less parsed languages and dialects, such as Moroccan Darija.  Our study highlights the nuances of conducting sentiment analysis implementing the MAC dataset, which includes comments in Moroccan Darija.  Our main target is to do comparative study and research of the most used machine learning models for Arabic sentiment analysis, especially SVM, NB and KNN.  These models have proven their effectiveness in classifying and analyzing emotions in widely studied languages such as English and Arabic.  Through this comparative analysis, we aim to realize their effectiveness and adaptability in the Moroccan Darija dialect context.

  1. Liu B.  Sentiment Analysis and Opinion Mining.  Synthesis Lectures on Human Language Technologies.  5 (1), 1–167 (2012).
  2. Al-Ayyoub M., Khamaiseh A. A., Jararweh Y., Al-Kabi M. N.  A comprehensive survey of arabic sentiment analysis.  Information Processing and Management.  56 (2), 320–342 (2019).
  3. Keramatfar A., Amirkhani H.  Bibliometrics of sentiment analysis literature.  Journal of Information Science.  45 (1), 3–15 (2019).
  4. Ahmad M., Aftab S., Bashir M. S., Hameed N.  Sentiment analysis using SVM: A systematic literature review.  International Journal of Advanced Computer Science and Applications.  9 (2), 182–188 (2018).
  5. Alayba A. M., Palade V., England M., Iqbal R.  Arabic language sentiment analysis on health services.  2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR). 114–118 (2017).
  6. Al-Ayyoub M., Khamaiseh A. A., Jararweh Y., Al-Kabi M. N.  A comprehensive survey of arabic sentiment analysis.  Information Processing \& Management.  56 (2), 320–342 (2019).
  7. Arabic Fifth Most Spoken Language in the World: www.Moroccoworldnews.com.
  8. List of countries with Arabic as an official language: www.wikipedia.org.
  9. Garouani M., Kharroubi J.  MAC: An Open and Free Moroccan Arabic Corpus for Sentiment Analysis.  Innovations in Smart Cities Applications Volume 5. 849–858 (2022).
  10. Atwan J., Wedyan M., Bsoul Q., Hammadeen A., Alturki R.  The Use of Stemming in the Arabic Text and Its Impact on the Accuracy of Classification.  Scientific Programming.  2021, 1367210  (2021).
  11. Freihat A. A., Abbas M., Bella G., Giunchiglia F.  Towards an Optimal Solution to Lemmatization in Arabic.  Procedia Computer Science.  142, 132–140 (2018).
  12. Syarief M. G., Kurahman O. T., Huda A. F., Darmalaksana W.  Improving Arabic Stemmer: ISRI Stemmer.  2019 IEEE 5th International Conference on Wireless and Telematics (ICWT).  1–4 (2019).
  13. Alammary A. S.  Arabic Questions Classification Using Modified TF-IDF.  IEEE Access.  9, 95109–95122 (2021).
  14. Bekkali M., Lachkar A.  Arabic Sentiment Analysis using Different Representation Models. International Journal of Emerging Trends in Engineering Research.  8 (7), 3368–3372 (2020).
  15. Duwairi R. M., Qarqaz I.  Arabic sentiment analysis using supervised classification.  2014 International Conference on Future Internet of Things and Cloud. 579–583 (2014).
  16. Tan S., Cheng X., Wang Y., Xu H.  Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis.  Advances in Information Retrieval.  337–349 (2009).