The Mitigating Overfitting in Sentiment Analysis Insights from CNN-LSTM Hybrid Models

Susandri Susandri; Ahmad Zamsuri; Nurliana Nasution; Yoyon Efendi; Hiba Basim Alwan

doi:10.30812/matrik.v24i2.4742

Authors

Susandri Susandri Universitas Lancang Kuning http://orcid.org/0000-0002-3875-9747
Ahmad Zamsuri Universitas Lancang Kuning , Pekanbaru, Indonesia
Nurliana Nasution Universitas Lencang Kuning, Pekanbaru, Indonesia
Yoyon Efendi University Utara Malaysia, Kedah, Malaysia
Hiba Basim Alwan University of Technology, Baghdad, Iraq

DOI:

https://doi.org/10.30812/matrik.v24i2.4742

Keywords:

Convolutional Neural Networks, Hybrid Models, Long Short-Term Memory, Mitigating Overfitting, Sentiment Analysis

Abstract

This study aims to improve sentiment analysis accuracy and address overfitting challenges in deep learning models by developing a hybrid model based on Convolutional Neural Networks and Long Short-Term Memory Networks. The research methodology involved multiple stages, starting with preprocessing a dataset of 5,456 rows. This process included removing duplicate data, empty entries, and neutral sentiments, resulting in 2,685 usable rows. To overcome data quantity limitations, data augmentation expanded the training dataset from 2,148 to 10,740 samples. Data transformation was carried out using tokenization, padding, and embedding techniques, leveraging Word2Vec and GloVe to produce numerical representations of textual data. The hybrid model demonstrated strong performance, achieving a training accuracy of 99.51%, validation accuracy of 99.25%, and testing accuracy of 87.34%, with a loss value of 0.56. Evaluation metrics showed precision, recall, and F1-Score values of 86%, 87%, and 86%, respectively. The hybrid model outperformed individual models, including Convolutional Neural Networks (70% accuracy) and Long Short-Term Memory Networks (81% accuracy). It also surpassed other hybrid models, such as the multiscale Convolutional Neural Network-Long Short-Term Memory Network, which achieved a maximum accuracy of 89.25%. The implications of this study demonstrate that the hybrid model based on Convolutional Neural Networks and Long Short-Term Memory Networks effectively improves sentiment analysis accuracy while reducing the risk of overfitting, particularly in small or imbalanced datasets. Future research is recommended to enhance data quality, adopt more advanced embedding techniques, and optimize model configurations to achieve better performance.

Downloads

Download data is not yet available.

Author Biography

Susandri Susandri, Universitas Lancang Kuning

Susandri, S.Kom, M.Kom obtained a bachelor's degree with a bachelor's degree
in computer engineering from STMIK UPI YPTK Padang and a master's degree
in information technology at UPI YPTK Padang.
Serves as a Lecturer at Universitas Lancang Kuning.
current research interests include data science, mobile computing.

References

[1] J. Khan, N. Ahmad, S. Khalid, F. Ali, and Y. Lee, â€œSentiment and Context-Aware Hybrid DNN With Attention for Text Sentiment
Classification,â€ IEEE Access, vol. 11, no. 3, pp. 28 162â€“28 179, 2023, https://doi.org/10.1109/ACCESS.2023.3259107.
[2] N. A. Semary, W. Ahmed, K. Amin, P. PÅ‚awiak, and M. Hammad, â€œImproving sentiment classification using a RoBERTa-based
hybrid model,â€ Frontiers in Human Neuroscience, vol. 17, no. 12, pp. 1â€“10, 2023, https://doi.org/10.3389/fnhum.2023.1292010.
[3] M. S. Islam, M. N. Kabir, N. A. Ghani, K. Z. Zamli, N. S. A. Zulkifli, M. M. Rahman, and M. A. Moni, â€œâ€Challenges and future
in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approachâ€,â€ Artificial Intelligence
Review, vol. 57, no. 3, pp. 1â€“79, 2024, https://doi.org/10.1007/s10462-023-10651-9.
[4] R. Geethanjali and A. Valarmathi, â€œA novel hybrid deep learning IChOA-CNN-LSTM model for modality-enriched and multilingual
emotion recognition in social media,â€ Scientific reports, vol. 14, no. 1, p. 22270, 2024, https://doi.org/10.1038/
s41598-024-73452-2.
[5] A. Wahdan, S. Hantoobi, S. A. Salloum, and K. Shaalan, â€œA systematic review of text classification research based on deep
learning models in Arabic language,â€ International Journal of Electrical and Computer Engineering, vol. 10, no. 6, pp. 6629â€“
6643, 2020, https://doi.org/10.11591/IJECE.V10I6.PP6629-6643.
[6] K. M. Hasib, S. Azam, A. Karim, A. A. Marouf, F. M. M. Shamrat, S. Montaha, K. C. Yeo, M. Jonkman, R. Alhajj, and J. G.
Rokne, â€œMCNN-LSTM: Combining CNN and LSTM to Classify Multi-Class Text in Imbalanced News Data,â€ IEEE Access,
vol. 11, no. 9, pp. 93 048â€“93 063, 2023, https://doi.org/10.1109/ACCESS.2023.3309697.
[7] N. Zhang, J. Xiong, Z. Zhao, M. Feng, X. Wang, Y. Qiao, and C. Jiang, â€œDose My Opinion Count? A CNN-LSTM Approach
for Sentiment Analysis of Indian General Elections,â€ Journal of Theory and Practice of Engineering Science, vol. 4, no. 05, pp.
40â€“50, 2024, https://doi.org/10.53469/jtpes.2024.04(05).06.
[8] A. Sungheetha, â€œTransCapsule Model for Sentiment Classification,â€ Journal of Artificial Intelligence and Capsule Networks,
vol. 02, no. 03, pp. 163â€“169, 2020, https://doi.org/10.36548/jaicn.2020.3.003.
[9] D. Chai, W. Wu, Q. Han, W. Fei, and J. Li, â€œDescription based text classification with reinforcement learning,â€ in 37th International
Conference on Machine Learning, ICML 2020, vol. 119, no. 1, 2020, pp. 1348â€“1359.
[10] P. Sudhir and V. Deshakulkarni, â€œComparative study of various approaches , applications and classifiers for sentiment analysis,â€
Global Transitions Proceedings, vol. 2, no. 2, pp. 205â€“211, 2021, https://doi.org/10.1016/j.gltp.2021.08.004.
[11] L. Irfan, S. Hussain, M. Ayoub, Y. Yu, and A. Khan, â€œA Comparative Analysis of Social Communication Applications using
Aspect Based Sentiment Analysis,â€ Pakistan Journal of Engineering and Technology, PakJET, vol. 5, no. 3, pp. 44â€“50, 2022,
https://doi.org/10.51846/vol5iss3pp44-50.
[12] O. Iparraguirre-villanueva, A. Alvarez-risco, J. Luis, H. Salazar, S. Beltozar-clemente, J. Zapata-paulini, A. Y. Jaime, and
M. Cabanillas-carbonell, â€œThe Public Health Contribution of Sentiment Analysis of Monkeypox Tweets to Detect Polarities
Using the CNN-LSTM Model number,â€ vaccines, vol. 11, no. 312, pp. 1â€“12, 2023, https://doi.org/10.3390/vaccines11020312.
[13] A. Gupta and P. Agarwal, â€œIntegrating CRM and ERP Insights for Optimized Product Development Using CNN-LSTM Hybrid
Models,â€ International Journal of Computer Trends and Technology, vol. 72, no. 8, pp. 91â€“97, 2024, https://doi.org/10.14445/
22312803/IJCTT-V72I8P113.
[14] Y. Zhou, Q. Zhang, D. Wang, and X. Gu, â€œText Sentiment Analysis Based on a New Hybrid Network Model,â€ p. 6774320,
2022, https://doi.org/10.1155/2022/6774320.
[15] N. Jin, J. Wu, X. Ma, K. Yan, and Y. Mo, â€œMulti-task learning model based on Multi-scale CNN and LSTM for sentiment
classification,â€ IEEE Access, vol. 8, no. 4, pp. 77 060â€“77 072, 2020, https://doi.org/10.1109/ACCESS.2020.2989428.
[16] S. Soumya and K. V. Pramod, â€œHybrid Deep Learning Approach for Sentiment Classification of Malayalam Tweets,â€ (IJACSA)
International Journal of Advanced Computer Science and Applications, vol. 13, no. 4, pp. 891â€“899, 2022, https://doi.org/10.
14569/IJACSA.2022.01304103.
[17] A. Mohta, A. Jain, A. Saluja, and S. Dahiya, â€œPre-processing and emoji classification of whatsapp chats for sentiment analysis,â€
in Proceedings of the 4th International Conference on IoT in Social, Mobile, Analytics and Cloud, ISMAC 2020, 2020, pp.
514â€“519, https://doi.org/10.1109/I-SMAC49090.2020.9243443.
[18] S. Wu, K. Roberts, S. Datta, J. Du, Z. Ji, Y. Si, S. Soni, Q. Wang, Q. Wei, Y. Xiang, B. Zhao, and H. Xu, â€œDeep learning in
clinical natural language processing: A methodical review,â€ pp. 457â€“470, 2020, https://doi.org/10.1093/jamia/ocz200.
[19] L. Cabral, J. Monteiro, J. Franco da Silva, C. Mattos, and P. MourËœao, â€œFakeWhastApp.BR: NLP and Machine Learning Techniques
for Misinformation Detection in Brazilian Portuguese WhatsApp Messages,â€ in Proceedings of the 23rd International
Conference on Enterprise Information Systems, vol. 1, 2021, pp. 63â€“74, https://doi.org/10.5220/0010446800630074.
[20] P. K. Jain, V. Saravanan, and R. Pamula, â€œA Hybrid CNN-LSTM : A Deep Learning Approach for Consumer Sentiment Analysis
Using Qualitative User-Generated Contents,â€ vol. 20, no. 5, pp. 1â€“15, 2021, https://doi.org/10.1145/3457206.
[21] J. Sun, R. Jin, X. Ma, J.-y. Park, K.-a. Sohn, and T.-s. Chung, â€œGated Convolutional Neural Networks for Text Classification,â€
in Advances in Computer Science and Ubiquitous Computing, J. J. Park, S. J. Fong, Y. Pan, and Y. Sung, Eds. Singapore:
Springer Singapore, 2021, pp. 309â€“316, https://doi.org/10.1007/978-981-15-9343-7 43.
[22] Y. Zhou, Q. Zhang, D. Wang, and X. Gu, â€œText Sentiment Analysis Based on a New Hybrid Network Model,â€ Computational
Intelligence and Neuroscience, vol. 2022, no. 12, pp. 1â€“15, 2022, https://doi.org/10.1155/2022/6774320.
[23] L. Khan, A. Amjad, K. M. Afaq, and H.-t. Chang, â€œDeep Sentiment Analysis Using CNN-LSTM Architecture of English
and Roman Urdu Text Shared in Social Media,â€ Applied Sciences, vol. 12, no. 6, pp. 1â€“18, 2022, https://doi.org/10.3390/
app12052694.
[24] S. Riyadi, A. Divayu Andriyani, and S. Noraini Sulaiman, â€œImproving Hate Speech Detection Using Double-Layers Hybrid
CNN-RNN Model on Imbalanced Dataset,â€ IEEE Access, vol. 12, no. 10, pp. 159 660â€“159 668, 2024, https://doi.org/10.1109/
ACCESS.2024.3487433.
[25] S. Susandri, S. Defit, and M. Tajuddin, â€œEnhancing Text Sentiment Classification with Hybrid CNN-BiLSTM Model on WhatsApp
Group,â€ Journal of Advances in Information Technology, vol. 15, no. 3, pp. 355â€“363, 2024, https://doi.org/10.12720/jait.
15.3.355-363.
[26] G. Chao, J. Liu, M. Wang, and D. Chu, â€œData augmentation for sentiment classification with semantic preservation and diversity,â€
Knowledge-Based Systems, vol. 280, no. 11, p. 111038, 2023, https://doi.org/10.1016/j.knosys.2023.111038.
[27] J. Chen, Z. Yang, and D. Yang, â€œMixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text
Classification,â€ Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, vol. 1, no. 1, pp.
2147â€“2157, 2020, https://doi.org/10.18653/v1/2020.acl-main.194.

The Mitigating Overfitting in Sentiment Analysis Insights from CNN-LSTM Hybrid Models

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biography

References

Downloads

Published

Issue

Section

How to Cite

Similar Articles

Most read articles by the same author(s)

menubaru

tools

citation