The Mitigating Overfitting in Sentiment Analysis Insights from CNN-LSTM Hybrid Models
DOI:
https://doi.org/10.30812/matrik.v24i2.4742Keywords:
Convolutional Neural Networks, Hybrid Models, Long Short-Term Memory, Mitigating Overfitting, Sentiment AnalysisAbstract
This study aims to improve sentiment analysis accuracy and address overfitting challenges in deep learning models by developing a hybrid model based on Convolutional Neural Networks and Long Short-Term Memory Networks. The research methodology involved multiple stages, starting with preprocessing a dataset of 5,456 rows. This process included removing duplicate data, empty entries, and neutral sentiments, resulting in 2,685 usable rows. To overcome data quantity limitations, data augmentation expanded the training dataset from 2,148 to 10,740 samples. Data transformation was carried out using tokenization, padding, and embedding techniques, leveraging Word2Vec and GloVe to produce numerical representations of textual data. The hybrid model demonstrated strong performance, achieving a training accuracy of 99.51%, validation accuracy of 99.25%, and testing accuracy of 87.34%, with a loss value of 0.56. Evaluation metrics showed precision, recall, and F1-Score values of 86%, 87%, and 86%, respectively. The hybrid model outperformed individual models, including Convolutional Neural Networks (70% accuracy) and Long Short-Term Memory Networks (81% accuracy). It also surpassed other hybrid models, such as the multiscale Convolutional Neural Network-Long Short-Term Memory Network, which achieved a maximum accuracy of 89.25%. The implications of this study demonstrate that the hybrid model based on Convolutional Neural Networks and Long Short-Term Memory Networks effectively improves sentiment analysis accuracy while reducing the risk of overfitting, particularly in small or imbalanced datasets. Future research is recommended to enhance data quality, adopt more advanced embedding techniques, and optimize model configurations to achieve better performance.
Downloads
References
Classification,†IEEE Access, vol. 11, no. 3, pp. 28 162–28 179, 2023, https://doi.org/10.1109/ACCESS.2023.3259107.
[2] N. A. Semary, W. Ahmed, K. Amin, P. Pławiak, and M. Hammad, “Improving sentiment classification using a RoBERTa-based
hybrid model,†Frontiers in Human Neuroscience, vol. 17, no. 12, pp. 1–10, 2023, https://doi.org/10.3389/fnhum.2023.1292010.
[3] M. S. Islam, M. N. Kabir, N. A. Ghani, K. Z. Zamli, N. S. A. Zulkifli, M. M. Rahman, and M. A. Moni, “â€Challenges and future
in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approachâ€,†Artificial Intelligence
Review, vol. 57, no. 3, pp. 1–79, 2024, https://doi.org/10.1007/s10462-023-10651-9.
[4] R. Geethanjali and A. Valarmathi, “A novel hybrid deep learning IChOA-CNN-LSTM model for modality-enriched and multilingual
emotion recognition in social media,†Scientific reports, vol. 14, no. 1, p. 22270, 2024, https://doi.org/10.1038/
s41598-024-73452-2.
[5] A. Wahdan, S. Hantoobi, S. A. Salloum, and K. Shaalan, “A systematic review of text classification research based on deep
learning models in Arabic language,†International Journal of Electrical and Computer Engineering, vol. 10, no. 6, pp. 6629–
6643, 2020, https://doi.org/10.11591/IJECE.V10I6.PP6629-6643.
[6] K. M. Hasib, S. Azam, A. Karim, A. A. Marouf, F. M. M. Shamrat, S. Montaha, K. C. Yeo, M. Jonkman, R. Alhajj, and J. G.
Rokne, “MCNN-LSTM: Combining CNN and LSTM to Classify Multi-Class Text in Imbalanced News Data,†IEEE Access,
vol. 11, no. 9, pp. 93 048–93 063, 2023, https://doi.org/10.1109/ACCESS.2023.3309697.
[7] N. Zhang, J. Xiong, Z. Zhao, M. Feng, X. Wang, Y. Qiao, and C. Jiang, “Dose My Opinion Count? A CNN-LSTM Approach
for Sentiment Analysis of Indian General Elections,†Journal of Theory and Practice of Engineering Science, vol. 4, no. 05, pp.
40–50, 2024, https://doi.org/10.53469/jtpes.2024.04(05).06.
[8] A. Sungheetha, “TransCapsule Model for Sentiment Classification,†Journal of Artificial Intelligence and Capsule Networks,
vol. 02, no. 03, pp. 163–169, 2020, https://doi.org/10.36548/jaicn.2020.3.003.
[9] D. Chai, W. Wu, Q. Han, W. Fei, and J. Li, “Description based text classification with reinforcement learning,†in 37th International
Conference on Machine Learning, ICML 2020, vol. 119, no. 1, 2020, pp. 1348–1359.
[10] P. Sudhir and V. Deshakulkarni, “Comparative study of various approaches , applications and classifiers for sentiment analysis,â€
Global Transitions Proceedings, vol. 2, no. 2, pp. 205–211, 2021, https://doi.org/10.1016/j.gltp.2021.08.004.
[11] L. Irfan, S. Hussain, M. Ayoub, Y. Yu, and A. Khan, “A Comparative Analysis of Social Communication Applications using
Aspect Based Sentiment Analysis,†Pakistan Journal of Engineering and Technology, PakJET, vol. 5, no. 3, pp. 44–50, 2022,
https://doi.org/10.51846/vol5iss3pp44-50.
[12] O. Iparraguirre-villanueva, A. Alvarez-risco, J. Luis, H. Salazar, S. Beltozar-clemente, J. Zapata-paulini, A. Y. Jaime, and
M. Cabanillas-carbonell, “The Public Health Contribution of Sentiment Analysis of Monkeypox Tweets to Detect Polarities
Using the CNN-LSTM Model number,†vaccines, vol. 11, no. 312, pp. 1–12, 2023, https://doi.org/10.3390/vaccines11020312.
[13] A. Gupta and P. Agarwal, “Integrating CRM and ERP Insights for Optimized Product Development Using CNN-LSTM Hybrid
Models,†International Journal of Computer Trends and Technology, vol. 72, no. 8, pp. 91–97, 2024, https://doi.org/10.14445/
22312803/IJCTT-V72I8P113.
[14] Y. Zhou, Q. Zhang, D. Wang, and X. Gu, “Text Sentiment Analysis Based on a New Hybrid Network Model,†p. 6774320,
2022, https://doi.org/10.1155/2022/6774320.
[15] N. Jin, J. Wu, X. Ma, K. Yan, and Y. Mo, “Multi-task learning model based on Multi-scale CNN and LSTM for sentiment
classification,†IEEE Access, vol. 8, no. 4, pp. 77 060–77 072, 2020, https://doi.org/10.1109/ACCESS.2020.2989428.
[16] S. Soumya and K. V. Pramod, “Hybrid Deep Learning Approach for Sentiment Classification of Malayalam Tweets,†(IJACSA)
International Journal of Advanced Computer Science and Applications, vol. 13, no. 4, pp. 891–899, 2022, https://doi.org/10.
14569/IJACSA.2022.01304103.
[17] A. Mohta, A. Jain, A. Saluja, and S. Dahiya, “Pre-processing and emoji classification of whatsapp chats for sentiment analysis,â€
in Proceedings of the 4th International Conference on IoT in Social, Mobile, Analytics and Cloud, ISMAC 2020, 2020, pp.
514–519, https://doi.org/10.1109/I-SMAC49090.2020.9243443.
[18] S. Wu, K. Roberts, S. Datta, J. Du, Z. Ji, Y. Si, S. Soni, Q. Wang, Q. Wei, Y. Xiang, B. Zhao, and H. Xu, “Deep learning in
clinical natural language processing: A methodical review,†pp. 457–470, 2020, https://doi.org/10.1093/jamia/ocz200.
[19] L. Cabral, J. Monteiro, J. Franco da Silva, C. Mattos, and P. Mour˜ao, “FakeWhastApp.BR: NLP and Machine Learning Techniques
for Misinformation Detection in Brazilian Portuguese WhatsApp Messages,†in Proceedings of the 23rd International
Conference on Enterprise Information Systems, vol. 1, 2021, pp. 63–74, https://doi.org/10.5220/0010446800630074.
[20] P. K. Jain, V. Saravanan, and R. Pamula, “A Hybrid CNN-LSTM : A Deep Learning Approach for Consumer Sentiment Analysis
Using Qualitative User-Generated Contents,†vol. 20, no. 5, pp. 1–15, 2021, https://doi.org/10.1145/3457206.
[21] J. Sun, R. Jin, X. Ma, J.-y. Park, K.-a. Sohn, and T.-s. Chung, “Gated Convolutional Neural Networks for Text Classification,â€
in Advances in Computer Science and Ubiquitous Computing, J. J. Park, S. J. Fong, Y. Pan, and Y. Sung, Eds. Singapore:
Springer Singapore, 2021, pp. 309–316, https://doi.org/10.1007/978-981-15-9343-7 43.
[22] Y. Zhou, Q. Zhang, D. Wang, and X. Gu, “Text Sentiment Analysis Based on a New Hybrid Network Model,†Computational
Intelligence and Neuroscience, vol. 2022, no. 12, pp. 1–15, 2022, https://doi.org/10.1155/2022/6774320.
[23] L. Khan, A. Amjad, K. M. Afaq, and H.-t. Chang, “Deep Sentiment Analysis Using CNN-LSTM Architecture of English
and Roman Urdu Text Shared in Social Media,†Applied Sciences, vol. 12, no. 6, pp. 1–18, 2022, https://doi.org/10.3390/
app12052694.
[24] S. Riyadi, A. Divayu Andriyani, and S. Noraini Sulaiman, “Improving Hate Speech Detection Using Double-Layers Hybrid
CNN-RNN Model on Imbalanced Dataset,†IEEE Access, vol. 12, no. 10, pp. 159 660–159 668, 2024, https://doi.org/10.1109/
ACCESS.2024.3487433.
[25] S. Susandri, S. Defit, and M. Tajuddin, “Enhancing Text Sentiment Classification with Hybrid CNN-BiLSTM Model on WhatsApp
Group,†Journal of Advances in Information Technology, vol. 15, no. 3, pp. 355–363, 2024, https://doi.org/10.12720/jait.
15.3.355-363.
[26] G. Chao, J. Liu, M. Wang, and D. Chu, “Data augmentation for sentiment classification with semantic preservation and diversity,â€
Knowledge-Based Systems, vol. 280, no. 11, p. 111038, 2023, https://doi.org/10.1016/j.knosys.2023.111038.
[27] J. Chen, Z. Yang, and D. Yang, “MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text
Classification,†Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, vol. 1, no. 1, pp.
2147–2157, 2020, https://doi.org/10.18653/v1/2020.acl-main.194.
Downloads
Published
Issue
Section
How to Cite
Similar Articles
- Vivin Nur Aziza, Utami Dyah Syafitri, Anwar Fitrianto, Optimizing Currency Circulation Forecasts in Indonesia: A Hybrid Prophet- Long Short Term Memory Model with Hyperparameter Tuning , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 1 (2024)
- Rahman Rahman, Teguh Iman Hermanto, Meriska Defriani, Hyperparamaters Fine Tuning for Bidirectional Long Short Term Memory on Food Delivery , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 23 No. 1 (2023)
- Mukhlis Mukhlis, Puput Yuniar Maulidia, Achmad Mujib, Adi Muhajirin, Alpi Surya Perdana, Integration of Deep Learning and Autoregressive Models for Marine Data Prediction , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 1 (2024)
- Dimas Afryzal Hanan, Ario Yudo Husodo, Regania Pasca Rassy, Sentiment Study of ChatGPT on Twitter Data with Hybrid K-Means and LSTM , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 2 (2025)
- Frans Mikael Sinaga, Sio Jurnalis Pipin, Sunaryo Winardi, Karina Mannita Tarigan, Ananda Putra Brahmana, Analyzing Sentiment with Self-Organizing Map and Long Short-Term Memory Algorithms , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 23 No. 1 (2023)
- Helna Wardhana, I Made Yadi Dharma, Khairan Marzuki, Ibjan Syarif Hidayatullah, Implementation of Neural Machine Translation in Translating from Indonesian to Sasak Language , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 23 No. 2 (2024)
- Ahmad Ashril Rizal, Siti Soraya, Multi Time Steps Prediction dengan Recurrent Neural Network Long Short Term Memory , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 18 No. 1 (2018)
- Aini Suri Talita, Aristiawan Wiguna, Implementasi Algoritma Long Short-Term Memory (LSTM) Untuk Mendeteksi Ujaran Kebencian (Hate Speech) Pada Kasus Pilpres 2019 , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 1 (2019)
- Willy Riyadi, Jasmir Jasmir, Performance Prediction of Airport Traffic Using LSTM and CNN-LSTM Models , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 22 No. 3 (2023)
- Achmad Lukman, Wahju Tjahjo Saputro, Erni Seniwati, Improving Performance Convolutional Neural Networks Using Modified Pooling Function , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 23 No. 2 (2024)
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Erlin Erlin, Yenny Desnelita, Nurliana Nasution, Laili Suryati, Fransiskus Zoromi, Dampak SMOTE terhadap Kinerja Random Forest Classifier berdasarkan Data Tidak seimbang , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 3 (2022)
- Susandri susandri, Sarjon Defit, Fristi Riandari, Bosker Sinaga, Ekplorasi Timeline : Waktu Respon Pesan Terbaik WhatSapp Group “Gurauan kita STMIK Amik†, MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 20 No. 2 (2021)