Sentiment Study of ChatGPT on Twitter Data with Hybrid K-Means and LSTM (Indonesian title: Analisis Sentimen Berdasarkan Hasil Klasterisasi K-Means pada Data Pengguna ChatGPT Menggunakan LSTM)

Dimas Afryzal Hanan; Ario Yudo Husodo; Regania Pasca Rassy

doi:10.30812/matrik.v24i2.4791

Authors

Dimas Afryzal Hanan, Universitas Mataram, Mataram, Indonesia
Ario Yudo Husodo, Universitas Mataram, Mataram, Indonesia
Regania Pasca Rassy, Universitas Mataram, Mataram, Indonesia

DOI:

https://doi.org/10.30812/matrik.v24i2.4791

Keywords:

ChatGPT, K-Means, Long Short-Term Memory, Sentiment Analysis, TF-IDF, Word2Vec

Abstract

The rapid evolution of artificial intelligence (AI) has transformed the way people interact with technology, and ChatGPT has emerged as a prominent innovation in natural language processing (NLP). While it offers substantial benefits, such as improved productivity and accessibility, it has also sparked debates about trust, transparency, and user experience, which makes understanding public sentiment about ChatGPT both timely and important. This study explores user sentiment by combining K-Means clustering with a Long Short-Term Memory (LSTM) model. The research used a dataset from Kaggle, which underwent extensive preprocessing, including text cleaning, tokenization, and lemmatization. Features were extracted using TF-IDF and Word2Vec, and the number of clusters was chosen with the Elbow Method and Silhouette Score. The data were grouped into three clusters focusing on ChatGPT's functions, its developers, and user activities. Sentiment analysis using LSTM achieved an accuracy of 98% after five training cycles. The findings show that negative sentiments, particularly around technical challenges and transparency, dominate user feedback and point to areas for improvement. Positive sentiments exist but are overshadowed by critical perspectives. This study underscores the importance of enhancing user trust and experience while ensuring ethical and transparent AI development. The insights are intended to guide developers and policymakers in creating AI technologies that are more user-focused and socially responsible. Future research should include multilingual and cross-platform data to give a more comprehensive picture.
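
To illustrate the cluster-selection step described in the abstract, the following is a minimal sketch, not the authors' code. It assumes a handful of hypothetical, already tokenized tweets, a smoothed IDF weighting, and small pure-Python stand-ins for TF-IDF, K-Means, and the silhouette score. It prints the within-cluster sum of squares (the quantity inspected in the Elbow Method) and the mean silhouette score for several values of k. The Word2Vec and LSTM stages are not covered.

```python
import math
import random
from collections import Counter, defaultdict

def tfidf(docs):
    """Return one L2-normalized sparse TF-IDF dict per non-empty tokenized document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumption; the paper does not state its exact weighting).
        vec = {t: (c / len(doc)) * (math.log((1 + n) / (1 + df[t])) + 1)
               for t, c in tf.items()}
        norm = math.sqrt(sum(v * v for v in vec.values()))
        vectors.append({t: v / norm for t, v in vec.items()})
    return vectors

def dist(a, b):
    """Euclidean distance between two sparse vectors stored as dicts."""
    keys = a.keys() | b.keys()
    return math.sqrt(sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in keys))

def assign(vectors, centers):
    """Label each vector with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda j: dist(v, centers[j]))
            for v in vectors]

def kmeans(vectors, k, iters=20, seed=0):
    """Plain Lloyd iterations; returns (labels, within-cluster sum of squares)."""
    centers = random.Random(seed).sample(vectors, k)
    for _ in range(iters):
        labels = assign(vectors, centers)
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the previous center if a cluster goes empty
                total = defaultdict(float)
                for m in members:
                    for t, x in m.items():
                        total[t] += x
                centers[j] = {t: s / len(members) for t, s in total.items()}
    labels = assign(vectors, centers)
    inertia = sum(dist(v, centers[lab]) ** 2 for v, lab in zip(vectors, labels))
    return labels, inertia

def silhouette(vectors, labels):
    """Mean silhouette coefficient; points in singleton clusters score 0."""
    scores = []
    for i, v in enumerate(vectors):
        by_cluster = defaultdict(list)
        for j, w in enumerate(vectors):
            if j != i:
                by_cluster[labels[j]].append(dist(v, w))
        own = by_cluster.pop(labels[i], [])
        if not own or not by_cluster:
            scores.append(0.0)
            continue
        a = sum(own) / len(own)                                 # mean intra-cluster distance
        b = min(sum(d) / len(d) for d in by_cluster.values())   # nearest other cluster
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

# Hypothetical toy corpus, loosely themed on functions, developers, and user activities.
docs = [
    ["chatgpt", "write", "code", "answer"],
    ["chatgpt", "answer", "question", "code"],
    ["chatgpt", "write", "answer", "essay"],
    ["openai", "company", "release", "model"],
    ["openai", "release", "new", "model"],
    ["openai", "company", "model", "update"],
    ["student", "use", "homework", "daily"],
    ["student", "use", "daily", "study"],
    ["use", "homework", "study", "student"],
]

vectors = tfidf(docs)
results = {}
for k in range(2, 6):
    labels, inertia = kmeans(vectors, k)
    results[k] = (inertia, silhouette(vectors, labels))
    print(f"k={k}  inertia={results[k][0]:.3f}  silhouette={results[k][1]:.3f}")
```

In this kind of analysis one would typically look for the k at which the inertia curve flattens and the silhouette score peaks; on real data a library implementation would normally replace these hand-written helpers.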


