Gender Classification of Twitter Users Using Convolutional Neural Network

Fitra Ahya Mubarok; Mohammad Reza Faisal; Dwi Kartini; Dodon Turianto Nugrahadi; Triando Hamonangan Saragih

doi:10.30812/matrik.v23i1.3318

Authors

Fitra Ahya Mubarok Universitas Lambung Mangkurat, Banjarmasin, Indonesia
Mohammad Reza Faisal Universitas Lambung Mangkurat, Banjarmasin, Indonesia http://orcid.org/0000-0001-5748-7639
Dwi Kartini Universitas Lambung Mangkurat, Banjarmasin, Indonesia
Dodon Turianto Nugrahadi Universitas Lambung Mangkurat, Banjarmasin, Indonesia
Triando Hamonangan Saragih Universitas Lambung Mangkurat, Banjarmasin, Indonesia

DOI:

https://doi.org/10.30812/matrik.v23i1.3318

Keywords:

Gender classification, Social media analysis, Twitter, Word2vec

Abstract

Social media has become a place for social media analysts to obtain data to gain deeper insights and understanding of user behavior, trends, public opinion, and patterns associated with social media usage. Twitter is one of the most popular social media platforms where users can share messages or â€tweetsâ€ in a short text format. However, on Twitter, user information such as gender is not shown, but without realizing it or not, there is information about it in an unstructured manner. In social media analytics, gender is one of the important data that someone likes, so this research was conducted to determine the best accuracy for gender classification. The purpose of this study was to determine whether using combined data can improve the accuracy of gender classification using data from Twitter, tweets, and descriptions. The method used was word vector representation using word2vec and the application of a 2D Convolutional Neural Network (CNN) model. Word2vec was used to generate word vector representations that take into account the context and meaning of words in the text. The 2D CNN model extracted features from the word vector representation and performed gender classification. The research aimed to compare tweet data, descriptions, and a combination of tweets and descriptions to find the most accurate. The result of this study was that combined data between tweets and

Downloads

Download data is not yet available.

References

[1] U. Sivarajah, Z. Irani, S. Gupta, and K. Mahroof, â€œRole of big data and social media analytics for business to business sustainability:
A participatory web context,â€ Industrial Marketing Management, vol. 86, no. April, pp. 163â€“179, apr 2020.
[2] J. Choi, J. Yoon, J. Chung, B. Y. Coh, and J. M. Lee, â€œSocial media analytics and business intelligence research: A systematic
review,â€ Information Processing and Management, vol. 57, no. 6, pp. 1â€“18, nov 2020.
[3] M. Vicente, F. Batista, and J. P. Carvalho, â€œGender detection of Twitter users based on multiple information sources,â€ in Studies
in Computational Intelligence. Springer Verlag, 2019, vol. 794, pp. 39â€“54.
[4] E. Fosch-Villaronga, A. Poulsen, R. SÃ¸raa, and B. Custers, â€œA little bird told me your gender: Gender inferences in social
media,â€ Information Processing & Management, vol. 58, no. 3, pp. 1â€“13, may 2021.
[5] A. Selma Zakia, â€œKlasifikasi Jenis Kelamin Pengguna Twitter dengan menggunakan Metode BM25 dan K-Nearest Neighbor
(KNN),â€ Tech. Rep. 10, 2020.
[6] S. Park and J. Woo, â€œGender classification using sentiment analysis and deep learning in a health web forum,â€ Applied Sciences
(Switzerland), vol. 9, no. 6, pp. 1â€“12, 2019.
[7] R. Alroobaea, S. Alafif, S. Alhomidi, A. Aldahass, R. Hamed, R. Mulla, and B. Alotaibi, â€œA Decision Support System for Detecting
Age and Gender from Twitter Feeds based on a Comparative Experiments,â€ International Journal of Advanced Computer
Science and Applications, vol. 11, no. 12, pp. 370â€“376, dec 2020.
[8] P. Vashisth and K. Meehan, â€œGender Classification using Twitter Text Data,â€ in 2020 31st Irish Signals and Systems Conference
(ISSC). IEEE, jun 2020, pp. 1â€“6.
[9] I. R. Hendrawan, E. Utami, and A. D. Hartanto, â€œAnalisis Perbandingan Metode Tf-Idf dan Word2vec pada Klasifikasi Teks
Sentimen Masyarakat Terhadap Produk Lokal di Indonesia,â€ Smart Comp, vol. 11, no. 3, pp. 497â€“503, 2022.
[10] M. R. Faisal, M. I. Mazdadi, R. A. Nugroho, F. Abadi, and Others, â€œEyeWitness Message Identification on Forest Fires Disaster
Using Convolutional Neural Network,â€ Journal of Data Science and Software Engineering, vol. 2, no. 2, pp. 100â€“108, 2021.
[11] K. Y. Firlia, M. R. Faisal, D. Kartini, R. A. Nugroho, and F. Abadi, â€œAnalysis of New Features on the Performance of the
Support Vector Machine Algorithm in Classification of Natural Disaster Messages,â€ in Proceedings - 2021 4th International
Conference on Computer and Informatics Engineering: IT-Based Digital Industrial Innovation for the Welfare of Society, IC2IE
2021, 2021, pp. 317â€“322.
[12] M. Rusli, â€œEkstraksi Fitur Menggunakan ModelWord2vec pada Sentiment Analysis Kolom Komentar Kuisioner Evaluasi Dosen
oleh Mahasiswa,â€ KLIK - Kumpulan Jurnal Ilmu Komputer, vol. 7, no. 1, pp. 35â€“47, mar 2020.
[13] M. Padhilah, D. Kartini, and D. T. Nugrahadi, â€œImplementasi Neural Network Multilayer Perceptron Dan Stemming Nazief &
Adriani Pada Chatbot Faq Prakerja,â€ Jurnal Sains Komputer & Informatika (J-SAKTI), vol. 6, no. 2, pp. 671â€“685, 2022.
[14] A. Nurdin, B. Anggo, S. Aji, A. Bustamin, and Z. Abidin, â€œPerbandingan Kinerja Word Embedding Word2vec, Glove, dan
Fasttext pada Klasifikasi Teks,â€ Jurnal Tekno Kompak, vol. 14, no. 2, pp. 74â€“79, 2020.
[15] L. Islami, I. Budiman, M. R. Faisal, and F. Abadi, â€œPrototype Generation Berdasarkan Geometric Mean Untuk Data Reduction
pada Algoritma K Nearest Neighbour,â€ Jurnal Data Science & Informatika ( JDSI ), vol. 2, no. 2, pp. 53â€“59, 2022.
[16] E. M. Dharma, F. L. Gaol, H. L. H. S. Warnars, and B. Soewito, â€œthe Accuracy Comparison Among Word2Vec, Glove, and
Fasttext Towards Convolution Neural Network (CNN) Text Classification,â€ Journal of Theoretical and Applied Information
Technology, vol. 100, no. 2, pp. 349â€“359, 2022.
[17] J. Bai, I. Shim, and S. Park, â€œMEXN: Multi-Stage Extraction Network for Patent Document Classification,â€ Applied Sciences,
vol. 10, no. 18, pp. 1â€“14, sep 2020.
[18] N. Ketkar and J. Moolayil, Deep Learning with Python. Berkeley: Apress, 2021.
[19] G. S. Nandini, A. S. Kumar, and C. K, â€œDropout technique for image classification based on extreme learning machine,â€ Global
Transitions Proceedings, vol. 2, no. 1, pp. 111â€“116, 2021.
[20] E. E.-D. Hemdan, M. A. Shouman, and M. E. Karar, â€œCOVIDX-Net: A Framework of Deep Learning Classifiers to Diagnose
COVID-19 in X-Ray Images,â€ 2020.

Gender Classification of Twitter Users Using Convolutional Neural Network

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

How to Cite

Similar Articles

menubaru

tools

whatsapp

citation