Penerapan Algoritma Cosine Similarity dan Pembobotan TF-IDF System Penerimaan Mahasiswa Baru pada Kampus Swasta

  • Apriani Apriani Universitas Bumigora
  • Hizbu Zakiyudin
  • Khairan Marzuki Universitas Bumigora

Abstract

The era of globalization is marked by the development of technology and
information, this has an impact on the human need for information. PMB
(Reception New Students) is a routine college activity at each opening of new
teachings. The implementation of PMB is not without questions that has been
asked before. By making use of technology information the an FAQ (Frequently
Asked Questions) was born which contain answers of the questions that are
often asked by people who are need information. To reduce the question
repeatedly, then an FAQ answering system was built by applying TF-IDF (Term
Frequency - Inverse Document) and the cosine similarity algorithm. TF-IDF
weighting is a method for giving weights the relationship of a word (term) to a
document is based on two concepts, namely frequency of occurrence of words in
a document and frequency inverse documents containing the word. Meanwhile,
cosine similarity is a method used to calculate the level of similarity between
two objects. This method calculates the similarity between two pieces the object
represented in two vectors using keywords of a document as a measure. This
study uses 7 data samples from all FAQ data obtained from an interview with
Ms. Susilawati, S.Kom. The sample data used will go through a process
preprocessing, TF-IDF weighting, and the cosine similarity method for
determines the highest level of similarity that will come out as a result end. By
using TF-IDF weighting and the cosine similarity method on 7 sample data can
get an accuracy rate of up to 64,28%.

References

[1] M. Rifauddin, “Pengelolaan Arsip Elektronik Berbasis Teknologi,” Khizanah Al- Hikmah Jurnal Ilmu Perpustakaan, Informasi, dan Kearsipan, vol. 4, no. 2, pp. 168–178, 2016.
[2] E. Mulyawati, “Model Perilaku Pencarian Informasi guna Memenuhi Kebutuhan Informasi (Studi Literatur),” Publis, vol. 1, no. 2, pp. 14–20, 2011.
[3] K. Marzuki and A. Apriani, “Evaluasi Penerapan Teknologi Informasi E-Learning Pada Kampus Swasta Menggunakan Cobit 4.1,” Jurnal Bumigora Information Technology (BITe), vol. 1, no. 2, pp. 161–166, 2019.
[4] S. Y. Bayquni, N. Kurniasih, and R. K. Anwar, “Pertukaran Informasi Oleh Mahasiswa Jurusan Ilmu Jurnalistik Melalui Media Kompasiana,” Jurnal Kajian Informasi dan Perpustakaan, vol. 3, no. 1, p. 71, 2015, doi: 10.24198/jkip.v3i1.9490.
[5] O. Nurdiana, J. Jumadi, and D. Nursantika, “Perbandingan Metode Cosine Similarity Dengan Metode Jaccard Similarity Pada Aplikasi Pencarian Terjemah Al-Qur’an Dalam Bahasa Indonesia,” Jurnal Online Informatika, vol. 1, no. 1, p. 59, 2016, doi: 10.15575/join.v1i1.12.
[6] R. Afandi, “Sistem Penjawab FAQ ( Frequently Asked Question ) Seputar Universitas Bumigora Menggunakan Metode Pembobotan TD - IDF dan Jaccard Similarity,” pp. 1–11, 2020.
[7] A. Deolika, K. Kusrini, and E. T. Luthfi, “Analisis Pembobotan Kata Pada Klasifikasi Text Mining,” Jurnal Teknologi Informasi, vol. 3, no. 2, p. 179, 2019, doi: 10.36294/jurti.v3i2.1077.
[8] V. Amrizal, “Penerapan Metode Term Frequency Inverse Document Frequency (Tf-Idf) Dan Cosine Similarity Pada Sistem Temu Kembali Informasi Untuk Mengetahui Syarah Hadits Berbasis Web (Studi Kasus: Hadits Shahih Bukhari-Muslim),” Jurnal Teknik Informatika, vol. 11, no. 2, pp. 149–164, 2018, doi: 10.15408/jti.v11i2.8623.
[9] I. Oyong, K. Marzuki, T. A. Lorosae, and Kusrini, “Prediksi Popularitas Artikel Berdasarkan,” in Seminar Nasional Teknologi Informasi dan Multimedia 2018 ISSN: 2302-3805, 2018, pp. 43–48, [Online]. Available: https://ojs.amikom.ac.id/index.php/semnasteknomedia/article/view/2055.
[10] M. E. Sulistyo, R. Saptono, A. Asshidiq, J. Informatika, and U. S. Maret, “Penilaian Ujian Bertype Essay Menggunakan Metode Text Similarity,” vol. 12, no. 02, pp. 146–158, 2015.
Published
2021-07-10
How to Cite
Apriani, A., Zakiyudin, H., & Marzuki, K. (2021). Penerapan Algoritma Cosine Similarity dan Pembobotan TF-IDF System Penerimaan Mahasiswa Baru pada Kampus Swasta. Jurnal Bumigora Information Technology (BITe), 3(1), 19-27. https://doi.org/https://doi.org/10.30812/bite.v3i1.1110
Section
Articles