IMPLEMENTATION OF THE PROBABILISTIC LATENT SEMANTIC ANALYSIS METHOD FOR OPINION RETRIEVAL

  • Yusup Miftahuddin
  • Jasman Pardede
  • Afdhalul Zikri
Keywords: Pre-processing Text, Opinion Retrieval, PLSA, Query, Cosine Similarity

Abstract

Opinion retrieval is a search task in which the information the user needs is opinion rather than fact. The method used for the opinion retrieval system is probabilistic latent semantic analysis (PLSA). Searching for opinion sentences with the system involves several steps: the documents are first run through text pre-processing and converted into a matrix of term values. The PLSA method then takes the matrix values to calculate the E-step, the M-step, the matrix decomposition, and the likelihood value. The similarity between each opinion sentence and the query is calculated with the cosine similarity formula, and the resulting similarity value identifies the opinion sentences. In testing on documents with 5 query words, the highest kappa statistic score was 0.152432875, which is interpreted as slight agreement.
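
The pipeline described in the abstract (term matrix, E-step, M-step, likelihood, cosine similarity) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `plsa` and `cosine_similarity`, the toy count matrix, the number of topics, and the choice to treat the query as one extra row and compare topic mixtures P(z|d) are all assumptions made for illustration.

```python
import math
import random


def _normalize(row):
    """Scale a row so it sums to 1; fall back to uniform if it is all zero."""
    total = sum(row)
    if total == 0:
        return [1.0 / len(row)] * len(row)
    return [v / total for v in row]


def plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit PLSA to a document-term count matrix with EM.

    counts[d][w] is how often term w occurs in document d.
    Returns (p_z_d, p_w_z, history), where history holds the
    log-likelihood evaluated at the start of each iteration.
    """
    rng = random.Random(seed)
    n_docs, n_terms = len(counts), len(counts[0])
    # Random starting values for P(z|d) and P(w|z).
    p_z_d = [_normalize([rng.random() + 1e-3 for _ in range(n_topics)])
             for _ in range(n_docs)]
    p_w_z = [_normalize([rng.random() + 1e-3 for _ in range(n_terms)])
             for _ in range(n_topics)]
    history = []
    for _ in range(n_iter):
        new_z_d = [[0.0] * n_topics for _ in range(n_docs)]
        new_w_z = [[0.0] * n_terms for _ in range(n_topics)]
        log_lik = 0.0
        for d in range(n_docs):
            for w in range(n_terms):
                n = counts[d][w]
                if n == 0:
                    continue
                # E-step: P(z|d,w) is proportional to P(z|d) * P(w|z).
                joint = [p_z_d[d][z] * p_w_z[z][w] for z in range(n_topics)]
                p_w_d = sum(joint)
                log_lik += n * math.log(p_w_d)
                # Accumulate expected counts n(d,w) * P(z|d,w).
                for z in range(n_topics):
                    resp = n * joint[z] / p_w_d
                    new_z_d[d][z] += resp
                    new_w_z[z][w] += resp
        # M-step: renormalize the expected counts into probabilities.
        p_z_d = [_normalize(row) for row in new_z_d]
        p_w_z = [_normalize(row) for row in new_w_z]
        history.append(log_lik)
    return p_z_d, p_w_z, history


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical toy data: rows 0-2 are sentences, row 3 is the query.
counts = [
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 2, 1],
    [1, 1, 0, 0],
]
p_z_d, p_w_z, history = plsa(counts, n_topics=2)
query_mix = p_z_d[-1]
scores = [cosine_similarity(query_mix, row) for row in p_z_d[:-1]]
```

Here `scores` holds one similarity value per sentence, and `history` can be inspected to check that the log-likelihood does not decrease across iterations. Fitting the query together with the sentences keeps the sketch short; folding the query into an already trained model would be another reasonable design.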

Published
2016-10-29
Section
Articles