Ekstraksi Informasi Destinasi Wisata Populer Jawa Timur Menggunakan Depth-First Crawling
DOI:
https://doi.org/10.30812/matrik.v21i1.1081Keywords:
Web Mining, Web Crawling, Tripadvisor, Jawa TimurAbstract
Travel Destinations are an inseparable part of human life today. As one of the provinces with a large area, East Java is one of the most visited areas for its tourism. Many people are competing in finding information related to these tourist destinations on the internet, one of which is the Tripadvisor application. Of the many tourist attractions, several tourist attractions have different attractions and experiences each time. Tourists have widely used the Tripadvisor application in determining the location where they will visit on their vacation activities. With various features ranging from reviews and recommendations for sharing photos, TripAdvisor is one of the best applications in the inventory of tourist attractions. Of the many tourist destinations, it is necessary to analyze and evaluate both tourist attractions that have many visitors with tourist attractions that are rarely visited by both local and foreign visitors. This goal, information mining (web mining), was carried out on the TripAdvisor application to obtain information on East Java Province's popular destinations. Crawling results on the TripAdvisor website, obtained various kinds of information such as names of tourist attractions, locations, visitor reviews, photos, and ratings of these tourist attractions. Spatial Analysis, a Tourist Sentiment Analyst on tourist objects, can then be carried out. It can also be developed into the recommendation system for the best tourist attractions in East Java Province
Downloads
References
[2] “pariwisata.†http://disbudpar.jatimprov.go.id/.
[3] R. Hanifah and I. S. Nurhasanah, “Implementasi Web Crawling Untuk Mengumpulkan Web Crawling Implementation for Collecting,†J. Teknol. Inf. dan Ilmu Komput., vol. 5, no. 5, pp. 531–536, 2018, doi: 10.25126/jtiik20185842.
[4] E. Susanti and K. Mustofa, “Ekstraksi Informasi Halaman Web Menggunakan Pendekatan Bootstrapping pada Ontology-Based Information Extraction,†IJCCS (Indonesian J. Comput. Cybern. Syst., vol. 9, no. 2, p. 111, 2015, doi: 10.22146/ijccs.7540.
[5] R. Qian, K. Zhang, and G. Zhao, “A topic-specific Web crawler based on content and structure mining,†Proc. 2013 3rd Int. Conf. Comput. Sci. Netw. Technol. ICCSNT 2013, pp. 458–461, 2014, doi: 10.1109/ICCSNT.2013.6967153.
[6] N. Pawar, “Search Medicinal Plants and Relevant Diseases.â€
[7] H. Kang, S. J. Yoo, and D. Han, “Modeling web crawler wrappers to collect user reviews on shopping mall with various hierarchical tree structure,†2009 Int. Conf. Web Inf. Syst. Mining, WISM 2009, pp. 69–73, 2009, doi: 10.1109/WISM.2009.22.
[8] A. B. Archana and J. Kumar, “Location based semantic information retrieval from web documents using web crawler,†Proc. 2015 Int. Conf. Appl. Theor. Comput. Commun. Technol. iCATccT 2015, pp. 370–375, 2016, doi: 10.1109/ICATCCT.2015.7456912.
[9] L. B. Ilmawan, “Membangun Web Crawler Berbasis Web Service Untuk Data Crawling Pada Website Google Play Store,†Ilk. J. Ilm., vol. 10, no. 2, pp. 215–224, 2018, doi: 10.33096/ilkom.v10i2.282.215-224.
[10] Z. Shi, M. Shi, and W. Lin, “The Implementation of Crawling News Page Based on Incremental Web Crawler,†Proc. - 4th Int. Conf. Appl. Comput. Inf. Technol. 3rd Int. Conf. Comput. Sci. Appl. Informatics, 1st Int. Conf. Big Data, Cloud Comput. Data Sci. Eng. ACIT-CSII-BCD 2016, pp. 348–351, 2017, doi: 10.1109/ACIT-CSII-BCD.2016.073.
[11] Y. Wang, Z. Hong, and M. Shi, “Research on LDA Model Algorithm of News-oriented Web Crawler,†Proc. - 17th IEEE/ACIS Int. Conf. Comput. Inf. Sci. ICIS 2018, pp. 748–753, 2018, doi: 10.1109/ICIS.2018.8466502.
[12] N. C. C. A. Phitaloka, “Web Content Mining Di Sektor Perbankan Pada Lq45 Untuk Pendukung Keputusan Investasi Saham,†Telematika, vol. 16, no. 1, p. 18, 2019, doi: 10.31315/telematika.v16i1.2989.
[13] S. P. Kristanto, J. A. Prasetyo, and E. Pramana, “Naive Bayes Classifier on Twitter Sentiment Analysis BPJS of HEALTH,†Proc. - 2019 2nd Int. Conf. Comput. Informatics Eng. Artif. Intell. Roles Ind. Revolut. 4.0, IC2IE 2019, pp. 24–28, 2019, doi: 10.1109/IC2IE47452.2019.8940900.
[14] S. Budi, “Text Mining Untuk Analisis Sentimen Review Film Menggunakan Algoritma K-Means,†Techno.Com, vol. 16, no. 1, pp. 1–8, 2017, doi: 10.33633/tc.v16i1.1263.
[15] M. Ibrahim, O. Abdillah, A. F. Wicaksono, and M. Adriani, “Buzzer Detection and Sentiment Analysis for Predicting Presidential Election Results in a Twitter Nation,†in Proceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015, Jan. 2016, pp. 1348–1353, doi: 10.1109/ICDMW.2015.113.
[16] W. A. Luqyana, I. Cholissodin, and R. S. Perdana, “Analisis Sentimen Cyberbullying Pada Komentar Instagram dengan Metode Klasifikasi Support Vector Machine,†J. Pengemb. Teknol. Inf. dan Ilmu Komput. Univ. Brawijaya, vol. 2, no. 11, pp. 4704–4713, 2018.
[17] “No Title.†https://www.scrapehero.com/how-to-scrape-tripadvisor/.
Downloads
Published
Issue
Section
How to Cite
Similar Articles
- Mayadi Mayadi, Anthony Anggrawan, Pengembangan Sistem Informasi Pemantauan Harga Beras dan Gabah dengan Short Message Gateway , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 2 (2022)
- Anthony Anggrawan, Raisul Azhar, Bambang Krismono Triwijoyo, Mayadi Mayadi, Developing Application in Anticipating DDoS Attacks on Server Computer Machines , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 20 No. 2 (2021)
- Muhammad Ilhamdi Rusydi, Yoan Winata, Dhiny Yurichy Putri, Budi Agung Santoso, Nurul Azizah Dhuha, Muhammad Khalish, Ikhwan Arief, Hermawan Nugroho, Perancangan Platform Pengaduan Perundungan Berlandasarkan Bukti menggunakan Metode Agile , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 2 (2022)
- Aini Suri Talita, Aristiawan Wiguna, Implementasi Algoritma Long Short-Term Memory (LSTM) Untuk Mendeteksi Ujaran Kebencian (Hate Speech) Pada Kasus Pilpres 2019 , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 1 (2019)
- Bobby Poerwanto, Baso Ali, Implementasi Algoritma Fuzzy C-Means dalam Mengelompokkan Kecamatan di Tana Luwu Berdasarkan Produktifitas Hasil Perkebunan , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 1 (2019)
- Anthony Anggrawan, Dwi Kurnianingsih, Christofer Satria, Sistem Aplikasi Cerdas Klasterisasi Penerima Bantuan Covid-19 , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 2 (2022)
- Amir Ali, Klasterisasi Data Rekam Medis Pasien Menggunakan Metode K-Means Clustering di Rumah Sakit Anwar Medika Balong Bendo Sidoarjo , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 1 (2019)
- Darmansah Darmansah, Zulya Suhendro, Sistem Informasi Sekolah Pada Sekolah Dasar Negeri 21 Sungai Geringging Kabupaten Padang Pariaman Berbasis Web , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 2 (2020)
- sri suharti, Anton Yudhana, Imam Riadi, Forensik Jaringan DDoS menggunakan Metode ADDIE dan HIDS pada Sistem Operasi Proprietary , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 3 (2022)
- Siti Sauda, Eka Puji Agustini, Implementasi Prototype Model dalam Pengembangan Aplikasi Smart Cleaning Sebagai Pendukung Aplikasi Smart City , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 20 No. 1 (2020)
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Yuniar Farida, Afanin Hamidah, Silvia Kartika Sari, Lutfi Hakim, Modeling the Farmer Exchange Rate in Indonesia Using the Vector Error Correction Model Method , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 23 No. 2 (2024)