Information Extraction of Popular East Java Tourist Destinations Using Depth-First Crawling
DOI: https://doi.org/10.30812/matrik.v21i1.1081
Keywords: Web Mining, Web Crawling, Tripadvisor, Jawa Timur
Abstract
Travel destinations are an inseparable part of life today. As one of the larger provinces, East Java is among the regions most visited for tourism. Many people look for information about its tourist destinations on the internet, including through the TripAdvisor application. Among the many attractions, several offer a different experience on each visit. Tourists widely use TripAdvisor to decide where to go on vacation. With features ranging from reviews and recommendations to photo sharing, TripAdvisor is one of the best applications for cataloguing tourist attractions. Given the many destinations, it is necessary to analyze and evaluate both the attractions that draw many visitors and those that are rarely visited, whether by local or foreign visitors. To this end, information mining (web mining) was carried out on TripAdvisor to obtain information on popular destinations in East Java Province. Crawling the TripAdvisor website yielded various kinds of information, such as attraction names, locations, visitor reviews, photos, and ratings. Spatial analysis and tourist sentiment analysis of these attractions can then be carried out. The results can also be developed into a recommendation system for the best tourist attractions in East Java Province.
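The abstract does not give the crawler's implementation, so the following is only a minimal sketch of what depth-first crawling over a listing hierarchy might look like. It assumes a region page that links to city pages, which in turn link to attraction pages. The `SITE` dictionary, the `Page` type, the `depth_first_crawl` function, and the attraction names are all hypothetical placeholders. No real TripAdvisor URLs, page structures, or data are used, and an in-memory lookup stands in for HTTP requests.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A page is modeled as (record, outgoing links). The record is None for
# listing pages and a dict for attraction pages. This model is hypothetical.
Page = Tuple[Optional[dict], List[str]]


def depth_first_crawl(
    start: str, fetch: Callable[[str], Page], max_depth: int = 3
) -> List[dict]:
    """Visit pages depth-first from `start` and collect attraction records."""
    records: List[dict] = []
    visited = set()
    stack = [(start, 0)]  # explicit stack instead of recursion
    while stack:
        url, depth = stack.pop()
        if url in visited or depth > max_depth:
            continue
        visited.add(url)
        record, links = fetch(url)
        if record is not None:
            records.append(record)
        # Push links in reverse so the first link on the page is followed first.
        for link in reversed(links):
            if link not in visited:
                stack.append((link, depth + 1))
    return records


# Hypothetical in-memory site standing in for real HTTP requests.
SITE: Dict[str, Page] = {
    "/east-java": (None, ["/malang", "/surabaya"]),
    "/malang": (None, ["/malang/a1"]),
    "/malang/a1": (
        {"name": "Attraction A", "location": "Malang", "rating": 4.5},
        [],
    ),
    "/surabaya": (None, ["/surabaya/b1"]),
    "/surabaya/b1": (
        {"name": "Attraction B", "location": "Surabaya", "rating": 4.0},
        [],
    ),
}

results = depth_first_crawl("/east-java", SITE.__getitem__)
for row in results:
    print(row["name"], row["location"], row["rating"])
```

In this sketch the crawler fully explores one branch (a city and its attractions) before moving on to the next, which is the defining property of depth-first traversal. The `max_depth` limit and the `visited` set are design choices of the sketch that keep the crawl bounded and prevent revisiting pages.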