Optimizing Rain Prediction Model Using Random Forest and Grid Search Cross-Validation for Agriculture Sector
DOI:
https://doi.org/10.30812/matrik.v23i3.3891Keywords:
Agriculture, Cross-Validation, Grid Search, Rain Prediction, Random ForestAbstract
Agriculture, as a sector that is highly influenced by weather conditions, faces challenges due to increasingly unpredictable changes in weather patterns. The aim of this research is to create an optimal rainfall prediction model to help farmers create irrigation schedules, use fertilizer, and planting schedules, and protect plants from extreme weather events. The method used in this research to obtain the best rain prediction model is to use the random forest algorithm and the grid search cross-validation algorithm. Random Forest, known for its robustness and accuracy, emerged as a suitable algorithm for predicting rain. utilizing a substantial dataset from the West Nusa Tenggara Meteorology, Climatology, and Geophysics Agency covering the period 2000 to 2023. The data is then processed first to ensure its readiness for use. This process involves removing outlier data points, empty data entries, and unused features. After the preprocessing stage, the data underwent training using the Random Forest algorithm, resulting in an R-squared value of 0.1334. To obtain the optimal model, Grid Search Cross Validation is used. The results of this research obtained the best rain prediction model with an R-squared value of 0.0268. This model will be used to predict rain in the agricultural sector. This research concludes that we can get the best rain prediction model by combining Random Forest and Gird Search Cross-Validation. For further research, we can compare other rain prediction methods, add features, and combine datasets from a wider area.
Downloads
References
[2] M. R. Aprillya and U. Chasanah, “Geographic Information System Multi Attribute Utility Theory for Flood Mitigation in Agricultural Sector,†MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 22, No. 1, PP. 117–182, 2022, DOI: https://doi.org/10.30812/matrik.v22i1.1511
[3] G. Meti, R. K. G. Krishnegowda, and G. S. Swamy, “Rainfall analysis and prediction using ensemble learning for Karnataka State,†Indonesian Journal of Electrical Engineering and Computer Science, Vol. 32, No. 2, PP. 1187–1198, Nov. 2023, DOI: http://doi.org/10.11591/ijeecs.v32.i2.pp1187-1198
[4] T. P. Agyekum, P. Antwi-Agyei, and A. J. Dougill, “The contribution of weather forecast information to agriculture, water, and energy sectors in East and West Africa: A systematic review,†Frontiers in Environmental Science, Vol. 10, No. 1, PP. 1–14, 2022. DOI: https://doi.org/10.3389/fenvs.2022.935696
[5] E. Erlin, Y. Desnelita, N. Nasution, L. Suryati, and F. Zoromi, “Dampak SMOTE terhadap Kinerja Random Forest Classifier berdasarkan Data Tidak seimbang,†MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 21, No. 3, PP. 677–690, 2022, doi: https://doi.org/10.30812/matrik.v21i3.1726
[6] P. R. Sihombing and I. F. Yuliati, “Penerapan Metode Machine Learning dalam Klasifikasi Risiko Kejadian Berat Badan Lahir Rendah di Indonesia,†MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 20, No. 2, PP. 417–426, 2021, DOI: https://doi.org/10.30812/matrik.v20i2.1174
[7] M. Schonlau and R. Y. Zou, “The random forest algorithm for statistical learning,†The Stata Journal, Vol. 20, No. 1, PP. 3–29, 2020, DOI: https://doi.org/10.1177/1536867X20909688
[8] H. Yao, X. Li, H. Pang, L. Sheng, and W. Wang, “Application of random forest algorithm in hail forecasting over Shandong Peninsula,†Atmospheric Research, Vol. 244, No. 6, PP. 1-39, 2020, DOI: https://doi.org/10.1016/j.atmosres.2020.105093
[9] V. Shalamov, V. Efimova, and A. Filchenkov, “Faster Hyperparameter Optimization via Finding Minimal Regions in Random Forest Regressor,†Procedia Computer Science, Vol. 212, No. 1, PP. 378–386, 2022. DOI: https://doi.org/10.1016/j.procs.2022.11.022.
[10] R. Rofik, R. A. Hakim, J. Unjung, B. Prasetiyo, and M. A. Muslim, “Optimization of SVM and Gradient Boosting Models Using GridSearchCV in Detecting Fake Job Postings,†MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 23, No. 2, PP. 419–430, 2024, DOI: https://doi.org/10.30812/matrik.v23i2.3566.
[11] S. Rasheed, G. K. Kumar, D. M. Rani, M. V. V. P. Kantipudi, and M. Anila, “Heart Disease Prediction Using GridSearchCV and Random Forest,†EAI Endorsed Transactions on Pervasive Health and Technology, Vol. 10, No. 1, PP. 1–8, 2024, DOI: https://doi.org/10.4108/eetpht.10.5523.
[12] S. George and B. Sumathi, “Grid Search Tuning of Hyperparameters in Random Forest Classifier for Customer Feedback Sentiment Prediction,†International Journal of Advanced Computer Science and Applications (IJACSA), Vol. 11, No. 9, PP. 173-178, 2020. DOI: https://doi.org/10.14569/IJACSA.2020.0110920.
[13] R. Meenal, P. A. Michael, D. Pamela, and E. Rajasekaran, “Weather prediction using random forest machine learning model,†Indonesian Journal of Electrical Engineering and Computer Science, Vol. 22, No. 2, PP. 1208–1215, 2021, DOI: http://doi.org/10.11591/ijeecs.v22.i2.pp1208-1215.
[14] C. D. Usman, A. P. Widodo, K. Adi, and R. Gernowo, “Rainfall prediction model in Semarang City using machine learning,†Indonesian Journal of Electrical Engineering and Computer Science, Vol. 30, No. 2, PP. 1224–1231, 2023, DOI: http://doi.org/10.11591/ijeecs.v30.i2.pp1224-1231.
[15] A. J. Hill, G. R. Herman, and R. S. Schumacher, “Forecasting severe weather with random forests,†Monthly Weather Review, Vol. 148, No. 5, PP. 2135–2161, 2020, DOI: http://doi.org/10.1175/MWR-D-19-0344.1.
[16] E. D. Loken, A. J. Clark, and A. McGovern, “Comparing and Interpreting Differently Designed Random Forests for Next-Day Severe Weather Hazard Prediction,†Weather and Forecasting, Vol. 37, No. 6, PP. 871–899, 2022, DOI: http://doi.org/10.1175/waf-d-21-0138.1.
[17] S. Efendi and P. Sihombing, “Sentiment Analysis of Food Order Tweets to Find Out Demographic Customer Profile Using SVM,†MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 21, No. 3, PP. 583–594, 2022, DOI: http://doi.org/10.30812/matrik.v21i3.1898.
[18] C. Fan, M. Chen, X. Wang, J. Wang, and B. Huang, “A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data,†Frontiers in Energy Research, Vol. 9, No. 29, PP. 1-17, 2021. DOI: https://doi.org/10.3389/fenrg.2021.652801.
[19] Y. Shaikh, V. Parvati, and S. R. Biradar, “Early disease prediction algorithm for hypertension-based diseases using data aware algorithms,†Indonesian Journal of Electrical Engineering and Computer Science, Vol. 27, No. 2, PP. 1100–1108, 2022, DOI: http://doi.org/10.11591/ijeecs.v27.i2.pp1100-1108.
[20] R. Siddabathuni, S. Palanivel, and G. L. N. Murthy, “Alzheimer image registration using hybrid random forest and deep regression network algorithm,†Indonesian Journal of Electrical Engineering and Computer Science, Vol. 33, No. 2, PP. 824–831, 2024, DOI: http://doi.org/10.11591/ijeecs.v33.i2.pp824-831.
[21] M. Adnan, A. A. S. Alarood, M. I. Uddin, and I. ur Rehman, “Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models,†PeerJ Computer Science, Vol. 8, No. 1 PP. 1–29, 2022, DOI: http://doi.org/10.7717/PEERJ-CS.803.
[22] N. W. Hesty, D. G. Cendrawati, A. Aminuddin, B. Pranoto, and A. Fahim, “Estimasi Potensi Energi Angin Indonesia Menggunakan Model Weather Research and Forecast - Four Dimension Data Assimiliation (WRF-FDDA),†Jurnal Sains Dirgantara, Vol. 19, No. 2, PP. 11–20, 2022, DOI: http://doi.org/10.30536/j.jsd.2022.v19.3614.
Downloads
Published
Issue
Section
How to Cite
Similar Articles
- Ardi Mardiana, Ade Bastian, Ano Tarsono, Dony Susandi, Safari Yonasi, Optimized YOLOv8 Model for Accurate Detection and Quantificationof Mango Flowers , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 3 (2025)
- Wahyu Styo Pratama, Didik Dwi Prasetya, Triyanna Widyaningtyas, Muhammad Zaki Wiryawan, Lalu Ganda Rady Putra, Tsukasa Hirashima, Performance Evaluation of Artificial Intelligence Models for Classification in Concept Map Quality Assessment , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 3 (2025)
- Darwan Darwan, Penggunaan Jaringan Syaraf Tiruan dan Wavelet Pada Citra EKG 12 Lead , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 20 No. 2 (2021)
- Pardomuan Robinson Sihombing, Istiqomatul Fajriyah Yuliati, Penerapan Metode Machine Learning dalam Klasifikasi Risiko Kejadian Berat Badan Lahir Rendah di Indonesia , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 20 No. 2 (2021)
- Muhammad Ibnu Choldun Rachmatullah, The Application of Repeated SMOTE for Multi Class Classification on Imbalanced Data , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 22 No. 1 (2022)
- Lusiana Efrizoni, Sarjon Defit, Muhammad Tajuddin, Anthony Anggrawan, Komparasi Ekstraksi Fitur dalam Klasifikasi Teks Multilabel Menggunakan Algoritma Machine Learning , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 21 No. 3 (2022)
- Christofer Satria, Peter Wijaya Sugijanto, Anthony Anggrawan, I Nyoman Yoga Sumadewa, Aprilia Dwi Dayani, Rini Anggriani, Multi-Algorithm Approach to Enhancing Social Assistance Efficiency Through Accurate Poverty Classification , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 1 (2024)
- Angga Rahagiyanto, Identifikasi Ekstraksi Fitur untuk Gerakan Tangan dalam Bahasa Isyarat (SIBI) Menggunakan Sensor MYO Armband , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 19 No. 1 (2019)
- Rahmaddeni Rahmaddeni, M. Teguh Wicaksono, Denok Wulandari, Agustriono Agustriono, Sang Adji Ibrahim, Enhancing Multiple Linear Regression with Stacking Ensemble for Dissolved Oxygen Estimation , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 1 (2024)
- Miftahus Sholihin, Mohd Farhan Bin Md. Fudzee, Lilik Anifah, A Novel CNN-Based Approach for Classification of Tomato Plant Diseases , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 3 (2025)
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Pradita Dwi Rahman, Heri Wijayanto, Royana Afwani, Wirarama Wesdawara, Ahmad Zafrullah Mardiansyah, Blockchain-Based TraditionalWeaving Certification and Elliptic Curve Digital Signature , MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer: Vol. 24 No. 1 (2024)