Improved Chi Square Automatic Interaction Detection on Student’s Discontinuation to Secondary School
Abstract
Improved Chi Square Automatic Interaction Detection (CHAID) with bias correction
is the development of the CHAID method by relying on Tschuprow's T test
calculations with bias correction in the process of forming a classification tree. This
study aims to obtain a classification of factors which influence students for not
continuing their education from junior high school or equivalent to high school or
equivalent. The results obtained in the classification tree produce nine classifications.
Based on the results of the classification tree, the classification of students who do not
continue their education to high school or equivalent is: students with disabilities
who do not have access to ICTs (0.89); students who work without disability but do
not have access to ICTs (0.73); and students who do not work without disability but
do not have access to in ICTs (0.60). Based on the classification obtained the factors
which influence students for not continuing their education to high school or
equivalent are access to ICTs, employment status, and persons with disabilities. The
classification accuracy of the results uses the Improved-CHAID method with bias
correction with a proportion of 80% training data and 20% testing data, namely
72.3033% on training data and an increase of 73.3300% on testing data.
References
CHAID Analysis and Decision Tree Methods. Eurasian Journal of Educational Research, 2019(84), 115–
134.
Badan Pusat Statistik. (2020). Potret Pendidikan Indonesia, Statistik Pendidikan.
Bergsma, W. (2013). A Bias-Correction for Cramér’s V and Tschuprow’s T. Journal of the Korean Statistical
Society, 42(3), 323–328.
Çetinkaya, Z., & Horasan, F. (2021). Decision Trees in Large Data Sets. Uluslararası Muhendislik Arastirma ve
Gelistirme Dergisi, 13(1), 140–151.
Damayanti, C., Kusnandar, D., & Yudhi. (2018). Perbandingan Hasil Pembentukan Pohon Klasifikasi Metode
CHAID dan Improved CHAID. Buletin Ilmiah Mat, Stat, Dan Terapannya, 07(4), 10–27.
Eherler, D., & Lehmann, T. (2001). Responder Profiling with CHAID and Dependency Analysis. European
Conference on Machine Learning, 12, 49–58.
El-Muslih, S. A., Vionanda, D., Amalita, N., & Salma, A. (2023). Comparison of Error Rate Prediction Methods in
Classification Modeling with the CHAID Method for Imbalanced Data. UNP Journal of Statistics and Data
Science, 1(4), 321-328.
Kumar, A., & Kaur, A. (2023). Predicting complaint voicing or exit amidst Indian consumers: a CHAID analysis.
Journal of Advances in Management Research, 20(1), 55-78.
Lin, C. L., & Fan, C. L. (2019). Evaluation of CART, CHAID, and QUEST Algorithms: a Case Study of Construction
Defects in Taiwan. Journal of Asian Architecture and Building Engineering, 18(6), 539–553.
Muhajir, M. (2016). Metode Improved CHAID (Chi-Squared Automatic Interaction Detection) pada Analisis
Kredit Macet BMT (Baitul Mal Wa Tamwil). Jurnal Ilmu-Ilmu MIPA, 16(1), 55–63.
Nugraha, J. (2014). Pengantar Analisis Data Kategorik: Metode dan Aplikasi Menggunakan Program R.
Deepublish.
Shahidul, S. M., & Karim, A. H. M. Z. (2015). Factors Contributing to School Dropout Among The Girls: A Review
Literature. European Journal of Research and Reflection in Educational Sciences, 3(2), 25–36.
Singhal, R., & Rana, R. (2015). Chi-square test and its application in hypothesis testing. Journal of the Practice
of Cardiovascular Sciences, 1(1), 69.
Sulviana, V., Wigena, A. H., & Indahwati. (2018). Implementasi Metode CHAID (Chi-Squared Automatic
Interaction Detection) pada Segmentasi Trend Penjualan Minuman Ringan di Indonesia. Xplore, 2(2),
24–31.
Yang, Y., Yi, F., Deng, C., & Sun, G. (2023). Performance Analysis of the CHAID Algorithm for Accuracy.
Mathematics, 11(11), 2558.
Temu, C. C., Tolok, M. S., Azmi, P. V., & Marsisno, W. (2019). Faktor-faktor yang Memengaruhi Putus Sekolah
Usia SMA di Provinsi NTT Tahun 2016. Seminar Nasional Official Statistics, 2019(1), 583–592

This work is licensed under a Creative Commons Attribution 4.0 International License.