PERBANDINGAN METODE KLASIFIKASI KEGAGALAN SIMULASI MODEL IKLIM Perbandingan Metode Klasifikasi Kegagalan Simulasi Model Iklim
DOI:
https://doi.org/10.31004/koloni.v2i1.438Abstract
Simulation of climate model is used to produce climate models used to estimate climate in the future using some software. Simulation of climate model has two probability, they are success or failure. The problem is when the simulation is fail. There are 18 variables that used to predict the simulation. Feature selection is used to reduce the dimension of variables using RFECV method. There are 11 variables that important to simulation of climate. There are 46 from 540 simulations that fail. Furthermore, SMOTE is used to handle imbalance cases. The classification method used in this paper are Logistic Regression, Naïve Bayes, Support Vector Machine (SVM), and Random Forest. The AUC value were not significantly different for the four methods when using SMOTE. However, the highest AUC was obtained by SVM method, so the simulation of climate model can be predicted by SVM method.
Keywords: AUC, SMOTE, RFECV, Logistic Regression, SVM, Random Forest, Naïve Bayes
References
Agresti, A. (2007). An Introduction to Categorical Data Analysis. New Jersey: John Wiley & Sons, Inc.
Breiman, L. (2001). Random Forest. Machine Learning, 45, 5-32.
Breiman, L., & Cutler, A. (2003). Manual on Setting Up, Using, and Understanding Random Forest V4.0, 33. Available: https://www.stat.berkeley.edu/~breiman/Using_random_forests_v4.0.pdf.
Byrne, et al (2011). Support Vector Machine-Based Ultrawideband Breast Cancer Detection System. Journal of Electromagnetics Waves and Applications, 25(13), 1807-1816.
Chawla, et al. (2002). SMOTE: Synthetic Minority Over Sampling Technique. In Journal of Artificial Intelligence Research, 16, 321-357.
Gorunescu, F. (2011). Data Mining Concepts, Models and Techniques. Australia: Springer-Verlag Berlin Heidelberg.
Han, J., Kamber, M., & Pei, J. (2012). Data Mining: Concepts and Techniques. United States of America: Elsevier Inc.
Jacob et al. (2012). Efficient Classifier for Classification of Prognostic Breast Cancer Data Through Data Mining Techniques. Proceedings of the World Congress on Engineering and Computer Science. San Fransisco.
Khandezamin, Ziba., Naderan, Marjan., & Rasthi, M. J. (2020). Detection and Classification of Breast Cancer Using Logistic Regression Feature Selection and GMDH Calssifier. Journal of Biomedical Informatics.
Lucas, D.D. et al. (2013). Failure Analysis of Parameter-Induced Simulation Crashes in Climate Models. Geoscientific Model Development, 6, 1157-1171.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Caecilia Bintang Girik Allo, Nicea Roona Paranoan, Winda Ade Fitriya B, Sitti Rosnafi’an Sumardi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.