Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem (CROSBI ID 209361)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Zekić-Sušac, Marijana ; Pfeifer, Sanja ; Šarlija, Nataša A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem // Business systems research, 5 (2014), 3; 82-96. doi: 10.2478/bsrj-2014-0021

Podaci o odgovornosti

Zekić-Sušac, Marijana ; Pfeifer, Sanja ; Šarlija, Nataša

engleski

A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour on the same dataset in order to compare their efficiency in the sense of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross- validation procedure in order to assess computing sensitivity and specificity of each model. Results: The artificial neural network model based on multilayer perceptron yielded a higher classification rate than the models produced by other methods. The pairwise t-test showed a statistical significance between the artificial neural network and the k-nearest neighbour model, while the difference among other methods was not statistically significant. Conclusions: Tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further advancement can be assured by testing a few additional methodological refinements in machine learning methods.

machine learning ; support vector machines ; artificial neural networks ; CART classification trees ; k-nearest neighbour ; large-dimensional data ; cross-validation

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

5 (3)

2014.

82-96

objavljeno

1847-8344

1847-9375

10.2478/bsrj-2014-0021

Povezanost rada

Ekonomija, Informacijske i komunikacijske znanosti

Poveznice