dc.contributor.author | Gachoki, P. K. | |
dc.contributor.author | Njoroge, G. G. | |
dc.contributor.author | Muraya, M. M. | |
dc.date.accessioned | 2022-04-19T21:21:50Z | |
dc.date.available | 2022-04-19T21:21:50Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Gachoki, P. K., Njoroge, G. G. and Muraya, M. M. (2021). Selection of optimal features in statistical modelling. In: Isutsa, D. K. (Ed.). Proceedings of the 7th International Research Conference held in Chuka University from 3rd to 4th December 2020, Chuka, Kenya, p. 555-564 | en_US |
dc.identifier.uri | http://repository.chuka.ac.ke/handle/chuka/16215 | |
dc.description | pkgachoki@gmail.com; moses.muraya@chuka.ac.ke | en_US |
dc.description.abstract | In statistical modelling, selection of optimal features entails making a selection of relevant predictor variables to be used
in development of statistical models. Most modelling studies have focused on construction of statistical models skipping
out or failing to put on record the process of selection of best features which is an integral part of statistical modeling.
This failure might lead to use of duplicated features, features that are less relevant or other that have low variance in
addition to random features which could result to poor performing prediction models. This study seeks to discuss how
feature selection can be done as a pre-requisite for statistical modeling. Some of the methods used in selection of best
features include; forward selection, backward elimination, recursive elimination, entropy selection, variance threshold
elimination, chi-square statistics, tree based selection, feature importance and correlation matrix with heat maps. This
study is vital to researchers building statistical models since use of optimal features in statistical modeling would lead to
high performing statistical models. | en_US |
dc.description.sponsorship | Chuka University | en_US |
dc.language.iso | en | en_US |
dc.publisher | Chuka University | en_US |
dc.subject | Feature selection | en_US |
dc.subject | forward selection | en_US |
dc.subject | feature importance | en_US |
dc.subject | correlation matrix with heatmaps | en_US |
dc.title | SELECTION OF OPTIMAL FEATURES IN STATISTICAL MODELLING | en_US |
dc.type | Article | en_US |