FEATURE SELECTION METHODS BASED ON MUTUAL INFORMATION FOR CLASSIFYING HETEROGENEOUS FEATURES

Ratri Enggar Pawening; Tio Darmawan; Rizqa Raaiqa Bintana; Agus Zainal Arifin; Darlis Herumurti

doi:10.21609/jiki.v9i2.384

FEATURE SELECTION METHODS BASED ON MUTUAL INFORMATION FOR CLASSIFYING HETEROGENEOUS FEATURES

Authors

Ratri Enggar Pawening Department of Informatics, STT Nurul Jadid Paiton, Jl. Pondok Pesantren Nurul Jadid Paiton
Tio Darmawan Department of Informatics, Faculty of Information Technology, Institut Teknologi Sepuluh Nopember
Rizqa Raaiqa Bintana Department of Informatics, Faculty of Information Technology, Institut Teknologi Sepuluh Nopember (ITS), Surabaya, 60111, Indonesia Department of Informatics, Faculty of Science and Technology, UIN Sultan Syarif Kasim Riau, Jl. H.R Soebrantas, Pekanbaru, 28293, Indonesia
Agus Zainal Arifin Department of Informatics, Faculty of Information Technology, Institut Teknologi Sepuluh Nopember (ITS), Surabaya, 60111, Indonesia
Darlis Herumurti Department of Informatics, Faculty of Information Technology, Institut Teknologi Sepuluh Nopember (ITS), Surabaya, 60111, Indonesia

DOI:

https://doi.org/10.21609/jiki.v9i2.384

Keywords:

Feature selection, Heterogeneous features, Joint mutual information maximation, Support vector machine, Unsupervised feature transformation

Abstract

Datasets with heterogeneous features can affect feature selection results that are not appropriate because it is difficult to evaluate heterogeneous features concurrently. Feature transformation (FT) is another way to handle heterogeneous features subset selection. The results of transformation from non-numerical into numerical features may produce redundancy to the original numerical features. In this paper, we propose a method to select feature subset based on mutual information (MI) for classifying heterogeneous features. We use unsupervised feature transformation (UFT) methods and joint mutual information maximation (JMIM) methods. UFT methods is used to transform non-numerical features into numerical features. JMIM methods is used to select feature subset with a consideration of the class label. The transformed and the original features are combined entirely, then determine features subset by using JMIM methods, and classify them using support vector machine (SVM) algorithm. The classification accuracy are measured for any number of selected feature subset and compared between UFT-JMIM methods and Dummy-JMIM methods. The average classification accuracy for all experiments in this study that can be achieved by UFT-JMIM methods is about 84.47% and Dummy-JMIM methods is about 84.24%. This result shows that UFT-JMIM methods can minimize information loss between transformed and original features, and select feature subset to avoid redundant and irrelevant features.

Downloads

Published

2016-06-25

How to Cite

Pawening, R. E., Darmawan, T., Bintana, R. R., Arifin, A. Z., & Herumurti, D. (2016). FEATURE SELECTION METHODS BASED ON MUTUAL INFORMATION FOR CLASSIFYING HETEROGENEOUS FEATURES. Jurnal Ilmu Komputer Dan Informasi, 9(2), 106–112. https://doi.org/10.21609/jiki.v9i2.384

Download Citation

Issue

Vol. 9 No. 2 (2016): Jurnal Ilmu Komputer dan Informasi (Journal of Computer Science and Information)

Section

Articles

License

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

FEATURE SELECTION METHODS BASED ON MUTUAL INFORMATION FOR CLASSIFYING HETEROGENEOUS FEATURES

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Journal Information

Call for Papers

SINTA accreditation

Indexed in

Our journal is implementing Double Blind Review for each submitted article.

Visitors