High Accuracy Data Classification and Feature Selection for Incomplete Information Systems Using Extended Limited Tolerance Relation and Conditional Entropy Approach

Deris, Mustafa Mat and Abawajy, Jemal H. and Yanto, Iwan Tri Riyadi and Adiwijaya, Adiwijaya and Herawan, Tutut and Rofiq, Ainur and Efendi, Riswan and Jaafar, Mohamad Jazli Shafizan (2025) High Accuracy Data Classification and Feature Selection for Incomplete Information Systems Using Extended Limited Tolerance Relation and Conditional Entropy Approach. IEEE Access, 13. pp. 27657-27669. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2025.3538278.

Full text not available from this repository.

Abstract

Data classification and feature/attribute selection approaches play important role in enabling organizations to extract meaningful insights from vast and complex datasets. Besides, the accuracy and processing time are two parameters of interest to determine which approach is favourable or suitable for enormous data. Moreover, the presence of redundant, incomplete, noisy and inconsistent data made more concern to accuracy and computational resources. The issue of incomplete data is addressed in limited studies due to its complexities, particularly on data classification and accuracy as well as attribute selection. The limited tolerance relation between objects is the favourable approach used in this scenario. However, the accuracy and the data classification rate need to be improved. In this paper, a new approach called extended limited tolerance relation with the similarity precision among objects to improve the data classification with high accuracy will be presented and the feature/attribute selection is performed using conditional entropy. Comparative analysis and experiment result between the proposed approach with limited tolerance relation approach in terms of data classification and accuracy are presented. The proposed approach comparatively improved the accuracy with better data classification rate and feature selection while preserving the consistency of the information in incomplete information systems that is worthy of attention.

Item Type: Article
Funders: Adjunct Professor Program, Faculty of Economic and Business, Universitas Brawijaya (01036/UN10.A0101/B/TU.01.00.1/2024)
Uncontrolled Keywords: Information systems; Accuracy; Feature extraction; Entropy; Support vector machines; Uncertainty; Rough sets; Principal component analysis; Organizations; Information technology; Extended tolerance relation; accuracy; data reduction; similarity precision
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Computer Science & Information Technology
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 14 May 2025 08:01
Last Modified: 14 May 2025 08:01
URI: http://eprints.um.edu.my/id/eprint/48029

Actions (login required)

View Item View Item