Improved reptile search optimization algorithm using chaotic map and simulated annealing for feature selection in medical field

Elgamal, Zenab and Md Sabri, Aznul Qalid and Tubishat, Mohammad and Tbaishat, Dina and Makhadmeh, Sharif Naser and Alomari, Osama Ahmad (2022) Improved reptile search optimization algorithm using chaotic map and simulated annealing for feature selection in medical field. IEEE Access, 10. pp. 51428-51446. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2022.3174854.

Full text not available from this repository.

Abstract

The growing volume of medical datasets has produced high-dimensional feature spaces, which negatively affect machine learning (ML) classifiers. In ML, feature selection is fundamental for retaining the most relevant features and discarding redundant and irrelevant ones. Optimization algorithms have demonstrated their capability to solve feature selection problems. The Reptile Search Algorithm (RSA) is a new nature-inspired optimization algorithm that simulates crocodiles' encircling and hunting behavior. Its unique search strategy obtains promising results compared to other optimization algorithms. However, when applied to high-dimensional feature selection problems, RSA suffers from limited population diversity and entrapment in local optima. An improved metaheuristic optimizer, the Improved Reptile Search Algorithm (IRSA), is proposed to overcome these limitations and adapt RSA to the feature selection problem. Two main improvements are made to the standard RSA. The first applies chaos theory at the initialization phase of RSA to enhance its exploration of the search space. The second combines the Simulated Annealing (SA) algorithm with the exploitation search to avoid the local optima problem. IRSA was evaluated on 20 medical benchmark datasets from the UCI machine learning repository. IRSA was also compared with the standard RSA and state-of-the-art optimization algorithms, including Particle Swarm Optimization (PSO), Genetic Algorithm (GA), Grasshopper Optimization Algorithm (GOA) and Slime Mould Optimization (SMO). The evaluation metrics include the number of selected features, classification accuracy, fitness value, Wilcoxon statistical test (p-value), and convergence curve. Based on the results obtained, IRSA outperformed the original RSA and the other optimization algorithms on the majority of the medical datasets.
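The two improvements named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which chaotic map is used, so the logistic map here is an assumption, and the function names (`logistic_map_population`, `to_mask`, `sa_refine`), the 0.5 binarization threshold, the single-bit-flip neighborhood, and the cooling schedule are all hypothetical choices made for illustration. The RSA update equations themselves are omitted.

```python
import math
import random


def logistic_map_population(n_agents, n_features, x0=0.7, r=4.0):
    """Chaotic initialization (assumed logistic map, not confirmed by the abstract).

    Returns n_agents rows of n_features values in [0, 1], generated by
    iterating x <- r * x * (1 - x) instead of drawing uniform random numbers.
    """
    x = x0
    population = []
    for _ in range(n_agents):
        row = []
        for _ in range(n_features):
            x = r * x * (1.0 - x)  # one logistic-map step
            row.append(x)
        population.append(row)
    return population


def to_mask(position, threshold=0.5):
    """Binarize a continuous position into a feature mask (1 = feature kept)."""
    return [1 if value > threshold else 0 for value in position]


def sa_refine(mask, fitness, t0=1.0, cooling=0.9, steps=50, rng=None):
    """Simulated annealing around one solution; lower fitness is better.

    Each step flips one randomly chosen bit. A worse neighbor is accepted
    with probability exp(-delta / t), which lets the search leave a local
    optimum while the temperature t is still high.
    """
    rng = rng or random.Random(0)
    current, current_fit = list(mask), fitness(mask)
    best, best_fit = list(current), current_fit
    t = t0
    for _ in range(steps):
        candidate = list(current)
        i = rng.randrange(len(candidate))
        candidate[i] = 1 - candidate[i]  # flip one feature in or out
        candidate_fit = fitness(candidate)
        delta = candidate_fit - current_fit
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            current, current_fit = candidate, candidate_fit
            if current_fit < best_fit:
                best, best_fit = list(current), current_fit
        t *= cooling  # geometric cooling schedule (illustrative choice)
    return best, best_fit
```

In a feature selection setting, `fitness` would typically combine the classification error on the selected features with the fraction of features kept; any callable that maps a 0/1 mask to a number to be minimized works with this sketch.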

Item Type: Article
Funders: UNSPECIFIED
Uncontrolled Keywords: Reptile search algorithm (RSA); feature selection (FS); optimization algorithm; chaos theory; simulated annealing (SA)
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Computer Science & Information Technology > Department of Artificial Intelligence
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 12 Oct 2023 02:48
Last Modified: 12 Oct 2023 02:48
URI: http://eprints.um.edu.my/id/eprint/42371
