LCSS-Based Algorithm for Computing Multivariate Data Set Similarity: A Case Study of Real-Time WSN Data

Khan, Rahim and Ali, Ihsan and Altowaijri, Saleh and Zakarya, Muhammad and Rahman, Atiq Ur and Ahmedy, Ismail and Khan, Anwar and Gani, Abdullah (2019) LCSS-Based Algorithm for Computing Multivariate Data Set Similarity: A Case Study of Real-Time WSN Data. Sensors, 19 (1). p. 166. ISSN 1424-8220, DOI https://doi.org/10.3390/s19010166.

Full text not available from this repository.
Official URL: https://doi.org/10.3390/s19010166

Abstract

Multivariate data sets are common in various application areas, such as wireless sensor networks (WSNs) and DNA analysis. A robust mechanism is required to compute their similarity indexes regardless of the environment and problem domain. This study describes the usefulness of a non-metric-based approach (i.e., longest common subsequence) in computing similarity indexes. Several non-metric-based algorithms are available in the literature, the most robust and reliable one is the dynamic programming-based technique. However, dynamic programming-based techniques are considered inefficient, particularly in the context of multivariate data sets. Furthermore, the classical approaches are not powerful enough in scenarios with multivariate data sets, sensor data or when the similarity indexes are extremely high or low. To address this issue, we propose an efficient algorithm to measure the similarity indexes of multivariate data sets using a non-metric-based methodology. The proposed algorithm performs exceptionally well on numerous multivariate data sets compared with the classical dynamic programming-based algorithms. The performance of the algorithms is evaluated on the basis of several benchmark data sets and a dynamic multivariate data set, which is obtained from a WSN deployed in the Ghulam Ishaq Khan (GIK) Institute of Engineering Sciences and Technology. Our evaluation suggests that the proposed algorithm can be approximately 39.9% more efficient than its counterparts for various data sets in terms of computational time.

Item Type: Article
Funders: UNSPECIFIED
Uncontrolled Keywords: multivariate data set; longest common subsequence; dynamic programming; WSN data
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Computer Science & Information Technology
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 17 Jan 2019 05:52
Last Modified: 17 Jan 2019 05:52
URI: http://eprints.um.edu.my/id/eprint/20050

Actions (login required)

View Item View Item