A novel association rule mining approach using TID intermediate itemset

Aqra, Iyad and Herawan, Tutut and Ghani, Norjihan Abdul and Akhunzada, Adnan and Ali, Akhtar and Bin Razali, Ramdan and Ilahi, Manzoor and Raymond Choo, Kim-Kwang (2018) A novel association rule mining approach using TID intermediate itemset. PLoS ONE, 13 (1). e0179703. ISSN 1932-6203, DOI https://doi.org/10.1371/journal.pone.0179703.

Full text not available from this repository.
Official URL: https://doi.org/10.1371/journal.pone.0179703

Abstract

Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets.

Item Type: Article
Funders: UNSPECIFIED
Uncontrolled Keywords: Algorithms; Data Mining; Databases, Factual; Datasets as Topic
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Computer Science & Information Technology
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 20 Sep 2019 01:57
Last Modified: 20 Sep 2019 01:57
URI: http://eprints.um.edu.my/id/eprint/22458

Actions (login required)

View Item View Item