Abstract
Association rules mining (ARM) is a well-known combinatorial optimization problem aiming at extracting relevant rules from given large-scale datasets. According to the state of the art, the bio-inspired methods proved their efficiency by generating acceptable solutions in a reasonable time when dealing with small and medium size instances. Unfortunately, to cope with large instances such as the webdocs benchmark, these methods require more and more powerful processors and are time expensive. Nowadays, computing power is no longer a real issue. It can be provided by the power of emerging technologies such as graphics processing units (GPUs) that are massively multi-threaded processors. In this paper, we investigate the use of GPUs to speed up the computation. We propose two GPU-based bees swarm algorithms for association rules mining (single evaluation in GPU, SE-GPU and multiple evaluation in GPU, ME-GPU). SE-GPU aims at evaluating one rule at a time where each thread is associated with one transaction, whereas ME-GPU evaluates multiple rules in parallel on GPU where each thread is associated with several transactions. To validate our approaches, the two algorithms have been executed to solve well-known large ARM instances. Real experiments have been carried out on an Intel Xeon 64 bit quad-core processor E5520 coupled to an Nvidia Tesla C2075 GPU device. The results show that our approaches improve the execution time up to 100\(\times \) over the sequential mono-core bees swarm optimization-ARM algorithm. Moreover, the proposed approaches have been compared with CPU multi-core ones (1–8 cores). The results show that they are faster than the multi-core versions whatever is the number of used cores.
Similar content being viewed by others
References
Adil SH, Qamar S (2009) Implementation of association rule mining using CUDA. In: International conference onemerging technologies (ICET 2009). IEEE, New York
Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2) (ACM)
Arabnia HR, Oliver MA (1989) A transputer network for fast operations on digitised images. Int J Eurograph Assoc (Comput Graph Forum) 8(1):3–12
Bendjoudi A, Chekini M, Gharbi M, Mehdi M, Benatchba K, Sitayeb-Benbouzid F, Melab N (2013) Parallel B&B algorithm on hybrid multicore/GPU architecture. In: IEEE international conference on high performance computing and communications, vol 15. IEEE, New York
Boley D et al (1999) Partitioning-based clustering for web document categorization. Decis Support Syst 27(3):329–341
Brin S et al (1997) Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Rec 26(2) (ACM)
Cano A, Luna JM, Ventura S (2013) High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput 1–24
Cecilia JM, García JM, Nisbet A, Amos M, Ujaldón M (2013) Enhancing data parallelism for ant colony optimization on gpus. J Parallel Distrib Comput 73(1):42–51
Cui Q, Xiaobo G (2013) Research on parallel association rules mining on GPU. In: Proceedings of the 2nd international conference on green communications and networks 2012 (GCN 2012), vol 2. Springer, Berlin
Djenouri Y, Drias H, Habbas Z (2014) Bees swarm optimisation using multiple strategies for association rule mining. Int J Bio-Inspir Comput 6(4):239–249
Fang W et al (2009) Frequent itemset mining on graphics processors. In: Proceedings of the fifth international workshop on data management on new hardware. ACM, New York
Gada FV, Yadav KR, Shah RD (2013) Data mining in medical application for better detection of disease. IJCER 2(2):173–176
Garg R, Mishra PK (2011) Exploiting parallelism in association rule mining algorithms. Int J Adv Technol 2.2:222–232
Goethals B, Zaki MJ (2003) Frequent itemset mining implementations repository. This site contains a wide-variety of algorithms for mining frequent, closed, and maximal itemsets. http://fimi.cs.helsinki.fi
Guvenir HA, Uysal I (2000) Bilkent university function approximation repository. http://funapp.cs.bilkent.edu.tr/DataSets. Accessed 12 March 2012
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2) (ACM)
Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, Burlington
Kacprzyk J, Zadrozny S (2013) Derivation of linguistic summaries is inherently difficult: can association rule mining help? In: Towards advanced data analysis by combining soft computing and statistics, pp 291–303. Springer, Berlin
Khademolghorani F, Baraani A, Zamanifar K (2014) Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci 8
Khabzaoui M, Dhaenens C, Talbi E-G (2005) Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005
Kozawa Y, Amagasa T, Kitagawa H (2014) Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans Inf Syst 97(4):779–789
Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. Appl Soft Comput 11(1):326–336
Li H, Zhang N (2010) Mining maximal frequent itemsets on graphics processors. In: 2010 seventh international conference on fuzzy systems and knowledge discovery (FSKD), vol 3. IEEE, New York
Lin KW, Deng DJ (2010) A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments. Int J Ad Hoc Ubiquitous Comput 6(4):205–215
Liu K, Hogan WR, Crowley RS (2011) Natural language processing methods and systems for biomedical ontology learning. J Biomed Inform 44(1):163–179
Liu Z, Chen S, Cai J, Zhang E, Lan L, Zheng J, Du J (2013) Traditional Chinese medicine syndrome-related herbal prescriptions in treatment of malignant tumors. J Tradit Chin Med 33(1):19–26
Luke S (2011) Essentials of meta-heuristics, pp 11–52
Luper D, Cameron D, Miller JA, Arabnia HR (2007) Spatial and temporal target association through semantic analysis and GPS data mining. In: International conference on information and knowledge engineering (IKE’07), USA, pp 251–257 (ISBN 1-60132-050-7)
Martínez-Ballesteros M, Nepomuceno-Chamorro IA, Riquelme JC (2013) Discovering gene association networks by multi-objective evolutionary quantitative association rules. J Comput Syst Sci
Mata J, José-Luis A, Riquelme J-C (2002) An evolutionary algorithm to discover numeric association rules. In: Proceedings of the 2002 ACM symposium on applied computing. ACM, New York
Melab N, Talbi E-G (2001) A parallel genetic algorithm for rule mining. In: Proceedings of the 15th international parallel and distributed processing symposium. IEEE Computer Society, London
Melab N, Chakroun I, Bendjoudi A (2013) Graphics processing unit-accelerated bounding for branch-and-bound applied to a permutation problem using data access optimization. Concurr Comput Pract Exp
Moslehi P et al (2014) Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. Int J Comput Sci 8
Mishra BSP et al (2011) Parallel multi-objective genetic algorithms for associative classification rule mining. In: Proceedings of the 2011 international conference on communication, computing and security. ACM, New York
Olafsson S, Li X, Wu S (2008) Operations research and data mining. Eur J Oper Res 187(3):1429–1448
Orlando S et al (2002) Adaptive and resource-aware mining of frequent sets. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2003. IEEE, New York
Park JS, Chen M-S, Yu PS (1995) An effective hash-based algorithm for mining association rules, vol 24(2). ACM, New York
Park JS, Chen M-S, Yu PS (1995) Efficient parallel data mining for association rules. In: Proceedings of the fourth international conference on information and knowledge management. ACM, New York
Parthasarathy S et al (2001) Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst 3.1:1–29
Rahbarinia B, Pedram MM, Arabnia HR, Alavi Z (2010) A multi-objective scheme to hide sequential patterns. In: The proceedings of the 2010 international conference on computer and automation engineering, ICCAE, Singapore, 26–28 February 2010, pp 153–158 (E-ISBN 978-1-4244-5586-7, doi:10.1109/ICCAE.2010.5451977)
Romero C et al (2012) Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert Syst
Silvestri C, Orlando S (2012) gpuDCI: exploiting GPUs in frequent itemset mining. In: 20th euromicro international conference on parallel, distributed and network-based processing (PDP). IEEE, New York
Sun J, Pan W (2009) Parallel association rule mining for image data. In: International conference on photonics and image in agriculture engineering (PIAGENG 2009). International Society for Optics and Photonics, London
Talia D, Trunfio P, Verta O (2005) Weka4ws: a wsrf-enabled weka toolkit for distributed data mining on grids. In: Knowledge discovery in databases: PKDD 2005, pp 309–320. Springer, Berlin
Wang ZG, Wang CS (2012) A parallel association-rule mining algorithm. In: Web information systems and mining, pp 125–129. Springer, Berlin
Weiss RM (2010) GPU-accelerated data mining with swarm intelligence. Doctoral dissertation, Honors thesis. Department of Computer Science, Macalester College. http://metislogic.net/thesis.pdf
Yan X, Zhang C, Zhang S (2009) Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl 36(2):3066–3076
Yang J, Yang Y (2010) A parallel algorithm for mining association rules. In: 2nd international conference on networking and digital society (ICNDS), vol 1. IEEE, New York
Zaïane OR, El-Hajj M, Lu P (2001) Fast parallel association rule mining without candidacy generation. In: Proceedings of the IEEE international conference on data mining, ICDM 2001. IEEE, New York
Zhang F, Zhang Y, Bakos J (2011) GPApriori: GPU-accelerated frequent itemset mining. In: IEEE international conference on cluster computing (CLUSTER). IEEE, New York
Zhou J, Yu K-M, Wu B-C (2010) Parallel frequent patterns mining algorithm on GPU. In: IEEE international conference on systems man and cybernetics (SMC). IEEE, New York
Zhou Y, Tan Y (2009) GPU-based parallel particle swarm optimization. In: IEEE congress on evolutionary computation, 2009, CEC’09, pp 1493–1500. IEEE, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Djenouri, Y., Bendjoudi, A., Mehdi, M. et al. GPU-based bees swarm optimization for association rules mining. J Supercomput 71, 1318–1344 (2015). https://doi.org/10.1007/s11227-014-1366-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-014-1366-8