|A novel approach based on genetic algorithm to speed up the discovery of classification rules on GPUs
|Roui MBeheshti, Zomorodi M, Sarvelayati M, Abdar M, Noori H, Plawiak P, Tadeusiewicz R, Zhou X, Khosravi A, Nahavandi S, U. Acharya R
|Elsevier, Knowledge-Based Systems
|Classification rules, Data mining, Genetic algorithm, GPU programming, Machine learning, rule discovery
This paper proposes a new approach to produce classification rules based on evolutionary computation with novel crossover and mutation operators customized for execution on graphics processing unit (GPU). Also, a novel method is presented to define the fitness function, i.e. the function which measures quantitatively the accuracy of the rule. The proposed fitness function is benefited from parallelism due to the parallel execution of data instances. To this end, two novel concepts; coverage matrix and reduction vectors are used and an altered form of the reduction vector is compared with previous works. Our CUDA program performs operations on coverage matrix and reduction vector in parallel. Also these data structures are used for evaluation of fitness function and calculation of genetic operators in parallel. We proposed a vector called average coverage to handle crossover and mutation properly. Our proposed method obtained a maximum accuracy of 99.74% for Hepatitis C Virus (HCV) dataset, 95.73% for Poker dataset, and 100% for COVID-19 dataset. Our speedup is higher than 20% for HCV and COVID-19, and 50% for Poker, compared to using single core processors.