Skip to main content

Scaling Genetic Algorithms to Large Distributed Datasets

Alterkawi, Laila (2022) Scaling Genetic Algorithms to Large Distributed Datasets. Doctor of Philosophy (PhD) thesis, University of Kent. (doi:10.22024/UniKent/01.02.97093) (KAR id:97093)

PDF
Language: English
Download (2MB) Preview
[thumbnail of thesis.pdf]
Preview
This file may not be suitable for users of assistive technology.
Request an accessible format
Official URL:
https://doi.org/10.22024/UniKent/01.02.97093

Abstract

Analysing large-scale data brings promises of new levels of scientific discovery and economic value. However, the fact that such a volume of data is by its nature distributed and the need for new computational methods to be effective in the face of significant changes in data complexity and size has led to the need to develop large-scale data analytics. Genetic algorithms (GAs) have proven their flexibility in many application areas, and substantial research has been dedicated to improving their performance through parallelisation. In contrast with most previous efforts, we reject approaches based on the centralisation of data in the main memory of a single node or requiring remote access to shared/distributed memory. We focus instead on scenarios where data is partitioned across machines.

In this partitioned scenario, we explore two parallelisation models: PDMS, inspired by the traditional master-slave model, and PDMD, based on island models. We adopt the two models to distribute BioHEL, a popular large-scale single-node GA classifier, using the Spark distributed data processing platform. We investigate the effect of GA control parameters (population size and migration frequency). We study the accuracy, time performance and scalability of the proposed models. Our results show that our distributed genetic algorithm design provides a good tradeoff between accuracy and time.

Item Type: Thesis (Doctor of Philosophy (PhD))
Thesis advisor: Migliavacca, Matteo
DOI/Identification number: 10.22024/UniKent/01.02.97093
Subjects: Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming,
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User: Laila Alterkawi
Date Deposited: 26 Sep 2022 07:27 UTC
Last Modified: 28 Sep 2022 06:13 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/97093 (The current URI for this page, for reference purposes)
Alterkawi, Laila: https://orcid.org/0000-0002-7996-8529
  • Depositors only (login required):

Downloads

Downloads per month over past year