Alterkawi, Laila (2022) Scaling Genetic Algorithms to Large Distributed Datasets. Doctor of Philosophy (PhD) thesis, University of Kent. (doi:10.22024/UniKent/01.02.97093) (KAR id:97093)
PDF
Language: English |
|
Download (2MB)
Preview
|
Preview |
This file may not be suitable for users of assistive technology.
Request an accessible format
|
|
Official URL: https://doi.org/10.22024/UniKent/01.02.97093 |
Resource title: | Parallelism and partitioning in large-scale GAs using spark |
---|---|
Resource type: | Publication |
DOI: | 10.1145/3321707.3321775 |
KDR/KAR URL: | |
External URL: | https://doi.org/10.1145/3321707.3321775 |
Relationship: |
Abstract
Analysing large-scale data brings promises of new levels of scientific discovery and economic value. However, the fact that such a volume of data is by its nature distributed and the need for new computational methods to be effective in the face of significant changes in data complexity and size has led to the need to develop large-scale data analytics. Genetic algorithms (GAs) have proven their flexibility in many application areas, and substantial research has been dedicated to improving their performance through parallelisation. In contrast with most previous efforts, we reject approaches based on the centralisation of data in the main memory of a single node or requiring remote access to shared/distributed memory. We focus instead on scenarios where data is partitioned across machines.
In this partitioned scenario, we explore two parallelisation models: PDMS, inspired by the traditional master-slave model, and PDMD, based on island models. We adopt the two models to distribute BioHEL, a popular large-scale single-node GA classifier, using the Spark distributed data processing platform. We investigate the effect of GA control parameters (population size and migration frequency). We study the accuracy, time performance and scalability of the proposed models. Our results show that our distributed genetic algorithm design provides a good tradeoff between accuracy and time.
Item Type: | Thesis (Doctor of Philosophy (PhD)) |
---|---|
Thesis advisor: | Migliavacca, Matteo |
DOI/Identification number: | 10.22024/UniKent/01.02.97093 |
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Laila Alterkawi |
Date Deposited: | 26 Sep 2022 07:27 UTC |
Last Modified: | 28 Sep 2022 06:13 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/97093 (The current URI for this page, for reference purposes) |
Alterkawi, Laila: | ![]() |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):