Al-Terkawi, Laila, Migliavacca, Matteo (2025) An automated parallel genetic algorithm with parametric adaptation for distributed data analysis. Scientific report, 15 . Article Number 10836. E-ISSN 2045-2322. (doi:10.1038/s41598-025-93943-0) (KAR id:109599)
|
PDF
Publisher pdf
Language: English
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
|
|
|
Download this file (PDF/2MB) |
Preview |
| Request a format suitable for use with assistive technology e.g. a screenreader | |
| Official URL: https://doi.org/10.1038/s41598-025-93943-0 |
|
Abstract
Unleashing the potential of large-scale data analysis requires advanced computational methods capable of managing the immense size and complexity of distributed data. Genetic algorithms (GAs), known for their adaptability, benefit significantly from parallelization, prompting ongoing enhancements to boost performance further. This study proposes the integration of automatic termination and population sizing mechanisms into parallel GAs to augment their flexibility and effectiveness. We extend PDMS-BioHEL and PDMD-BioHEL, two parallel GA-based classifiers implemented on the Spark platform, and through extensive experimentation, demonstrate the efficacy of our approach in enhancing computational efficiency and user-friendliness. However, while these automated strategies significantly reduce the need for manual parameter tuning, thereby increasing time efficiency, they may sometimes lead to a slight reduction in final solution accuracy, particularly under complex scenario conditions. This trade-off between efficiency and accuracy is critical, especially when high precision is paramount. Our techniques enable more efficient and effective large-scale data analysis using parallel GAs, providing a robust foundation for future advancements and inviting further investigation into balancing these aspects. [Abstract copyright: © 2025. The Author(s).]
| Item Type: | Article |
|---|---|
| DOI/Identification number: | 10.1038/s41598-025-93943-0 |
| Uncontrolled keywords: | genetic algorithms (GAs); classification; large-scale data processing; Spark; GAs parameter control |
| Subjects: | Q Science > QA Mathematics (inc Computing science) |
| Institutional Unit: | Schools > School of Computing |
| Former Institutional Unit: |
There are no former institutional units.
|
| Funders: | University of Kent (https://ror.org/00xkeyj56) |
| SWORD Depositor: | JISC Publications Router |
| Depositing User: | JISC Publications Router |
| Date Deposited: | 21 Jul 2025 10:48 UTC |
| Last Modified: | 22 Jul 2025 11:47 UTC |
| Resource URI: | https://kar.kent.ac.uk/id/eprint/109599 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):

https://orcid.org/0000-0002-5684-4865
Altmetric
Altmetric