Skip to main content
Kent Academic Repository

eDNAPlus: A unifying modelling framework for DNA-based biodiversity monitoring

Diana, Alex, Matechou, Eleni, Griffin, Jim E., Yu, Douglas W., Luo, Mingjie, Tosa, Marie, Bush, Alex, Griffiths, Richard A. (2024) eDNAPlus: A unifying modelling framework for DNA-based biodiversity monitoring. Journal of the American Statistical Association, . pp. 1-33. ISSN 0162-1459. E-ISSN 1537-274X. (doi:10.1080/01621459.2024.2412362) (KAR id:107114)

PDF Publisher pdf
Language: English


Download this file
(PDF/3MB)
[thumbnail of E. Matechou - eDNAPlus  A Unifying Modeling Framework for DNA-based Biodiversity Monitoring - PPDF.pdf]
Preview
Request a format suitable for use with assistive technology e.g. a screenreader
PDF Author's Accepted Manuscript
Language: English

Restricted to Repository staff only until 23 December 2025.

Contact us about this Publication
[thumbnail of eDNAPlus_2024.pdf]
Official URL:
https://doi.org/10.1080/01621459.2024.2412362

Abstract

DNA-based biodiversity surveys, which involve collecting physical samples from survey sites and assaying them in the laboratory to detect species via their diagnostic DNA sequences, are increasingly being adopted for biodiversity monitoring and decision-making. The most commonly employed method, metabarcoding, combines PCR with high-throughput DNA sequencing to amplify and read `DNA barcode' sequences, generating count data indicating the number of times each DNA barcode was read. However, DNA-based data are noisy and error-prone, with several sources of variation, and cannot alone estimate the species-specific amount of DNA present at a surveyed site DNA biomass. In this paper, we present a unifying modelling framework for DNA-based survey data that allows estimation of changes in DNA biomass within species, across sites and their links to environmental covariates, whilst for the first time simultaneously accounting for key sources of variation, error and noise in the data-generating process, and for between-species and between-sites correlation. Bayesian inference is performed using MCMC with Laplace approximations. We describe a re-parameterisation scheme for crossed-effects models designed to improve mixing, and an adaptive approach for updating latent variables, which reduces computation time. Theoretical and simulation results are used to guide study design, including the level of replication at different survey stages and the use of quality control methods. Finally, we demonstrate our new framework on a dataset of Malaise-trap samples, quantifying the effects of elevation and distance-to-road on each species, and produce maps identifying areas of high biodiversity and species DNA biomass.

Item Type: Article
DOI/Identification number: 10.1080/01621459.2024.2412362
Uncontrolled keywords: crossed-effects model; environmental DNA; joint species distribution modelling; observation error; occupancy modelling
Subjects: Q Science > QA Mathematics (inc Computing science) > QA276 Mathematical statistics
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Mathematics, Statistics and Actuarial Science
Funders: Natural Environment Research Council (https://ror.org/02b5d8509)
Depositing User: Eleni Matechou
Date Deposited: 05 Sep 2024 12:21 UTC
Last Modified: 23 Jan 2025 10:00 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/107114 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views of this page since July 2020. For more details click on the image.