Skip to main content
Kent Academic Repository

Phylogenomic analyses data of the avian phylogenomics project

Jarvis, Erich D, Mirarab, Siavash, Aberer, Andre J, Li, Bo, Houde, Peter, Li, Cai, Ho, Simon Y W, Faircloth, Brant C, Nabholz, Benoit, Howard, Jason T, and others. (2015) Phylogenomic analyses data of the avian phylogenomics project. GigaScience, 4 (1). Article Number 4 (s13742-. ISSN 2047-217X. (doi:10.1186/s13742-014-0038-1) (KAR id:57584)

PDF Publisher pdf
Language: English

Restricted to Repository staff only
[thumbnail of Jarvis&al2015(GigaSci - avian phylogenomics).pdf]
Microsoft Word (Full author list) Supplemental Material
Language: English
Download this file
(Microsoft Word/39kB)
[thumbnail of Full author list]
Request a format suitable for use with assistive technology e.g. a screenreader
Official URL:



Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses.


Here we present the datasets associated with the phylogenomic analyses, which include sequence alignment files consisting of nucleotides, amino acids, indels, and transposable elements, as well as tree files containing gene trees and species trees. Inferring an accurate phylogeny required generating: 1) A well annotated data set across species based on genome synteny; 2) Alignments with unaligned or incorrectly overaligned sequences filtered out; and 3) Diverse data sets, including genes and their inferred trees, indels, and transposable elements. Our total evidence nucleotide tree (TENT) data set (consisting of exons, introns, and UCEs) gave what we consider our most reliable species tree when using the concatenation-based ExaML algorithm or when using statistical binning with the coalescence-based MP-EST algorithm (which we refer to as MP-EST*). Other data sets, such as the coding sequence of some exons, revealed other properties of genome evolution, namely convergence.


The Avian Phylogenomics Project is the largest vertebrate phylogenomics project to date that we are aware of. The sequence, alignment, and tree data are expected to accelerate analyses in phylogenomics and other related areas.

Item Type: Article
DOI/Identification number: 10.1186/s13742-014-0038-1
Additional information: eCollection 2015; PubMed Central PMCID: PMC4349222
Subjects: Q Science > QH Natural history
Q Science > QH Natural history > QH426 Genetics
Q Science > QL Zoology
Divisions: Divisions > Division of Natural Sciences > Biosciences
Signature Themes: Food Systems, Natural Resources and Environment
Depositing User: Mike Romanov
Date Deposited: 29 Sep 2016 19:16 UTC
Last Modified: 08 Sep 2023 14:59 UTC
Resource URI: (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.