The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Zhou, Naihui, Jiang, Yuxiang, Bergquist, Timothy R., Lee, Alexandra J., Kacsoh, Balint Z., Crocker, Alex W., Lewis, Kimberley A., Georghiou, George, Nguyen, Huy N., Hamid, Md Nafiz, and others. (2019) The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens. Genome Biology, 20. Article Number 244. ISSN 1474-760X. (doi:10.1186/s13059-019-1835-8) (KAR id:79143)

PDF (Publisher pdf). Language: English. This work is licensed under a Creative Commons Attribution 4.0 International License.
Official URL: https://doi.org/10.1186/s13059-019-1835-8

Abstract

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, which featured an expanded analysis over the previous CAFA rounds, both in terms of the volume of data analyzed and the types of analysis performed. In a major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in the Candida albicans and Pseudomonas aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

Item Type:	Article
DOI/Identification number:	10.1186/s13059-019-1835-8
Uncontrolled keywords:	Protein function prediction, Long-term memory, Biofilm, Critical assessment, Community challenge
Institutional Unit:	Schools > School of Natural Sciences > Biosciences; Schools > School of Engineering, Mathematics and Physics > Mathematical Sciences
Former Institutional Unit:	Divisions > Division of Natural Sciences > Biosciences; Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Mathematics, Statistics and Actuarial Science
Depositing User:	Alex Freitas
Date Deposited:	04 Dec 2019 13:21 UTC
Last Modified:	28 Apr 2026 09:08 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/79143 (The current URI for this page, for reference purposes)

University of Kent Author Information

Freitas, Alex A.

Creator's ORCID:	https://orcid.org/0000-0001-9825-4700

Antczak, Magdalena.

Creator's ORCID:	https://orcid.org/0000-0003-1503-1849
