Skip to main content
Kent Academic Repository

Chapter 2 Predicting Proteomes of Mitochondria and Related Organelles from Genomic and Expressed Sequence Tag Data

Gaston, Daniel, Tsaousis, Anastasios D., Roger, Andrew J. (2009) Chapter 2 Predicting Proteomes of Mitochondria and Related Organelles from Genomic and Expressed Sequence Tag Data. Methods in Enzymology, 457 . pp. 21-47. ISSN 0076-6879. (doi:10.1016/S0076-6879(09)05002-2) (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided) (KAR id:34992)

PDF (Restricted due to copyrights) Publisher pdf
Language: English

Restricted to Repository staff only
[thumbnail of Restricted due to copyrights]
Official URL:
http://dx.doi.org/10.1016/S0076-6879(09)05002-2

Abstract

In eukaryotes, determination of the subcellular location of a novel protein encoded in genomic or transcriptomic data provides useful clues as to its possible function. However, experimental localization studies are expensive and time-consuming. As a result, accurate in silico prediction of subcellular localization from sequence data alone is an extremely important field of study in bioinformatics. This is especially so as genomic studies expand beyond model system organisms to encompass the full diversity of eukaryotes. Here we review some of the more commonly used programs for prediction of proteins that function in mitochondria, or mitochondrion-related organelles in diverse eukaryotic lineages and provide recommendations on how to apply these methods. Furthermore, we compare the predictive performance of these programs on a mixed set of mitochondrial and non-mitochondrial proteins. Although N-terminal targeting peptide prediction programs tend to have the highest accuracy, they cannot be effectively used for partial coding sequences derived from high-throughput expressed sequence tag surveys where data for the N-terminus of the encoded protein is often missing. Therefore methods that do not rely on the presence of an N-terminal targeting sequence alone are extremely useful, especially for expressed sequence tag data. The best strategy for classification of unknown proteins is to use multiple programs, incorporating a variety of prediction strategies, and closely examine the predictions with an understanding of how each of those programs will likely handle the data.

Item Type: Article
DOI/Identification number: 10.1016/S0076-6879(09)05002-2
Subjects: Q Science > QR Microbiology
Divisions: Divisions > Division of Natural Sciences > Biosciences
Depositing User: Anastasios Tsaousis
Date Deposited: 20 Aug 2013 16:55 UTC
Last Modified: 16 Nov 2021 10:12 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/34992 (The current URI for this page, for reference purposes)

University of Kent Author Information

Tsaousis, Anastasios D..

Creator's ORCID: https://orcid.org/0000-0002-5424-1905
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.