Skip to main content

Shape Analysis In Protein Structure Alignment

Gkolias, Theodoros (2018) Shape Analysis In Protein Structure Alignment. Doctor of Philosophy (PhD) thesis, University of Kent,. (KAR id:66682)

Language: English
Download (7MB) Preview
[thumbnail of 189Thesis.pdf]
This file may not be suitable for users of assistive technology.
Request an accessible format


In this Thesis we explore the problem of structural alignment of protein molecules using statistical shape analysis techniques. The structural alignment problem can be divided into three smaller ones: the representation of protein structures, the sampling of possible alignments between the molecules and the evaluation of a given alignment. Previous work done in this field, can be divided in two approaches: an adhoc algorithmic approach from the Bioinformatics literature and an approach using statistical methods either in a likelihood or Bayesian framework. Both approaches address the problem from a different scope. For example, the algorithmic approach is easy to implement but lacks an overall modelling framework, and the Bayesian address this issue but sometimes the implementation is not straightforward.

Moreover, we extended our method using Bayesian techniques to perform alignments based on posterior modes. In our approach, we estimate directly the mode of the posterior distribution which provides the final alignment between two molecules. We also, choose a different approach for treating the mean parameter. In previous methods the mean was either integrated out of the likelihood density or considered as fixed. We choose to assign a prior over it and obtain its posterior mode.

Finally, we consider an extension of the likelihood model assuming a Normal density for both the matched and unmatched parts of a molecule and diagonal covariance structure. We explore two different variants. In the first we consider a fixed zero mean for the unmatched parts of the molecules and in the second we consider a common mean for both the matched and unmatched parts. Based on simulated and real results, both models seems to perform well in obtaining high number of matched atoms and high TMscore.

Item Type: Thesis (Doctor of Philosophy (PhD))
Thesis advisor: Kume, Alfred
Uncontrolled keywords: Shape Analysis, Bioinformatics, Protein Structure Alignment
Subjects: Q Science > QA Mathematics (inc Computing science)
Q Science > QA Mathematics (inc Computing science) > QA276 Mathematical statistics
Q Science > QP Physiology (Living systems)
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Mathematics, Statistics and Actuarial Science
SWORD Depositor: System Moodle
Depositing User: System Moodle
Date Deposited: 10 Apr 2018 12:16 UTC
Last Modified: 16 Feb 2021 13:54 UTC
Resource URI: (The current URI for this page, for reference purposes)
  • Depositors only (login required):