Skip to main content
Kent Academic Repository

Exploring Protein Interactions Through the Development of in silico Methodologies and Genetic Analysis

McGreig, Jake (2022) Exploring Protein Interactions Through the Development of in silico Methodologies and Genetic Analysis. Doctor of Philosophy (PhD) thesis, University of Kent,. (doi:10.22024/UniKent/01.02.103027) (KAR id:103027)

Abstract

Significant advances in sequencing technologies over the past twenty years have drastically changed scientific approaches to solving biological problems. A shift towards big data driven research and computational predictions has been made possible by vast sequence and structuradatabases, populated by sequences determined by modern sequencing methods. This thesis focuses on the use of these resources to predict protein function in the form of ligand-binding sites, in the determination of positions in proteins that are pivotal in virus pathogenesis, and in the elucidatio of amino acids in myosin proteins which are adapted to the mass of mammalian species.

Firstly, a tool developed to predict protein-ligand interactions is introduced that utilises up-to-date data resources, tools, and machine learning algorithms to identify, and assign confidence to, candidate ligand-binding positions in proteins. This work builds on 3DLigandSite in the first major update since its release. A new webserver is presented in this work for users to query their protein sequences for probable ligand-binding residues. Sequence analysis of conventional myosin sequences identified positions associated with increased body mass and noassociation with clade in the second area explored in this thesis. In beta-cardiac myosin, nine positions were found to adjust contraction velocity when mutated, highlighting the predictive power from alignments combined with a measurable phenotype. This further shows the time- and cost-saving benefits of combined computational and experimental methods.

Finally, differentially conserved positions were identified between SARS-CoV and SARS-CoV-2, determining probable variants for the source of phenotypic differences between these viruses. A webserver for other researchers to perform these analyses was developed, enabling users to query coronavirus sequences for these variants. Our analysis included all sequences available from GISAID at the time of writing. Some of the DCPs identified using these methods are shown to be found in the ACE2 binding site of the spike protein, and in close vicinity to serine protease binding sites, which are essential for virus entry into the host.

Item Type: Thesis (Doctor of Philosophy (PhD))
Thesis advisor: Wass, Mark
DOI/Identification number: 10.22024/UniKent/01.02.103027
Uncontrolled keywords: Ligand, protein, myosin, binding, coronavirus
Subjects: Q Science
Divisions: Divisions > Division of Natural Sciences > Biosciences
Funders: Engineering and Physical Sciences Research Council (https://ror.org/0439y7842)
SWORD Depositor: System Moodle
Depositing User: System Moodle
Date Deposited: 27 Sep 2023 14:14 UTC
Last Modified: 05 Nov 2024 13:09 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/103027 (The current URI for this page, for reference purposes)

University of Kent Author Information

  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.