The beauty in a beast: Minimising the effects of diverse recording quality on vowel formant measurements in sociophonetic real-time studies

Rathcke, Tamara V and Stuart-Smith, Jane and Torsney, Bernard and Harrington, Jonathan (2016) The beauty in a beast: Minimising the effects of diverse recording quality on vowel formant measurements in sociophonetic real-time studies. Speech Communication, 86. pp. 24-41. ISSN 0167-6393. (doi:https://doi.org/10.1016/j.specom.2016.11.001)

PDF - Author's Accepted Manuscript
Restricted to Repository staff only until 9 May 2018.

Creative Commons Licence
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Official URL
http://dx.doi.org/10.1016/j.specom.2016.11.001

Abstract

Sociophonetic real-time studies of vowel variation and change rely on acoustic analyses of sound recordings made at different times, often using different equipment and data collection procedures. The circumstances of a recording are known to affect formant tracking and may therefore compromise the validity of conclusions about sound changes made on the basis of real-time data. In this paper, a traditional F1/F2-analysis using linear predictive coding (LPC) was applied to the vowels /i u a/ extracted from spontaneous speech corpora of Glaswegian vernacular that were recorded in the 1970s and 2000s. We assessed the technical quality of each recording, concentrating on the average levels of noise and the properties of spectral balance, and showed that the corpus comprised mixed-quality data. A series of acoustic vowel analyses subsequently revealed that formant measurements using LPC were sensitive to the technical specification of a recording, with variable magnitudes of the effects for vowels of different qualities. We evaluated the performance of three commonly used formant normalisation procedures (Lobanov, Nearey and Watt-Fabricius) as well as normalisations by a distance ratio metric and statistical estimation, and compared these results to raw Bark-scaled formant data, showing that some of the approaches could ameliorate the impact of technical issues better than others. We discuss the implications of these results for sociophonetic research that aims to minimise extraneous influences on recorded speech data while unveiling gradual, potentially small-scale sound changes across decades.
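As a rough illustration of two of the transformations named in the abstract, the sketch below applies Lobanov normalisation (a per-speaker z-score of one formant) and a Hz-to-Bark conversion to a handful of F1 values. It is a minimal sketch under stated assumptions, not the authors' procedure: the function names and the formant values are hypothetical, the Bark conversion uses the Traunmüller (1990) approximation (the abstract does not say which formula was used), and the sample standard deviation is assumed for the z-score.

```python
from statistics import mean, stdev


def hz_to_bark(f_hz):
    """Convert Hz to Bark with the Traunmüller (1990) approximation.

    Assumption: the abstract does not specify the Bark formula used.
    """
    return 26.81 * f_hz / (1960.0 + f_hz) - 0.53


def lobanov(values):
    """Z-score one speaker's measurements of a single formant.

    Assumption: the sample standard deviation is used.
    """
    m = mean(values)
    s = stdev(values)
    return [(v - m) / s for v in values]


# Hypothetical F1 values (Hz) for one speaker's /i u a/ tokens;
# these numbers are invented for illustration, not taken from the study.
f1_hz = [300.0, 350.0, 750.0]

f1_lobanov = lobanov(f1_hz)                # speaker-relative, unitless
f1_bark = [hz_to_bark(f) for f in f1_hz]   # raw, auditory-scaled
```

The contrast is that Bark scaling rescales each measurement on its own, whereas the Lobanov transform expresses each value relative to the same speaker's mean and spread, so it depends on which tokens are included for that speaker.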

Item Type: Article
Uncontrolled keywords: Real-time corpus; Formants; Formant normalisation; Noise; SNR; Spectral tilt; Sociophonetic ‘gold standard’
Subjects: P Language and Literature > P Philology. Linguistics
Divisions: Faculties > Humanities > School of European Culture and Languages
Faculties > Humanities > School of European Culture and Languages > English Language and Linguistics
Depositing User: Tamara Rathcke
Date Deposited: 24 Nov 2016 12:08 UTC
Last Modified: 25 Nov 2016 11:37 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/58978 (The current URI for this page, for reference purposes)
Rathcke, Tamara V: https://orcid.org/0000-0002-4831-7387