Rathcke, Tamara V, Stuart-Smith, Jane, Torsney, Bernard, Harrington, Jonathan (2016) The beauty in a beast: Minimising the effects of diverse recording quality on vowel formant measurements in sociophonetic real-time studies. Speech Communication, 86 . pp. 24-41. ISSN 0167-6393. (doi:10.1016/j.specom.2016.11.001) (KAR id:58978)
PDF
Author's Accepted Manuscript
Language: English
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
|
|
Download this file (PDF/1MB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: http://dx.doi.org/10.1016/j.specom.2016.11.001 |
Abstract
Sociophonetic real-time studies of vowel variation and change rely on acoustic analyses of sound recordings made at different times, often using different equipment and data collection procedures. The circumstances of a recording are known to affect formant tracking and may therefore compromise the validity of conclusions about sound changes made on the basis of real-time data. In this paper, a traditional F1/F2-analysis using linear predictive coding (LPC) was applied to the vowels /i u a/ extracted from spontaneous speech corpora of Glaswegian vernacular, that were recorded in the 1970s and 2000s. We assessed the technical quality of each recording, concentrating on the average levels of noise and the properties of spectral balance, and showed that the corpus comprised of mixed quality data. A series of acoustic vowel analyses subsequently unveiled that formant measurements using LPC were sensitive to the technical specification of a recording, with variable magnitudes of the effects for vowels of different qualities. We evaluated the performance of three commonly used formant normalisation procedures (Lobanov, Nearey and Watt-Fabricius) as well as normalisations by a distance ratio metric and statistical estimation, and compared these results to raw Bark-scaled formant data, showing that some of the approaches could ameliorate the impact of technical issues better than the others. We discuss the implications of these results for sociophonetic research that aims to minimise extraneous influences on recorded speech data while unveiling gradual, potentially small-scale sound changes across decades.
Item Type: | Article |
---|---|
DOI/Identification number: | 10.1016/j.specom.2016.11.001 |
Uncontrolled keywords: | Real-time corpus; Formants; Formant normalisation; Noise; SNR; Spectral tilt; Sociophonetic ‘gold standard’ |
Subjects: | P Language and Literature > P Philology. Linguistics |
Divisions: | Divisions > Division of Arts and Humanities > School of Culture and Languages |
Depositing User: | Tamara Rathcke |
Date Deposited: | 24 Nov 2016 12:08 UTC |
Last Modified: | 05 Nov 2024 10:51 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/58978 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):