Speech playback geometry for smart homes

McLoughlin, Ian V. and Xie, Zhi-Peng (2014) Speech playback geometry for smart homes. In: The 18th IEEE International Symposium on Consumer Electronics (ISCE 2014). IEEE, pp. 1-2. E-ISBN 978-1-4799-4592-4. (doi:10.1109/ISCE.2014.6884428) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Official URL
http://dx.doi.org/10.1109/ISCE.2014.6884428

Abstract

Smart homes are environments that intelligently communicate with users, primarily through speech, to control various devices and activities relating to daily living. Both automatic speech recognition and text-to-speech are enabling technologies; however, the basic home environment is often acoustically noisy and unconstrained in usage, leading to significant practical difficulties with both key technologies. In addition to interfering noise problems, a home occupant may not always be located physically close to either a microphone or loudspeaker. Speech intelligibility in a smart home depends on many factors including background noise, speech power and content. It is also affected by the orientation of the listeners' ears with respect to the speech source. This paper describes a simple technique to adjust perceived phase angle, and reports results from experiments to assess how the intelligibility of speech heard in such an environment will vary with respect to head position.

Item Type: Book section
DOI/Identification number: 10.1109/ISCE.2014.6884428
Uncontrolled keywords: Ear, Loudspeakers, Noise, Noise measurement, Smart homes, Speech, Speech processing, automatic speech recognition, background noise, home automation, home computing, loudspeaker, loudspeakers, microphone, microphones, smart home, smart homes, speech HCI, speech intelligibility, speech playback geometry, speech power, speech processing, text-to-speech
Subjects: T Technology
Divisions: Faculties > Sciences > School of Computing
Depositing User: Ian McLoughlin
Date Deposited: 25 Aug 2015 09:36 UTC
Last Modified: 25 Sep 2019 11:34 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/48820 (The current URI for this page, for reference purposes)
McLoughlin, Ian V.: https://orcid.org/0000-0001-7111-2008