A comparison of microphone and speech recognition engine efficacy for mobile data entry

Jo Lumsden, Scott Durling, Irina Kondratova

Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

Abstract

The research presented in this paper is part of an ongoing investigation into how best to incorporate speech-based input within mobile data collection applications. In our previous work [1], we evaluated the ability of a single speech recognition engine to support accurate, mobile, speech-based data input. Here, we build on our previous research to compare the achievable speaker-independent accuracy rates of a variety of speech recognition engines; we also consider the relative effectiveness of different speech recognition engine and microphone pairings in terms of their ability to support accurate text entry under realistic mobile conditions of use. Our intent is to provide some initial empirical data derived from mobile, user-based evaluations to support technological decisions faced by developers of mobile applications that would benefit from, or require, speech-based data entry facilities.
Original languageEnglish
Title of host publicationOTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems
Subtitle of host publication2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS
EditorsRobert Meersman, Zahir Tari, Pilar Herrero
PublisherSpringer
Pages519-527
Number of pages9
ISBN (Print)978-3-540-88874-1
DOIs
Publication statusPublished - 2008
EventOTM '08 OTM Confederated International Workshops and Posters on On the Move to Meaningful Internet Systems - , United Kingdom
Duration: 1 Jan 20081 Jan 2008

Publication series

NameLecture notes in computer science
PublisherSpringer
ISSN (Print)0302-9743

Conference

ConferenceOTM '08 OTM Confederated International Workshops and Posters on On the Move to Meaningful Internet Systems
CountryUnited Kingdom
Period1/01/081/01/08

Fingerprint

Microphones
Speech recognition
Data acquisition
Engines

Cite this

Lumsden, J., Durling, S., & Kondratova, I. (2008). A comparison of microphone and speech recognition engine efficacy for mobile data entry. In R. Meersman, Z. Tari, & P. Herrero (Eds.), OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems: 2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS (pp. 519-527). (Lecture notes in computer science). Springer. https://doi.org/10.1007/978-3-540-88875-8_75
Lumsden, Jo ; Durling, Scott ; Kondratova, Irina. / A comparison of microphone and speech recognition engine efficacy for mobile data entry. OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems: 2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS. editor / Robert Meersman ; Zahir Tari ; Pilar Herrero. Springer, 2008. pp. 519-527 (Lecture notes in computer science).
@inbook{41eea0b2e6ef47d89af44a33865f6a82,
title = "A comparison of microphone and speech recognition engine efficacy for mobile data entry",
abstract = "The research presented in this paper is part of an ongoing investigation into how best to incorporate speech-based input within mobile data collection applications. In our previous work [1], we evaluated the ability of a single speech recognition engine to support accurate, mobile, speech-based data input. Here, we build on our previous research to compare the achievable speaker-independent accuracy rates of a variety of speech recognition engines; we also consider the relative effectiveness of different speech recognition engine and microphone pairings in terms of their ability to support accurate text entry under realistic mobile conditions of use. Our intent is to provide some initial empirical data derived from mobile, user-based evaluations to support technological decisions faced by developers of mobile applications that would benefit from, or require, speech-based data entry facilities.",
author = "Jo Lumsden and Scott Durling and Irina Kondratova",
year = "2008",
doi = "10.1007/978-3-540-88875-8_75",
language = "English",
isbn = "978-3-540-88874-1",
series = "Lecture notes in computer science",
publisher = "Springer",
pages = "519--527",
editor = "Robert Meersman and Zahir Tari and Pilar Herrero",
booktitle = "OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems",
address = "Germany",

}

Lumsden, J, Durling, S & Kondratova, I 2008, A comparison of microphone and speech recognition engine efficacy for mobile data entry. in R Meersman, Z Tari & P Herrero (eds), OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems: 2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS. Lecture notes in computer science, Springer, pp. 519-527, OTM '08 OTM Confederated International Workshops and Posters on On the Move to Meaningful Internet Systems, United Kingdom, 1/01/08. https://doi.org/10.1007/978-3-540-88875-8_75

A comparison of microphone and speech recognition engine efficacy for mobile data entry. / Lumsden, Jo; Durling, Scott; Kondratova, Irina.

OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems: 2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS. ed. / Robert Meersman; Zahir Tari; Pilar Herrero. Springer, 2008. p. 519-527 (Lecture notes in computer science).

Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

TY - CHAP

T1 - A comparison of microphone and speech recognition engine efficacy for mobile data entry

AU - Lumsden, Jo

AU - Durling, Scott

AU - Kondratova, Irina

PY - 2008

Y1 - 2008

N2 - The research presented in this paper is part of an ongoing investigation into how best to incorporate speech-based input within mobile data collection applications. In our previous work [1], we evaluated the ability of a single speech recognition engine to support accurate, mobile, speech-based data input. Here, we build on our previous research to compare the achievable speaker-independent accuracy rates of a variety of speech recognition engines; we also consider the relative effectiveness of different speech recognition engine and microphone pairings in terms of their ability to support accurate text entry under realistic mobile conditions of use. Our intent is to provide some initial empirical data derived from mobile, user-based evaluations to support technological decisions faced by developers of mobile applications that would benefit from, or require, speech-based data entry facilities.

AB - The research presented in this paper is part of an ongoing investigation into how best to incorporate speech-based input within mobile data collection applications. In our previous work [1], we evaluated the ability of a single speech recognition engine to support accurate, mobile, speech-based data input. Here, we build on our previous research to compare the achievable speaker-independent accuracy rates of a variety of speech recognition engines; we also consider the relative effectiveness of different speech recognition engine and microphone pairings in terms of their ability to support accurate text entry under realistic mobile conditions of use. Our intent is to provide some initial empirical data derived from mobile, user-based evaluations to support technological decisions faced by developers of mobile applications that would benefit from, or require, speech-based data entry facilities.

UR - http://dl.acm.org/citation.cfm?id=1484422.1484525

U2 - 10.1007/978-3-540-88875-8_75

DO - 10.1007/978-3-540-88875-8_75

M3 - Chapter (peer-reviewed)

SN - 978-3-540-88874-1

T3 - Lecture notes in computer science

SP - 519

EP - 527

BT - OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems

A2 - Meersman, Robert

A2 - Tari, Zahir

A2 - Herrero, Pilar

PB - Springer

ER -

Lumsden J, Durling S, Kondratova I. A comparison of microphone and speech recognition engine efficacy for mobile data entry. In Meersman R, Tari Z, Herrero P, editors, OTM '08 proceedings of the OTM confederated international workshops and posters on on the move to meaningful internet systems: 2008 workshops : ADI, AWeSoMe, COMBEK, EI2N, IWSSA, MONET, OnToContent + QSI, ORM, PerSys, RDDS, SEMELS, and SWWS. Springer. 2008. p. 519-527. (Lecture notes in computer science). https://doi.org/10.1007/978-3-540-88875-8_75