The usability of speech and/or gestures in multi-modal interface systems

Farzana Alibay, Manolya Kavakli, Jean Rémy Chardonnet, Muhammad Zeeshan Baig

Research output: Chapter in Book/Published conference outputConference publication

Abstract

Multi-Modal Interface Systems (MMIS) have proliferated in the last few decades, since they provide a direct interface for both Human Computer Interaction (HCI) and face-to-face communication. Our aim is to provide users without any prior 3D modelling experience, with a multi-modal interface to create a 3D object. The system also incorporates help throughout the drawing process and identifies simple words and gestures to accomplish a range of (simple to complex) modeling tasks. We have developed a multi-modal interface that allows users to design objects in 3D, using AutoCAD commands as well as speech and gesture. We have used a microphone to collect speech input and a Leap Motion sensor to collect gesture input in real time. Two sets of experiments were conducted to investigate the usability of the system and evaluate the system performance using Leap Motion versus keyboard and mouse. Our results indicate that performing a task using speech is perceived exhausting, when there is no shared vocabulary between man and machine, and the usability of traditional input devices supersedes the usability of speech and gestures. Only a small ratio of participants, less than 7% in our experiments were able to carry out the tasks with appropriate precision.

Original languageEnglish
Title of host publicationProceedings of 2017 9th International Conference on Computer and Automation Engineering, ICCAE 2017
PublisherACM
Pages73-77
Number of pages5
ISBN (Electronic)9781450348096
DOIs
Publication statusPublished - 18 Feb 2017
Event9th International Conference on Computer and Automation Engineering, ICCAE 2017 - Sydney, Australia
Duration: 18 Feb 201721 Feb 2017

Publication series

NameACM International Conference Proceeding Series
VolumePart F127852

Conference

Conference9th International Conference on Computer and Automation Engineering, ICCAE 2017
Country/TerritoryAustralia
CitySydney
Period18/02/1721/02/17

Bibliographical note

Publisher Copyright:
© 2017 ACM.

Keywords

  • 3D object
  • Emotion recognition
  • Gesture
  • Kinect
  • Leap motion
  • Semantics
  • Speech

Fingerprint

Dive into the research topics of 'The usability of speech and/or gestures in multi-modal interface systems'. Together they form a unique fingerprint.

Cite this