Feature Extraction for Automatic Recognition of Telephone Speech

  • Allan J. Putman

Student thesis: Doctoral ThesisDoctor of Philosophy

Abstract

A speech recognition system can be considered as containing a feature extractor and a word classifier. The research described in this thesis concerns a feature extraction system for transforming telephone speech into strings of symbols, suitable for use by a word classifier. The hypothesis upon which the feature extractor is based is that measurements of the relative changes in the speech data form a useful basis for the description of words. This was formed from present day knowledge of speech perception, production and analysis. Features produced by this system are a function of both time and the relative changes detected in an n-dimensional space, defined by an analysis stage of the process.

The outputs of fourteen bandpass filters are used to specify a point in 14-dimensional space. For each 10 ms spectrum sample the angle and distance are measured from two reference points in the 14-dimensional space, to produce two waveforms for the complete word. These are segmented according to the sign of their slope. The total changes that occur between segment boundaries are used to signify particular feature symbols. Thus two feature strings are produced to describe each word.

Results of tests with the experimental system confirmed that distinctive descriptions can be obtained for words spoken in isolation, over a simulated limiting telephone connection and with a single speaker. They also demonstrated that these features are consistent for a number of repetitions of a particular word.

The advantages of this technique are that segmentation and classification at the signal level are simplified and do not require prior knowledge of the speech sound characteristics within the language being used. Also, as only relative measurements are used, any changes in the characteristics of the transmission medium should not affect the result.
Date of Award1980
Original languageEnglish
Awarding Institution
  • Aston University

Keywords

  • feature extraction
  • automatic recognition
  • telephone speech

Cite this

'