Introduction: Innovation in spoken corpus linguistics

Research output: Contribution to journalArticlepeer-review

Abstract

Over the decades, technological advancements have substantially improved the efficiency and scope of spoken corpus compilation, but there remain many challenges ––both practical and theoretical–– that constrain 1) the quality of spoken corpus data, 2) the scale to which spoken corpora can be compiled, and 3) the authenticity with which spoken language is represented in textual form. This special issue presents eight studies which address contemporary innovations in spoken corpus design, data collection, processing, and analysis, covering a range of speech contexts and varieties. The studies focus on registers including online workplace meetings, casual conversation, oral histories, oral proficiency interviews, and YouTube vlogs. Innovations include the integration of automated transcription tools, multimodal annotation schemes, creative participant recruitment methods, and developments in natural language processing (NLP). Three contributions offer critical reconceptualisations of traditional approaches to spoken corpus design, proposing strategies to improve the authenticity of spoken corpora.

Original languageEnglish
Pages (from-to)i-viii
Number of pages8
JournalResearch in Corpus Linguistics
Volume12
Issue number2
DOIs
Publication statusPublished - 24 Oct 2024

Bibliographical note

Copyright © 2024 The Author(s). This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Keywords

  • corpus construction
  • corpus design
  • representativeness
  • spoken corpora
  • transcription

Fingerprint

Dive into the research topics of 'Introduction: Innovation in spoken corpus linguistics'. Together they form a unique fingerprint.

Cite this