Corpus-driven lexicography

Ramesh Krishnamurthy

Research output: Contribution to journalArticle

Abstract

This paper discusses three important aspects of John Sinclair’s legacy: the corpus, lexicography, and the notion of ‘corpus-driven’. The corpus represents his concern with the nature of linguistic evidence. Lexicography is for him the canonical mode of language description at the lexical level. And his belief that the corpus should ‘drive’ the description is reflected in his constant attempts to utilize the emergent computer technologies to automate the initial stages of analysis and defer the intuitive, interpretative contributions of linguists to increasingly later stages in the process. Sinclair’s model of corpus-driven lexicography has spread far beyond its initial implementation at Cobuild, to most EFL dictionaries, to native-speaker dictionaries (e.g. the New Oxford Dictionary of English, and many national language dictionaries in emerging or re-emerging speech communities) and bilingual dictionaries (e.g. Collins, Oxford-Hachette).
Original languageEnglish
Pages (from-to)231-242
Number of pages12
JournalInternational Journal of Lexicography
Volume21
Issue number3
DOIs
Publication statusPublished - Sep 2008

Fingerprint

dictionary
Language
Linguistics
language
Lexicography
Technology
linguistics
community
evidence
Dictionary

Bibliographical note

This is an electronic version of an article published by Oxford University Press, in International Journal of Lexicography, volume 21(3), Pg 231-242.

Keywords

  • lexicography
  • linguistic
  • technologies
  • dictionaries
  • native

Cite this

Krishnamurthy, Ramesh. / Corpus-driven lexicography. In: International Journal of Lexicography. 2008 ; Vol. 21, No. 3. pp. 231-242.
@article{8f9751b57ca046e6a74a6f92219782e8,
title = "Corpus-driven lexicography",
abstract = "This paper discusses three important aspects of John Sinclair’s legacy: the corpus, lexicography, and the notion of ‘corpus-driven’. The corpus represents his concern with the nature of linguistic evidence. Lexicography is for him the canonical mode of language description at the lexical level. And his belief that the corpus should ‘drive’ the description is reflected in his constant attempts to utilize the emergent computer technologies to automate the initial stages of analysis and defer the intuitive, interpretative contributions of linguists to increasingly later stages in the process. Sinclair’s model of corpus-driven lexicography has spread far beyond its initial implementation at Cobuild, to most EFL dictionaries, to native-speaker dictionaries (e.g. the New Oxford Dictionary of English, and many national language dictionaries in emerging or re-emerging speech communities) and bilingual dictionaries (e.g. Collins, Oxford-Hachette).",
keywords = "lexicography, linguistic, technologies, dictionaries, native",
author = "Ramesh Krishnamurthy",
note = "This is an electronic version of an article published by Oxford University Press, in International Journal of Lexicography, volume 21(3), Pg 231-242.",
year = "2008",
month = "9",
doi = "10.1093/ijl/ecn028",
language = "English",
volume = "21",
pages = "231--242",
journal = "International Journal of Lexicography",
issn = "0950-3846",
publisher = "Oxford University Press",
number = "3",

}

Corpus-driven lexicography. / Krishnamurthy, Ramesh.

In: International Journal of Lexicography, Vol. 21, No. 3, 09.2008, p. 231-242.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Corpus-driven lexicography

AU - Krishnamurthy, Ramesh

N1 - This is an electronic version of an article published by Oxford University Press, in International Journal of Lexicography, volume 21(3), Pg 231-242.

PY - 2008/9

Y1 - 2008/9

N2 - This paper discusses three important aspects of John Sinclair’s legacy: the corpus, lexicography, and the notion of ‘corpus-driven’. The corpus represents his concern with the nature of linguistic evidence. Lexicography is for him the canonical mode of language description at the lexical level. And his belief that the corpus should ‘drive’ the description is reflected in his constant attempts to utilize the emergent computer technologies to automate the initial stages of analysis and defer the intuitive, interpretative contributions of linguists to increasingly later stages in the process. Sinclair’s model of corpus-driven lexicography has spread far beyond its initial implementation at Cobuild, to most EFL dictionaries, to native-speaker dictionaries (e.g. the New Oxford Dictionary of English, and many national language dictionaries in emerging or re-emerging speech communities) and bilingual dictionaries (e.g. Collins, Oxford-Hachette).

AB - This paper discusses three important aspects of John Sinclair’s legacy: the corpus, lexicography, and the notion of ‘corpus-driven’. The corpus represents his concern with the nature of linguistic evidence. Lexicography is for him the canonical mode of language description at the lexical level. And his belief that the corpus should ‘drive’ the description is reflected in his constant attempts to utilize the emergent computer technologies to automate the initial stages of analysis and defer the intuitive, interpretative contributions of linguists to increasingly later stages in the process. Sinclair’s model of corpus-driven lexicography has spread far beyond its initial implementation at Cobuild, to most EFL dictionaries, to native-speaker dictionaries (e.g. the New Oxford Dictionary of English, and many national language dictionaries in emerging or re-emerging speech communities) and bilingual dictionaries (e.g. Collins, Oxford-Hachette).

KW - lexicography

KW - linguistic

KW - technologies

KW - dictionaries

KW - native

UR - http://www.scopus.com/inward/record.url?scp=50949094741&partnerID=8YFLogxK

UR - http://ijl.oxfordjournals.org/content/21/3/231

U2 - 10.1093/ijl/ecn028

DO - 10.1093/ijl/ecn028

M3 - Article

VL - 21

SP - 231

EP - 242

JO - International Journal of Lexicography

JF - International Journal of Lexicography

SN - 0950-3846

IS - 3

ER -