Analyzing lexical emergence in modern American English online

Jack Grieve*, Andrea Nini, Diansheng Guo

*Corresponding author for this work

Research output: Contribution to journalArticle

Abstract

This paper introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method based on a multi-billion word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of speech, and word formations processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.
Original languageEnglish
Pages (from-to)99-127
Number of pages29
JournalEnglish Language and Linguistics
Volume21
Issue number1
Early online date25 May 2016
DOIs
Publication statusPublished - 1 Mar 2017

Fingerprint

quantitative method
social media
semantics
language
Word Forms
American English
time
Formation Process
Social Media
Semantic Classes
Part of Speech
Quantitative Methods
Word Formation
Natural Language

Bibliographical note

CORRIGENDUM: In the above mentioned article by Grieve, Nini & Guo, an error has occurred in the section numbering. Section 4 is missing and has been mistakenly labelled with section 5. All sections and subsections labelled section 5, should be section 4. Which means section 6 should be renamed section 5. DOI: http://dx.doi.org/10.1017/S1360674316000526

Cite this

Grieve, Jack ; Nini, Andrea ; Guo, Diansheng. / Analyzing lexical emergence in modern American English online. In: English Language and Linguistics. 2017 ; Vol. 21, No. 1. pp. 99-127.
@article{cc22942705b241059b30c844a73f9e37,
title = "Analyzing lexical emergence in modern American English online",
abstract = "This paper introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method based on a multi-billion word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of speech, and word formations processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.",
author = "Jack Grieve and Andrea Nini and Diansheng Guo",
note = "CORRIGENDUM: In the above mentioned article by Grieve, Nini & Guo, an error has occurred in the section numbering. Section 4 is missing and has been mistakenly labelled with section 5. All sections and subsections labelled section 5, should be section 4. Which means section 6 should be renamed section 5. DOI: http://dx.doi.org/10.1017/S1360674316000526",
year = "2017",
month = "3",
day = "1",
doi = "10.1017/S1360674316000113",
language = "English",
volume = "21",
pages = "99--127",
journal = "English Language and Linguistics",
issn = "1360-6743",
publisher = "Cambridge University Press",
number = "1",

}

Analyzing lexical emergence in modern American English online. / Grieve, Jack; Nini, Andrea; Guo, Diansheng.

In: English Language and Linguistics, Vol. 21, No. 1, 01.03.2017, p. 99-127.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Analyzing lexical emergence in modern American English online

AU - Grieve, Jack

AU - Nini, Andrea

AU - Guo, Diansheng

N1 - CORRIGENDUM: In the above mentioned article by Grieve, Nini & Guo, an error has occurred in the section numbering. Section 4 is missing and has been mistakenly labelled with section 5. All sections and subsections labelled section 5, should be section 4. Which means section 6 should be renamed section 5. DOI: http://dx.doi.org/10.1017/S1360674316000526

PY - 2017/3/1

Y1 - 2017/3/1

N2 - This paper introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method based on a multi-billion word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of speech, and word formations processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.

AB - This paper introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method based on a multi-billion word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of speech, and word formations processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.

UR - http://www.scopus.com/inward/record.url?scp=84969932258&partnerID=8YFLogxK

UR - https://www.cambridge.org/core/journals/english-language-and-linguistics/article/analyzing-lexical-emergence-in-modern-american-english-online-1/73E2D917856BE39ACD9EE3789E2BE597

U2 - 10.1017/S1360674316000113

DO - 10.1017/S1360674316000113

M3 - Article

AN - SCOPUS:84969932258

VL - 21

SP - 99

EP - 127

JO - English Language and Linguistics

JF - English Language and Linguistics

SN - 1360-6743

IS - 1

ER -