Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs

Research output: Chapter in Book/Report/Conference proceedingEntry for encyclopedia/dictionary

Abstract

This chapter introduces Native Language Identification (NLID) and considers the casework applications with regard to authorship analysis of online material. It presents findings from research identifying which linguistic features were the best indicators of native (L1) Persian speakers blogging in English, and analyses how these features cope at distinguishing between native influences from languages that are linguistically and culturally related. The first chapter section outlines the area of Native Language Identification, and demonstrates its potential for application through a discussion of relevant case history. The next section discusses a development of methodology for identifying influence from L1 Persian in an anonymous blog author, and presents findings. The third part discusses the application of these features to casework situations as well as how the features identified can form an easily applicable model and demonstrates the application of this to casework. The research presented in this chapter can be considered a case study for the wider potential application of NLID.
Original languageEnglish
Title of host publicationNew Threats and Countermeasures in Digital Crime and Cyber Terrorism
EditorsMaurice Dawson, Marwan Omar
PublisherIGI Global
Pages213-234
Number of pages22
ISBN (Electronic)9781466683464
ISBN (Print)1466683457, 9781466683457
DOIs
Publication statusPublished - 30 Apr 2015

Publication series

NamePremier Reference Source
PublisherIGI Global

Fingerprint

weblog
casework
language
Blogs
Linguistics
linguistics
methodology

Cite this

Perkins, R. C. (2015). Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs. In M. Dawson, & M. Omar (Eds.), New Threats and Countermeasures in Digital Crime and Cyber Terrorism (pp. 213-234). (Premier Reference Source). IGI Global. https://doi.org/10.4018/978-1-4666-8345-7.ch012, https://doi.org/10.4018/978-1-4666-8345-7.ch012
Perkins, Ria C. / Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs. New Threats and Countermeasures in Digital Crime and Cyber Terrorism. editor / Maurice Dawson ; Marwan Omar. IGI Global, 2015. pp. 213-234 (Premier Reference Source).
@inbook{866b8933dfac430bacc4c96295999abb,
title = "Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs",
abstract = "This chapter introduces Native Language Identification (NLID) and considers the casework applications with regard to authorship analysis of online material. It presents findings from research identifying which linguistic features were the best indicators of native (L1) Persian speakers blogging in English, and analyses how these features cope at distinguishing between native influences from languages that are linguistically and culturally related. The first chapter section outlines the area of Native Language Identification, and demonstrates its potential for application through a discussion of relevant case history. The next section discusses a development of methodology for identifying influence from L1 Persian in an anonymous blog author, and presents findings. The third part discusses the application of these features to casework situations as well as how the features identified can form an easily applicable model and demonstrates the application of this to casework. The research presented in this chapter can be considered a case study for the wider potential application of NLID.",
author = "Perkins, {Ria C}",
year = "2015",
month = "4",
day = "30",
doi = "10.4018/978-1-4666-8345-7.ch012",
language = "English",
isbn = "1466683457",
series = "Premier Reference Source",
publisher = "IGI Global",
pages = "213--234",
editor = "Maurice Dawson and Marwan Omar",
booktitle = "New Threats and Countermeasures in Digital Crime and Cyber Terrorism",
address = "United States",

}

Perkins, RC 2015, Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs. in M Dawson & M Omar (eds), New Threats and Countermeasures in Digital Crime and Cyber Terrorism. Premier Reference Source, IGI Global, pp. 213-234. https://doi.org/10.4018/978-1-4666-8345-7.ch012, https://doi.org/10.4018/978-1-4666-8345-7.ch012

Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs. / Perkins, Ria C.

New Threats and Countermeasures in Digital Crime and Cyber Terrorism. ed. / Maurice Dawson; Marwan Omar. IGI Global, 2015. p. 213-234 (Premier Reference Source).

Research output: Chapter in Book/Report/Conference proceedingEntry for encyclopedia/dictionary

TY - CHAP

T1 - Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs

AU - Perkins, Ria C

PY - 2015/4/30

Y1 - 2015/4/30

N2 - This chapter introduces Native Language Identification (NLID) and considers the casework applications with regard to authorship analysis of online material. It presents findings from research identifying which linguistic features were the best indicators of native (L1) Persian speakers blogging in English, and analyses how these features cope at distinguishing between native influences from languages that are linguistically and culturally related. The first chapter section outlines the area of Native Language Identification, and demonstrates its potential for application through a discussion of relevant case history. The next section discusses a development of methodology for identifying influence from L1 Persian in an anonymous blog author, and presents findings. The third part discusses the application of these features to casework situations as well as how the features identified can form an easily applicable model and demonstrates the application of this to casework. The research presented in this chapter can be considered a case study for the wider potential application of NLID.

AB - This chapter introduces Native Language Identification (NLID) and considers the casework applications with regard to authorship analysis of online material. It presents findings from research identifying which linguistic features were the best indicators of native (L1) Persian speakers blogging in English, and analyses how these features cope at distinguishing between native influences from languages that are linguistically and culturally related. The first chapter section outlines the area of Native Language Identification, and demonstrates its potential for application through a discussion of relevant case history. The next section discusses a development of methodology for identifying influence from L1 Persian in an anonymous blog author, and presents findings. The third part discusses the application of these features to casework situations as well as how the features identified can form an easily applicable model and demonstrates the application of this to casework. The research presented in this chapter can be considered a case study for the wider potential application of NLID.

UR - http://www.scopus.com/inward/record.url?scp=84958252560&partnerID=8YFLogxK

U2 - 10.4018/978-1-4666-8345-7.ch012

DO - 10.4018/978-1-4666-8345-7.ch012

M3 - Entry for encyclopedia/dictionary

SN - 1466683457

SN - 9781466683457

T3 - Premier Reference Source

SP - 213

EP - 234

BT - New Threats and Countermeasures in Digital Crime and Cyber Terrorism

A2 - Dawson, Maurice

A2 - Omar, Marwan

PB - IGI Global

ER -

Perkins RC. Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs. In Dawson M, Omar M, editors, New Threats and Countermeasures in Digital Crime and Cyber Terrorism. IGI Global. 2015. p. 213-234. (Premier Reference Source). https://doi.org/10.4018/978-1-4666-8345-7.ch012, https://doi.org/10.4018/978-1-4666-8345-7.ch012