Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs

Ria C Perkins*

*Corresponding author for this work

    Research output: Chapter in Book/Published conference outputEntry for encyclopedia/dictionary

    Abstract

    This chapter introduces Native Language Identification (NLID) and considers the casework applications with regard to authorship analysis of online material. It presents findings from research identifying which linguistic features were the best indicators of native (L1) Persian speakers blogging in English, and analyses how these features cope at distinguishing between native influences from languages that are linguistically and culturally related. The first chapter section outlines the area of Native Language Identification, and demonstrates its potential for application through a discussion of relevant case history. The next section discusses a development of methodology for identifying influence from L1 Persian in an anonymous blog author, and presents findings. The third part discusses the application of these features to casework situations as well as how the features identified can form an easily applicable model and demonstrates the application of this to casework. The research presented in this chapter can be considered a case study for the wider potential application of NLID.
    Original languageEnglish
    Title of host publicationNew Threats and Countermeasures in Digital Crime and Cyber Terrorism
    EditorsMaurice Dawson, Marwan Omar
    PublisherIGI Global
    Pages213-234
    Number of pages22
    ISBN (Electronic)9781466683464
    ISBN (Print)1466683457, 9781466683457
    DOIs
    Publication statusPublished - 30 Apr 2015

    Publication series

    NamePremier Reference Source
    PublisherIGI Global

    Fingerprint

    Dive into the research topics of 'Native Language Identification (NLID) for Forensic Authorship Analysis of Weblogs'. Together they form a unique fingerprint.

    Cite this