Comparing sentence-level features for authorship analysis in Portuguese

Research output: Chapter in Book/Report/Conference proceedingConference contribution

View graph of relations Save citation

Authors

Research units

Abstract

In this paper we compare the robustness of several types of stylistic markers to help discriminate authorship at sentence level. We train a SVM-based classifier using each set of features separately and perform sentence-level authorship analysis over corpus of editorials published in a Portuguese quality newspaper. Results show that features based on POS information, punctuation and word / sentence length contribute to a more robust sentence-level authorship analysis. © Springer-Verlag Berlin Heidelberg 2010.

Request a copy

Request a copy

Details

Publication date23 Dec 2010
Publication titleComputational processing of the Portuguese language : 9th International Conference, PROPOR 2010, Porto Alegre, RS, Brazil, April 27-30, 2010. Proceedings
EditorsThiago Alexandre Salgueiro Pardo, António Branco, Aldebaro Klautau, et al
Place of PublicationBerlin (DE)
PublisherSpringer
Pages51-54
Number of pages4
ISBN (Electronic)978-3-642-12320-7
ISBN (Print)978-3-642-12319-1
Original languageEnglish
Event9th International Conference on Computational Processing of the Portuguese Language - Porto Alegre, RS, Brazil

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume6001
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Computational Processing of the Portuguese Language
Abbreviated titlePROPOR 2010
CountryBrazil
CityPorto Alegre, RS
Period27/04/1030/04/10

DOI

Employable Graduates; Exploitable Research

Copy the text from this field...