Comparing sentence-level features for authorship analysis in Portuguese

Research output: Chapter in Book/Report/Conference proceedingConference contribution

View graph of relations Save citation


Research units


In this paper we compare the robustness of several types of stylistic markers to help discriminate authorship at sentence level. We train a SVM-based classifier using each set of features separately and perform sentence-level authorship analysis over corpus of editorials published in a Portuguese quality newspaper. Results show that features based on POS information, punctuation and word / sentence length contribute to a more robust sentence-level authorship analysis.

Request a copy

Request a copy


Publication date23 Dec 2010
Publication titleComputational processing of the Portuguese language : 9th International Conference, PROPOR 2010, Porto Alegre, RS, Brazil, April 27-30, 2010. Proceedings
EditorsThiago Alexandre Salgueiro Pardo, António Branco, Aldebaro Klautau, et al
Place of PublicationBerlin (DE)
Number of pages4
ISBN (Electronic)978-3-642-12320-7
ISBN (Print)978-3-642-12319-1
Original languageEnglish
Event9th International Conference on Computational Processing of the Portuguese Language - Porto Alegre, RS, Brazil
Duration: 27 Apr 201030 Apr 2010

Publication series

NameLecture Notes in Computer Science
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference9th International Conference on Computational Processing of the Portuguese Language
Abbreviated titlePROPOR 2010
CityPorto Alegre, RS

Employable Graduates; Exploitable Research

Copy the text from this field...