Automatic detection of orthographic cues for cognate recognition

Andrea Mulloni, Viktor Pekar

Research output: Unpublished contribution to conferenceUnpublished Conference Paperpeer-review

26 Citations (Scopus)

Abstract

Present-day machine translation technologies crucially depend on the size and quality of lexical resources. Much of recent research in the area has been concerned with methods to build bilingual dictionaries automatically. In this paper we propose a methodology for the automatic detection of cognates between two languages based solely on the orthography of words. From a set of known cognates, the method induces rules capturing regularities of orthographic mutations that a word undergoes when migrating from one language into the other. The rules are then applied as a preprocessing step before measuring the orthographic similarity between putative cognates. As a result, the method allows to achieve an improvement in the F-measure of 11,86% in comparison with detecting cognates based only on the edit distance between them.

Original languageEnglish
Pages2387-2390
Number of pages4
Publication statusPublished - 30 May 2006
Event5th International Conference on Language Resources and Evaluation, LREC 2006 - Genoa, Italy
Duration: 22 May 200628 May 2006

Conference

Conference5th International Conference on Language Resources and Evaluation, LREC 2006
Country/TerritoryItaly
CityGenoa
Period22/05/0628/05/06

Fingerprint

Dive into the research topics of 'Automatic detection of orthographic cues for cognate recognition'. Together they form a unique fingerprint.

Cite this