Issues in learning an ontology from text

Christopher Brewster, Simon Jupp, Joanne Luciano, David Shotton, Robert Stevens, Ziqi Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Ontology construction for any domain is a labour intensive and complex process. Any methodology that can reduce the cost and increase efficiency has the potential to make a major impact in the life sciences. This paper describes an experiment in ontology construction from text for the Animal Behaviour domain. Our objective was to see how much could be done in a simple and rapid manner using a corpus of journal papers. We used a sequence of text processing steps, and describe the different choices made to clean the input, to derive a set of terms and to structure those terms in a hierarchy. We were able in a very short space of time to construct a 17000 term ontology with a high percentage of suitable terms. We describe some of the challenges, especially
that of focusing the ontology appropriately given a starting point of a heterogeneous corpus.
Original languageEnglish
Title of host publicationThe 11th Annual Bio-Ontologies Meeting
EditorsPhillip Lord, Nigam Shah, Susanna-Assunta Sansone, Matthew Cockerill
Number of pages4
Publication statusPublished - 2008
Event11th Annual Bio-Ontologies Meeting - Toronto, Canada
Duration: 20 Jul 2008 → …

Conference

Conference11th Annual Bio-Ontologies Meeting
CountryCanada
CityToronto
Period20/07/08 → …

Fingerprint

Ontology
Text processing
Animals
Personnel
Costs
Experiments

Cite this

Brewster, C., Jupp, S., Luciano, J., Shotton, D., Stevens, R., & Zhang, Z. (2008). Issues in learning an ontology from text. In P. Lord, N. Shah, S-A. Sansone, & M. Cockerill (Eds.), The 11th Annual Bio-Ontologies Meeting
Brewster, Christopher ; Jupp, Simon ; Luciano, Joanne ; Shotton, David ; Stevens, Robert ; Zhang, Ziqi. / Issues in learning an ontology from text. The 11th Annual Bio-Ontologies Meeting. editor / Phillip Lord ; Nigam Shah ; Susanna-Assunta Sansone ; Matthew Cockerill. 2008.
@inproceedings{c87dfbdd2fe445b894d1f7f58e9c245a,
title = "Issues in learning an ontology from text",
abstract = "Ontology construction for any domain is a labour intensive and complex process. Any methodology that can reduce the cost and increase efficiency has the potential to make a major impact in the life sciences. This paper describes an experiment in ontology construction from text for the Animal Behaviour domain. Our objective was to see how much could be done in a simple and rapid manner using a corpus of journal papers. We used a sequence of text processing steps, and describe the different choices made to clean the input, to derive a set of terms and to structure those terms in a hierarchy. We were able in a very short space of time to construct a 17000 term ontology with a high percentage of suitable terms. We describe some of the challenges, especiallythat of focusing the ontology appropriately given a starting point of a heterogeneous corpus.",
author = "Christopher Brewster and Simon Jupp and Joanne Luciano and David Shotton and Robert Stevens and Ziqi Zhang",
year = "2008",
language = "English",
editor = "Phillip Lord and Nigam Shah and Susanna-Assunta Sansone and Matthew Cockerill",
booktitle = "The 11th Annual Bio-Ontologies Meeting",

}

Brewster, C, Jupp, S, Luciano, J, Shotton, D, Stevens, R & Zhang, Z 2008, Issues in learning an ontology from text. in P Lord, N Shah, S-A Sansone & M Cockerill (eds), The 11th Annual Bio-Ontologies Meeting. 11th Annual Bio-Ontologies Meeting, Toronto, Canada, 20/07/08.

Issues in learning an ontology from text. / Brewster, Christopher; Jupp, Simon; Luciano, Joanne; Shotton, David; Stevens, Robert; Zhang, Ziqi.

The 11th Annual Bio-Ontologies Meeting. ed. / Phillip Lord; Nigam Shah; Susanna-Assunta Sansone; Matthew Cockerill. 2008.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Issues in learning an ontology from text

AU - Brewster, Christopher

AU - Jupp, Simon

AU - Luciano, Joanne

AU - Shotton, David

AU - Stevens, Robert

AU - Zhang, Ziqi

PY - 2008

Y1 - 2008

N2 - Ontology construction for any domain is a labour intensive and complex process. Any methodology that can reduce the cost and increase efficiency has the potential to make a major impact in the life sciences. This paper describes an experiment in ontology construction from text for the Animal Behaviour domain. Our objective was to see how much could be done in a simple and rapid manner using a corpus of journal papers. We used a sequence of text processing steps, and describe the different choices made to clean the input, to derive a set of terms and to structure those terms in a hierarchy. We were able in a very short space of time to construct a 17000 term ontology with a high percentage of suitable terms. We describe some of the challenges, especiallythat of focusing the ontology appropriately given a starting point of a heterogeneous corpus.

AB - Ontology construction for any domain is a labour intensive and complex process. Any methodology that can reduce the cost and increase efficiency has the potential to make a major impact in the life sciences. This paper describes an experiment in ontology construction from text for the Animal Behaviour domain. Our objective was to see how much could be done in a simple and rapid manner using a corpus of journal papers. We used a sequence of text processing steps, and describe the different choices made to clean the input, to derive a set of terms and to structure those terms in a hierarchy. We were able in a very short space of time to construct a 17000 term ontology with a high percentage of suitable terms. We describe some of the challenges, especiallythat of focusing the ontology appropriately given a starting point of a heterogeneous corpus.

M3 - Conference contribution

BT - The 11th Annual Bio-Ontologies Meeting

A2 - Lord, Phillip

A2 - Shah, Nigam

A2 - Sansone, Susanna-Assunta

A2 - Cockerill, Matthew

ER -

Brewster C, Jupp S, Luciano J, Shotton D, Stevens R, Zhang Z. Issues in learning an ontology from text. In Lord P, Shah N, Sansone S-A, Cockerill M, editors, The 11th Annual Bio-Ontologies Meeting. 2008