Describing and communicating uncertainty within the semantic web

Matthew Williams, Dan Cornford, Lucy Bastin, Ben Ingram

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The Semantic Web relies on carefully structured, well-defined data to allow machines to communicate with and understand one another. In many domains (e.g. geospatial) the data being described contains some uncertainty, often due to incomplete knowledge; meaningful processing of this data requires these uncertainties to be carefully analysed and integrated into the process chain. Currently, within the Semantic Web there is no standard mechanism for interoperable description and exchange of uncertain information, which renders the automated processing of such information implausible, particularly where error must be considered and captured as it propagates through a processing sequence. In particular, we adopt a Bayesian perspective and focus on the case where the inputs and outputs are naturally treated as random variables. This paper discusses a solution to the problem in the form of the Uncertainty Markup Language (UncertML). UncertML is a conceptual model, realised as an XML schema, that allows uncertainty to be quantified in a variety of ways, i.e. realisations, statistics and probability distributions. UncertML is based upon a soft-typed XML schema design that provides a generic framework from which any statistic or distribution may be created. Making extensive use of Geography Markup Language (GML) dictionaries, UncertML provides a collection of definitions for common uncertainty types. Containing both written descriptions and mathematical functions, encoded as MathML, the definitions within these dictionaries provide a robust mechanism for defining any statistic or distribution and can be easily extended. Uniform Resource Identifiers (URIs) are used to introduce semantics to the soft-typed elements by linking to these dictionary definitions. The INTAMAP (INTeroperability and Automated MAPping) project provides a use case for UncertML.
This paper demonstrates how observation errors can be quantified using UncertML and wrapped within an Observations & Measurements (O&M) Observation. The interpolation service uses the information within these observations to influence the prediction outcome. The output uncertainties may be encoded in a variety of UncertML types, e.g. a series of marginal Gaussian distributions, a set of statistics such as the first three marginal moments, or a set of realisations from a Monte Carlo treatment. Quantifying and propagating uncertainty in this way allows such interpolation results to be consumed by other services. This could form part of a risk management chain or a decision support system, and ultimately paves the way for complex data processing chains in the Semantic Web.
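The three ways of quantifying uncertainty named in the abstract (probability distributions, statistics and realisations) can be illustrated with a minimal Python sketch. This is not the UncertML schema or any UncertML API: the class names `GaussianDistribution`, `Moments` and `Realisations`, the helper functions, and the example numbers are all hypothetical, chosen only to show how one uncertain value can be held in each form and pushed through a processing step by Monte Carlo sampling.

```python
import random
import statistics
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class GaussianDistribution:
    """Parametric form: a marginal Gaussian given by mean and variance."""
    mean: float
    variance: float


@dataclass(frozen=True)
class Moments:
    """Statistics form: the first three marginal moments."""
    mean: float
    variance: float
    skewness: float


@dataclass(frozen=True)
class Realisations:
    """Sample form: draws produced by a Monte Carlo treatment."""
    values: Tuple[float, ...]


def sample(dist: GaussianDistribution, n: int, seed: int = 0) -> Realisations:
    """Draw n realisations from the Gaussian (seeded for repeatability)."""
    rng = random.Random(seed)
    sd = dist.variance ** 0.5
    return Realisations(tuple(rng.gauss(dist.mean, sd) for _ in range(n)))


def summarise(real: Realisations) -> Moments:
    """Reduce realisations to mean, population variance and skewness."""
    xs = real.values
    mean = statistics.fmean(xs)
    variance = statistics.pvariance(xs, mu=mean)
    sd = variance ** 0.5
    skewness = (
        sum(((x - mean) / sd) ** 3 for x in xs) / len(xs) if sd > 0 else 0.0
    )
    return Moments(mean, variance, skewness)


def propagate(real: Realisations, step: Callable[[float], float]) -> Realisations:
    """Push every realisation through one processing step."""
    return Realisations(tuple(step(x) for x in real.values))


# Hypothetical observation: value 10.0 with error variance 4.0.
observation = GaussianDistribution(mean=10.0, variance=4.0)
draws = sample(observation, n=10_000)
# Hypothetical downstream step: a linear rescaling y = 2x + 1.
output_moments = summarise(propagate(draws, lambda x: 2.0 * x + 1.0))
print(output_moments)
```

For a linear step applied to a Gaussian input, the output moments should sit close to mean 21 and variance 16 with skewness near zero, which gives a quick sanity check on the sampling. A non-linear step would generally produce non-zero skewness, which is one reason a consumer might prefer the statistics or realisations form over a Gaussian summary.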
Original language: English
Title of host publication: Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web
Editors: Fernando Bobillo, Paulo C.G. da Costa, et al
Publisher: CEUR-WS.org
Publication status: Published - Dec 2008
Event: Uncertainty Reasoning for the Semantic Web Workshop, part of 7th International Semantic Web Conference - Karlsruhe, Germany
Duration: 26 Oct 2008 → …

Publication series

Name: CEUR Workshop Proceedings
ISSN: 1613-0073
Volume: 423

Workshop

Workshop: Uncertainty Reasoning for the Semantic Web Workshop, part of 7th International Semantic Web Conference
Abbreviated title: URSW '08
Country: Germany
City: Karlsruhe
Period: 26/10/08 → …

Fingerprint

Semantic Web
Markup languages
Glossaries
Statistics
XML
Uncertainty
Interpolation
Processing
Gaussian distribution
Decision support systems
Risk management
Random variables
Interoperability
Probability distributions
Semantics

Bibliographical note

Williams, Matthew; Cornford, Dan; Bastin, Lucy; Ingram, Ben : Describing and communicating uncertainty within the semantic web. Proc. of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web, 2008, ceur-ws.org/Vol-423/paper3.pdf

Keywords

  • semantic web
  • uncertainty
  • interoperable description
  • exchange of uncertain information
  • error
  • Bayesian perspective
  • random variables
  • Uncertainty Markup Language
  • UncertML
  • soft-typed XML schema design
  • Geography Markup Language
  • MathML

Cite this

Williams, M., Cornford, D., Bastin, L., & Ingram, B. (2008). Describing and communicating uncertainty within the semantic web. In F. Bobillo, P. C. G. da Costa, & et al (Eds.), Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web (CEUR Workshop Proceedings; Vol. 423). CEUR-WS.org.
Williams, Matthew ; Cornford, Dan ; Bastin, Lucy ; Ingram, Ben. / Describing and communicating uncertainty within the semantic web. Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web. editor / Fernando Bobillo ; Paulo C.G. da Costa ; et al. CEUR-WS.org, 2008. (CEUR Workshop Proceedings).
@inproceedings{e1208e84b7c641448156f637184e55dd,
title = "Describing and communicating uncertainty within the semantic web",
keywords = "semantic web, uncertainty, interoperable description, exchange of uncertain information, error, Bayesian perspective, random variables, Uncertainty Markup Language, UncertML, soft-typed XML schema design, Geography Markup Language, MathML",
author = "Matthew Williams and Dan Cornford and Lucy Bastin and Ben Ingram",
note = "Williams, Matthew; Cornford, Dan; Bastin, Lucy; Ingram, Ben : Describing and communicating uncertainty within the semantic web. Proc. of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web, 2008, ceur-ws.org/Vol-423/paper3.pdf",
year = "2008",
month = "12",
language = "English",
series = "CEUR Workshop Proceedings",
publisher = "CEUR-WS.org",
editor = "Fernando Bobillo and {da Costa}, {Paulo C.G.} and others",
booktitle = "Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web",

}

Williams, M, Cornford, D, Bastin, L & Ingram, B 2008, Describing and communicating uncertainty within the semantic web. in F Bobillo, PCG da Costa & et al (eds), Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web. CEUR Workshop Proceedings, vol. 423, CEUR-WS.org, Uncertainty Reasoning for the Semantic Web Workshop, part of 7th International Semantic Web Conference, Karlsruhe, Germany, 26/10/08.

Describing and communicating uncertainty within the semantic web. / Williams, Matthew; Cornford, Dan; Bastin, Lucy; Ingram, Ben.

Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web. ed. / Fernando Bobillo; Paulo C.G. da Costa; et al. CEUR-WS.org, 2008. (CEUR Workshop Proceedings; Vol. 423).


TY - GEN

T1 - Describing and communicating uncertainty within the semantic web

AU - Williams, Matthew

AU - Cornford, Dan

AU - Bastin, Lucy

AU - Ingram, Ben

N1 - Williams, Matthew; Cornford, Dan; Bastin, Lucy; Ingram, Ben : Describing and communicating uncertainty within the semantic web. Proc. of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web, 2008, ceur-ws.org/Vol-423/paper3.pdf

PY - 2008/12

Y1 - 2008/12

KW - semantic web

KW - uncertainty

KW - interoperable description

KW - exchange of uncertain information

KW - error

KW - Bayesian perspective

KW - random variables

KW - Uncertainty Markup Language

KW - UncertML

KW - soft-typed XML schema design

KW - Geography Markup Language

KW - MathML

UR - http://www.scopus.com/inward/record.url?scp=84864854660&partnerID=8YFLogxK

M3 - Conference contribution

T3 - CEUR Workshop Proceedings

BT - Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web

A2 - Bobillo, Fernando

A2 - da Costa, Paulo C.G.

A2 - et al,

PB - CEUR-WS.org

ER -

Williams M, Cornford D, Bastin L, Ingram B. Describing and communicating uncertainty within the semantic web. In Bobillo F, da Costa PCG, et al, editors, Proceedings of the Fourth International Workshop on Uncertainty Reasoning for the Semantic Web. CEUR-WS.org. 2008. (CEUR Workshop Proceedings).