A framework for automated construction of resource space based on background knowledge

Xu Yu, Li Peng, Zhixing Huang*, Hai Zhuge

*Corresponding author for this work

Research output: Contribution to journalSpecial issuepeer-review

Abstract

Resource Space Model is a kind of data model which can effectively and flexibly manage the digital resources in cyber-physical system from multidimensional and hierarchical perspectives. This paper focuses on constructing resource space automatically. We propose a framework that organizes a set of digital resources according to different semantic dimensions combining human background knowledge in WordNet and Wikipedia. The construction process includes four steps: extracting candidate keywords, building semantic graphs, detecting semantic communities and generating resource space. An unsupervised statistical language topic model (i.e., Latent Dirichlet Allocation) is applied to extract candidate keywords of the facets. To better interpret meanings of the facets found by LDA, we map the keywords to Wikipedia concepts, calculate word relatedness using WordNet's noun synsets and construct corresponding semantic graphs. Moreover, semantic communities are identified by GN algorithm. After extracting candidate axes based on Wikipedia concept hierarchy, the final axes of resource space are sorted and picked out through three different ranking strategies. The experimental results demonstrate that the proposed framework can organize resources automatically and effectively.

Original languageEnglish
Pages (from-to)222-231
Number of pages10
JournalFuture Generation Computer Systems
Volume32
Early online date5 Aug 2013
DOIs
Publication statusPublished - Mar 2014

Keywords

  • latent Dirichlet allocation
  • resource space model
  • semantic graph
  • Wikipedia

Fingerprint

Dive into the research topics of 'A framework for automated construction of resource space based on background knowledge'. Together they form a unique fingerprint.

Cite this