Guiding local regression using visualisation

Dharmesh M. Maniyar, Ian T. Nabney

Research output: Chapter in Book/Published conference outputConference publication

Abstract

Solving many scientific problems requires effective regression and/or classification models for large high-dimensional datasets. Experts from these problem domains (e.g. biologists, chemists, financial analysts) have insights into the domain which can be helpful in developing powerful models but they need a modelling framework that helps them to use these insights. Data visualisation is an effective technique for presenting data and requiring feedback from the experts. A single global regression model can rarely capture the full behavioural variability of a huge multi-dimensional dataset. Instead, local regression models, each focused on a separate area of input space, often work better since the behaviour of different areas may vary. Classical local models such as Mixture of Experts segment the input space automatically, which is not always effective and it also lacks involvement of the domain experts to guide a meaningful segmentation of the input space. In this paper we addresses this issue by allowing domain experts to interactively segment the input space using data visualisation. The segmentation output obtained is then further used to develop effective local regression models.
Original languageEnglish
Title of host publicationDeterministic and statistical methods in machine learning
EditorsJoab Winkler, Mahesan Niranjan, Neil Lawrence
Place of PublicationBerlin (DE)
PublisherSpringer
Pages98-109
Number of pages12
ISBN (Electronic)978-3-540-31728-9
ISBN (Print)3-540-29073-7, 978-3-540-29073-5
DOIs
Publication statusPublished - Dec 2005
Event1st International Workshop on Deterministic and Statistical Methods in Machine Learning - Sheffield, United Kingdom
Duration: 7 Sept 200410 Sept 2004

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume3635
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Workshop

Workshop1st International Workshop on Deterministic and Statistical Methods in Machine Learning
Country/TerritoryUnited Kingdom
CitySheffield
Period7/09/0410/09/04

Keywords

  • regression models
  • classification models
  • large high-dimensional datasets

Fingerprint

Dive into the research topics of 'Guiding local regression using visualisation'. Together they form a unique fingerprint.

Cite this