Canary: Extracting Requirements-Related Information from Online Discussions

Peter Sawyer; Georgi  Kanchev; Amit Chopra; Pradeep Murukannaiah

doi:10.1109/RE.2017.83

Canary: Extracting Requirements-Related Information from Online Discussions

Peter Sawyer, Georgi Kanchev, Amit Chopra, Pradeep Murukannaiah

College of Engineering and Physical Sciences

Research output: Chapter in Book/Published conference output › Conference publication

Abstract

Online discussions about software applications generate a large amount of requirements-related information. This information can potentially be usefully applied in requirements engineering; however currently, there are few systematic approaches for extracting such information. To address this gap, we propose Canary, an approach for extracting and querying requirements-related information in online discussions. The highlight of our approach is a high-level query language that combines aspects of both requirements and discussion in online forums. We give the semantics of the query language in terms of relational databases and SQL. We demonstrate the usefulness of the language using examples on real data extracted from online discussions. Our approach relies on human annotations of online discussions. We highlight the subtleties involved in interpreting the content in online discussions and the assumptions and choices we made to effectively address them. We demonstrate the feasibility of generating high-quality annotations by obtaining them from lay Amazon Mechanical Turk users.

Original language	English
Title of host publication	Requirements Engineering Conference (RE), 2017 IEEE 25th International
Publisher	IEEE
ISBN (Electronic)	978-1-5386-3191-1
ISBN (Print)	978-1-5386-3192-8
DOIs	https://doi.org/10.1109/RE.2017.83
Publication status	Published - 26 Sept 2017

Publication series

Name	2017 IEEE 25th International Requirements Engineering Conference (RE)
Publisher	IEEE
ISSN (Electronic)	2332-6441

Bibliographical note

Keywords

Requirements elicitation; Crowdsourcing; Social media; Online discussions; Query language

Access to Document

10.1109/RE.2017.83

Canary Extracting Requirements-Related
© Copyright 2017 IEEE.
Accepted author manuscript, 1.13 MB

Cite this

@inproceedings{6a368ba1282645e2a8f3d8e0f2601f10,

title = "Canary: Extracting Requirements-Related Information from Online Discussions",

abstract = "Online discussions about software applications generate a large amount of requirements-related information. This information can potentially be usefully applied in requirements engineering; however currently, there are few systematic approaches for extracting such information. To address this gap, we propose Canary, an approach for extracting and querying requirements-related information in online discussions. The highlight of our approach is a high-level query language that combines aspects of both requirements and discussion in online forums. We give the semantics of the query language in terms of relational databases and SQL. We demonstrate the usefulness of the language using examples on real data extracted from online discussions. Our approach relies on human annotations of online discussions. We highlight the subtleties involved in interpreting the content in online discussions and the assumptions and choices we made to effectively address them. We demonstrate the feasibility of generating high-quality annotations by obtaining them from lay Amazon Mechanical Turk users.",

keywords = "Requirements elicitation; Crowdsourcing; Social media; Online discussions; Query language",

author = "Peter Sawyer and Georgi Kanchev and Amit Chopra and Pradeep Murukannaiah",

year = "2017",

month = sep,

day = "26",

doi = "10.1109/RE.2017.83",

language = "English",

isbn = "978-1-5386-3192-8",

series = "2017 IEEE 25th International Requirements Engineering Conference (RE)",

publisher = "IEEE",

booktitle = "Requirements Engineering Conference (RE), 2017 IEEE 25th International",

address = "United States",

}

Canary: Extracting Requirements-Related Information from Online Discussions. / Sawyer, Peter; Kanchev, Georgi ; Chopra, Amit et al.
Requirements Engineering Conference (RE), 2017 IEEE 25th International. IEEE, 2017. (2017 IEEE 25th International Requirements Engineering Conference (RE)).

Research output: Chapter in Book/Published conference output › Conference publication

TY - GEN

T1 - Canary: Extracting Requirements-Related Information from Online Discussions

AU - Sawyer, Peter

AU - Kanchev, Georgi

AU - Chopra, Amit

AU - Murukannaiah, Pradeep

PY - 2017/9/26

Y1 - 2017/9/26

N2 - Online discussions about software applications generate a large amount of requirements-related information. This information can potentially be usefully applied in requirements engineering; however currently, there are few systematic approaches for extracting such information. To address this gap, we propose Canary, an approach for extracting and querying requirements-related information in online discussions. The highlight of our approach is a high-level query language that combines aspects of both requirements and discussion in online forums. We give the semantics of the query language in terms of relational databases and SQL. We demonstrate the usefulness of the language using examples on real data extracted from online discussions. Our approach relies on human annotations of online discussions. We highlight the subtleties involved in interpreting the content in online discussions and the assumptions and choices we made to effectively address them. We demonstrate the feasibility of generating high-quality annotations by obtaining them from lay Amazon Mechanical Turk users.

AB - Online discussions about software applications generate a large amount of requirements-related information. This information can potentially be usefully applied in requirements engineering; however currently, there are few systematic approaches for extracting such information. To address this gap, we propose Canary, an approach for extracting and querying requirements-related information in online discussions. The highlight of our approach is a high-level query language that combines aspects of both requirements and discussion in online forums. We give the semantics of the query language in terms of relational databases and SQL. We demonstrate the usefulness of the language using examples on real data extracted from online discussions. Our approach relies on human annotations of online discussions. We highlight the subtleties involved in interpreting the content in online discussions and the assumptions and choices we made to effectively address them. We demonstrate the feasibility of generating high-quality annotations by obtaining them from lay Amazon Mechanical Turk users.

KW - Requirements elicitation; Crowdsourcing; Social media; Online discussions; Query language

UR - https://ieeexplore.ieee.org/document/8048888

U2 - 10.1109/RE.2017.83

DO - 10.1109/RE.2017.83

M3 - Conference publication

SN - 978-1-5386-3192-8

T3 - 2017 IEEE 25th International Requirements Engineering Conference (RE)

BT - Requirements Engineering Conference (RE), 2017 IEEE 25th International

PB - IEEE

ER -

Canary: Extracting Requirements-Related Information from Online Discussions

Abstract

Publication series

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this