Abstract
This article evaluates the reliability, efficiency, and effectiveness of Linguistic Inquiry and Word Count (LIWC; Boyd et al., 2022) for the analysis of a white nationalist forum. This is important because LIWC has been the computational tool of choice for scores of studies generally and many examining extremist content in a forensic or security context. Our purpose, therefore, is to understand whether LIWC can be depended upon for large-scale analyses; we initially examine this here using a small sample of posts from a set of just eight users and manually checking the program's automated codings of a subset of categories. Our results show that the LIWC coding cannot be relied upon – precision falls to as low as 49.6% and recall as low as 41.7% for some categories. It would be possible to engage in considerable manual correction of these results, but this undermines its purported efficiency for large datasets.
| Original language | English |
|---|---|
| Article number | 100118 |
| Number of pages | 9 |
| Journal | Applied Corpus Linguistics |
| Volume | 5 |
| Issue number | 1 |
| Early online date | 9 Jan 2025 |
| DOIs | |
| Publication status | Published - Apr 2025 |
Bibliographical note
Copyright © 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (https://creativecommons.org/licenses/by/4.0/).Keywords
- LIWC
- reliability
- computerized text analysis
- forensic linguistics
- Discourse Analysis
Fingerprint
Dive into the research topics of 'Is Linguistic Inquiry and Word Count (LIWC) reliable, efficient, and effective for the analysis of large online datasets in forensic and security contexts?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver