Abstract
This article evaluates the reliability, efficiency, and effectiveness of Linguistic Inquiry and Word Count (LIWC; Boyd et al., 2022) for the analysis of a white nationalist forum. This is important because LIWC has been the computational tool of choice for scores of studies generally and many examining extremist content in a forensic or security context. Our purpose, therefore, is to understand whether LIWC can be depended upon for large-scale analyses; we initially examine this here using a small sample of posts from a set of just eight users and manually checking the program's automated codings of a subset of categories. Our results show that the LIWC coding cannot be relied upon – precision falls to as low as 49.6% and recall as low as 41.7% for some categories. It would be possible to engage in considerable manual correction of these results, but this undermines its purported efficiency for large datasets.
| Original language | English |
|---|---|
| Article number | 100118 |
| Number of pages | 9 |
| Journal | Applied Corpus Linguistics |
| Volume | 5 |
| Issue number | 1 |
| Early online date | 9 Jan 2025 |
| DOIs | |
| Publication status | Published - Apr 2025 |
Bibliographical note
Copyright © 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (https://creativecommons.org/licenses/by/4.0/).Keywords
- LIWC
- reliability
- computerized text analysis
- forensic linguistics
- Discourse Analysis