Abstract
This article evaluates the reliability, efficiency, and effectiveness of Linguistic Inquiry and Word Count (LIWC; Boyd et al., 2022) for the analysis of a white nationalist forum. This is important because LIWC has been the computational tool of choice for scores of studies generally and many examining extremist content in a forensic or security context. Our purpose, therefore, is to understand whether LIWC can be depended upon for large-scale analyses; we initially examine this here using a small sample of posts from a set of just eight users and manually checking the program's automated codings of a subset of categories. Our results show that the LIWC coding cannot be relied upon – precision falls to as low as 49.6% and recall as low as 41.7% for some categories. It would be possible to engage in considerable manual correction of these results, but this undermines its purported efficiency for large datasets.
Original language | English |
---|---|
Article number | 100118 |
Number of pages | 9 |
Journal | Applied Corpus Linguistics |
Volume | 5 |
Issue number | 1 |
Early online date | 9 Jan 2025 |
DOIs | |
Publication status | Published - Apr 2025 |
Bibliographical note
Copyright © 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (https://creativecommons.org/licenses/by/4.0/).Keywords
- LIWC
- reliability
- computerized text analysis
- forensic linguistics
- Discourse Analysis