Abstract
This paper presents a novel algorithm that leverages cutting-edge machine-learning techniques to accurately and efficiently detect AI-generated texts. Rapid advancements in natural language processing models have led to the generation of text closely resembling human language, making it increasingly difficult to differentiate between human and AI-generated content. However, misuse of such texts presents a serious and imminent threat to the quality of academic publishing. This underscores the urgent need for robust detection mechanisms to ensure information quality, maintain trust, and preserve the integrity of research publications. Our proposed model outperformed existing algorithms for accuracy with less computational complexity. The proposed model is a feature-based hybrid deep learning network that leverages part-of-speech tagging and integrates Bidirectional Long Short-Term Memory (BiLSTM) networks with Attention modules. The initial module extracts local contextual features using convolutional layers, followed by BiLSTM layers that capture long-term dependencies from past and future sequences. An attention mechanism highlights critical sequence components, enhancing the model’s focus on relevant data. The outputs from the attention and initial modules are concatenated through a residual connection, ensuring comprehensive feature representation. This combination is then fed into dense layers for final classification, effectively balancing feature richness and computational efficiency. The proposed model was evaluated on two benchmark datasets, achieving 85.00% and 88.00% accuracy, respectively.
Original language | English |
---|---|
Pages (from-to) | 71563-71576 |
Number of pages | 14 |
Journal | IEEE Access |
Volume | 13 |
Early online date | 21 Apr 2025 |
DOIs | |
Publication status | E-pub ahead of print - 21 Apr 2025 |
Bibliographical note
Copyright © 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/Keywords
- Accuracy
- Detectors
- Computational modeling
- Transformers
- Text detection
- Feature extraction
- Context modeling
- Attention mechanisms
- Fake news
- Deep learning
- AI-generated text detection
- authorship analysis
- authorship verification
- machine-generated text detection