Abstract
Mining entities from open Chinese mechanical equipment texts have become prevailing in the intelligent manufacturing field. However, compared to named entities in other domains, it is very hard to determine entity boundaries in mining Chinese mechanical equipment texts because there is no unified standard for entity simplification, and digits and units are mixed in the text. To address the issue, this paper presents an entity boundary-define strategy and constructs a mechanical equipment-oriented corpus called MECorpus using open Chinese mechanical equipment texts by combining domain knowledge. A multi-neural network collaboration model Adaptive-BERT-BiLSTM-CRF-Rating (ABBCR) is then proposed for mechanical equipment named entity recognition. The novelty of ABBCR is characterised by its adaptive input mechanism and rating score ability for identified entities. Various experiments about ABBCR model selection, evaluation and application are conducted on MECorpus. Experimental results show that the ABBCR model provides high-quality mechanical equipment entities for constructing the mechanical equipment knowledge graph. ABBCR combined with the large language model has proved to be a promising method to manage complex mechanical equipment expertise.
Original language | English |
---|---|
Journal | Journal of Engineering Design |
Early online date | 1 Jul 2024 |
DOIs | |
Publication status | E-pub ahead of print - 1 Jul 2024 |
Bibliographical note
Copyright © 2024 Informa UK Limited, trading as Taylor & Francis Group. This is an Accepted Manuscript version of the following article, accepted for publication in the Journal of Engineering Design and published on 1st July 2024. This version is made available under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.Keywords
- Named entity recognition
- adaptive mechanism
- bidirectional encoder representations from transformers
- fuzzy boundary
- multi-neural network collaboration