| 注册
首页|期刊导航|河北农业大学学报|融合对抗训练与胶囊网络的羊疾病命名实体自动标注方法

融合对抗训练与胶囊网络的羊疾病命名实体自动标注方法

张泽嘉 孙小华 王超 王斌 袁万哲 王福顺

河北农业大学学报2026,Vol.49Issue(2):95-107,13.
河北农业大学学报2026,Vol.49Issue(2):95-107,13.DOI:10.13320/j.cnki.jauh.2026.0025

融合对抗训练与胶囊网络的羊疾病命名实体自动标注方法

An automatic annotation method for sheep disease named entities integrating adversarial training and capsule networks

张泽嘉 1孙小华 2王超 3王斌 3袁万哲 4王福顺3

作者信息

  • 1. 河北农业大学信息科学与技术学院,河北保定 071001
  • 2. 河北软件职业技术学院数字传媒系,河北保定 071000
  • 3. 河北农业大学信息科学与技术学院,河北保定 071001||河北省农业大数据重点实验室,河北保定 071000
  • 4. 河北农业大学动物医学院/中兽医学院,河北保定 071000
  • 折叠

摘要

Abstract

Disease prevention and control are crucial for the healthy development of the sheep industry.However,weak capabilities of the primary diagnosis and over-reliance on expert experience often lead to delay in disease management.Deep learning-based large models offer an innovative solution for assisted diagnosis,for which sheep disease texts serve as core carriers of knowledge in this field.Named Entity Recognition(NER)is a key step towards automatic text annotation.Nevertheless,sheep disease corpora faces challenges such as imbalanced entity distribution,complex semantic relationships,and ambiguous entity boundaries.To address these issues,this study analyzed the characteristics of sheep disease texts,established annotation rules,and constructed a corpus containing 9 categories and 9,635 entities to support NER tasks,proposing a named entity recognition model for sheep disease texts—SD-ATCN(A Named Entity Recognition Model for Sheep Diseases Texts Integrating Adversarial Training and Capsule Networks).First,the RoBERTa model was used to extract contextual embeddings,which were integrated with adversarial samples generated through adversarial training to form the output of the text embedding layer.This enhanced the model's stability in handling noisy data and imbalanced annotation scenarios.Then,capsule networks were incorporated into the global features extracted by BiLSTM to improve the model's ability to capture complex entity relationships.Finally,a CRF layer decoded the entity labels to achieve automatic entity recognition.Additionally,differential learning rates were applied during training to flexibly adjust learning rates across different layers,accelerating convergence and improving performance.Experimental results showed that the model respectively achieved precision,recall,and F,scores as 93.28%,95.57%,and 94.40%,outperforming the baseline RoBERTa-BiLSTM-CRF model by 2.13%,1.20%,and 1.69%.Based on the SD-ATCN model,an automatic annotation system for sheep disease texts was developed,significantly improving annotation efficiency and providing technical support for automated text processing and intelligent diagnosis in sheep disease management.

关键词

羊疾病/命名实体识别/RoBERTa/对抗训练/胶囊网络/差分学习率/自动标注

Key words

sheep diseases/named entity recognition/RoBERTa/adversarial training/capsule networks/differential learning rates/automatic annotation

分类

信息技术与安全科学

引用本文复制引用

张泽嘉,孙小华,王超,王斌,袁万哲,王福顺..融合对抗训练与胶囊网络的羊疾病命名实体自动标注方法[J].河北农业大学学报,2026,49(2):95-107,13.

基金项目

河北省重点研发计划项目(22327403D) (22327403D)

河北省现代农业产业技术体系羊产业创新团队专项资金项目(HBCT2024250204). (HBCT2024250204)

河北农业大学学报

1000-1573

访问量0
|
下载量0
段落导航相关论文