| 注册
首页|期刊导航|农业机械学报|基于SBERT-Attention-LDA与ML-LSTM特征融合的烟草问句意图识别方法

基于SBERT-Attention-LDA与ML-LSTM特征融合的烟草问句意图识别方法

朱波 黎魁 邱兰 黎博

农业机械学报2024,Vol.55Issue(5):273-281,9.
农业机械学报2024,Vol.55Issue(5):273-281,9.DOI:10.6041/j.issn.1000-1298.2024.05.026

基于SBERT-Attention-LDA与ML-LSTM特征融合的烟草问句意图识别方法

Tobacco Interrogative Intent Recognition Based on SBERT-Attention-LDA and ML-LSTM Feature Fusion

朱波 1黎魁 1邱兰 1黎博2

作者信息

  • 1. 昆明理工大学机电工程学院,昆明 650504
  • 2. 武汉工程大学机电工程学院,武汉 430205
  • 折叠

摘要

Abstract

Aiming at the problems of feature sparsity,terminology and difficulty in capturing semantic associations within the text in question intention recognition in the tobacco domain,a feature fusion method based on sentence-bidirectional encoder representational from transformers-Attention mechanism-latent dirichlet allocation(SBERT-Attention-LDA)and multi layers-long short term memory(ML-LSTM)feature fusion was proposed.The method first dynamically encoded the tobacco question based on the SBERT pre-training model combined with the Attention mechanism and converted it into semantic-rich feature vectors,and at the same time,the topic vector of the question was modelled by using the LDA model to capture the topic information in the question;and then the joint feature representation with more complete and accurate question semantics was obtained by using the modified model-level ML-LSTM feature fusion method;and then the three-layer LSTM and ML-LSTM feature fusion method was used to identify the intention of the question.Then a 3-channel convolutional neural network(CNN)was used to extract the hidden features in the hybrid semantic representation of the question and fed them into the fully connected layer and Softmax function to achieve the classification of the question intent.Compared with the enhanced representation through knowledge integration and embedding(BERT and ERNIE)CNN models,the improvement was obvious(the F1 values were improved by 2.07 percentage points and 2.88 percentage points,respectively),which supported the construction of the Q&A system for tobacco websites.

关键词

烟草问句分类/自然语言处理/特征融合/自注意力机制

Key words

classification of tobacco questions/natural language processing/feature fusion/self-attention mechanis

分类

信息技术与安全科学

引用本文复制引用

朱波,黎魁,邱兰,黎博..基于SBERT-Attention-LDA与ML-LSTM特征融合的烟草问句意图识别方法[J].农业机械学报,2024,55(5):273-281,9.

基金项目

中国烟草总公司云南省烟草公司重点项目(2021530000241012) (2021530000241012)

农业机械学报

OA北大核心CSTPCD

1000-1298

访问量0
|
下载量0
段落导航相关论文