动态调整语义的词性加权多模态情感分析模型OA北大核心CSTPCD
Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
为更好地利用单词词性包含的语义信息和伴随单词出现时的非自然语言上下文信息,提出动态调整语义的词性加权多模态情感分析(part of speech weighted multi-modal sentiment analysis model with dynamic semantics adjustment,PW-DS)模型.该模型以自然语言为主体,分别使用基于Transformer的双向编码器表示(bidirectional encoder representation from Transformers,BERT)模型、广义自回归预训练(generalized autoregressive pretraining for language understanding,XLNet)模型和一种鲁棒优化的BERT预训练(robustly optimized BERT pretraining approach,RoBERTa)模型为文本模态做词嵌入编码;创建动态调整语义模块将自然语言和非自然语言信息有效结合;设计词性加权模块,提取单词词性并赋权以优化情感判别.与张量融合网络和低秩多模态融合等当前先进模型的对比实验结果表明,PW-DS模型在公共数据集CMU-MOSI和CMU-MOSEI上的平均绝对误差分别达到了0.607和0.510,二分类准确率分别为89.02%和86.93%,优于对比模型.通过消融实验分析了不同模块对模型效果的影响,验证了模型的有效性.
In order to better utilize of the semantic information contained in the part of speech of words and the contextual information of unnatural language accompanying the appearance of words,a part of speech weighted multi-modal sentiment analysis model with dynamic adjustment of semantic representation(PM-DS)is proposed.The PM-DS model takes natural language as the main body,and uses bidirectional encoder representation from transformer model,generalized autoregressive pre-training model for language understanding(XLNet)and a robustly optimized BERT pretraining approach(RoBERTa)to embed words into text patterns,respectively.A dynamic semantic adjustment module is created to effectively combine natural language and unnatural language information.The part of speech weighting module is designed to extract the part of speech of words and assigned weights to optimize sentiment discrimination.Comparative experimental results with the current advanced models such as tensor fusion network and low-rank multimodal fusion show that the average absolute errors of PW-DS model on public data sets CMU-MOSI and CMU-MOSEI are 0.607 and 0.510,respectively,and the binary classification accuracies are 89.02%and 86.93%,respectively,which is better than the models in the comparative experiments.The effects of different modules on the model are also analyzed through ablation experiments.The experimental results demonstrate that the proposed model is effective to deal with the problem of multi-modal emotion analysis.
花强;陈卓;张峰;董春茹
河北省机器学习与计算智能重点实验室,河北大学数学与信息科学学院,河北保定 071002
计算机与自动化
人工智能多模态情感分析动态调整语义词性加权多模态向量位置可视化词性加权可视化
artificial intelligencemultimodal sentiment analysisdynamical adjustment semanticpart of speech weightingvisualization of multimodal vector positionvisualization of part of speech weights
《深圳大学学报(理工版)》 2024 (003)
283-292 / 10
The Innovation Capacity Enhancement Program-Science and Technology Platform Project of Hebei Province(22567623H) 河北省创新能力提升计划科技平台资助项目(22567623H)
评论