Fujian Computer (福建电脑), 2025, Vol. 41, Issue 1: 11-17 (7 pages). DOI: 10.16707/j.cnki.fjpc.2025.01.002
Research on a Multimodal Fusion Method for Generating Chinese Descriptions of Ceramic Images
Abstract
Early methods for generating ceramic image descriptions suffered from insufficient recognition and description accuracy. To address this, the paper proposes a multi-scale image feature extraction method based on deep residual networks and feature pyramid networks, and combines it with a long short-term memory network with an additive attention mechanism to form the Res FL model for generating Chinese descriptions. The experimental results show that the Res FL model is significantly superior to traditional neural network methods in description accuracy and detail capture, and that it has high application value for improving the consistency and accuracy of ceramic image descriptions.
Keywords
Ceramic images / Image description / Image feature extraction
Classification
Information Technology and Security Science
Cite this article
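Hu Zhimeng (胡智猛), Peng Yongkang (彭永康), Zhang Xiujuan (张秀娟). Research on a Multimodal Fusion Method for Generating Chinese Descriptions of Ceramic Images (多模融合的陶瓷图像中文描述生成方法研究) [J]. Fujian Computer, 2025, 41(1): 11-17 (7 pages).
Funding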
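This work was supported by the Jingdezhen Municipal Science and Technology Plan Project (No. 2023GY001-01), the Jiangxi Province 03 Special Project and 5G Project (No. 20232ABC03A29), and the College Students' Innovation and Entrepreneurship Training Program (No. 202310408016).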
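Illustrative sketch
The abstract names an additive attention mechanism on top of an LSTM decoder but gives no formulas. As a rough illustration only, the following shows the standard additive (Bahdanau-style) attention step, which may differ from what the Res FL model actually uses. The function name, the weight names `W_f`, `W_h`, `v`, and all dimensions are assumptions made for this sketch; the region features stand in for the multi-scale features a residual network with a feature pyramid would produce.

```python
import math
import random


def additive_attention(features, hidden, W_f, W_h, v):
    """Return (context, weights) for one decoding step.

    features: list of region feature vectors, each of length feat_dim
    hidden:   decoder (LSTM) hidden state, length hidden_dim
    W_f:      feat_dim x attn_dim matrix; W_h: hidden_dim x attn_dim matrix
    v:        scoring vector of length attn_dim
    All names and shapes are hypothetical, not taken from the paper.
    """
    attn_dim = len(v)

    def project(vec, mat):
        # vec (length n) times mat (n x attn_dim) -> vector of length attn_dim
        return [sum(x * row[j] for x, row in zip(vec, mat)) for j in range(attn_dim)]

    h_proj = project(hidden, W_h)  # shared across all regions
    scores = []
    for a in features:
        a_proj = project(a, W_f)
        # e_i = v^T tanh(W_f^T a_i + W_h^T h)
        scores.append(sum(v[j] * math.tanh(a_proj[j] + h_proj[j]) for j in range(attn_dim)))

    # Softmax over regions, shifted by the max score for numerical stability
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted sum of the region features
    feat_dim = len(features[0])
    context = [sum(w * a[k] for w, a in zip(weights, features)) for k in range(feat_dim)]
    return context, weights


def rand_matrix(rows, cols):
    """Random Gaussian matrix as a list of rows (toy initialisation)."""
    return [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


# Toy sizes, chosen only for illustration
random.seed(0)
num_regions, feat_dim, hidden_dim, attn_dim = 6, 8, 5, 4
features = rand_matrix(num_regions, feat_dim)
hidden = [random.gauss(0, 1) for _ in range(hidden_dim)]
W_f = rand_matrix(feat_dim, attn_dim)
W_h = rand_matrix(hidden_dim, attn_dim)
v = [random.gauss(0, 1) for _ in range(attn_dim)]

context, weights = additive_attention(features, hidden, W_f, W_h, v)
```

In a captioning decoder, `context` would typically be concatenated with the previous word embedding and fed to the LSTM at each step, so that each generated word can attend to different image regions.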