| 注册
首页|期刊导航|中南大学学报(自然科学版)|基于双目视觉与Transformer的连铸坯模型定位与测量

基于双目视觉与Transformer的连铸坯模型定位与测量

李同谱 许四祥 施宇翔 杨利法

中南大学学报(自然科学版)2024,Vol.55Issue(4):1312-1322,11.
中南大学学报(自然科学版)2024,Vol.55Issue(4):1312-1322,11.DOI:10.11817/j.issn.1672-7207.2024.04.006

基于双目视觉与Transformer的连铸坯模型定位与测量

Continuous casting slab model positioning and measurement based on binocular vision and Transformer

李同谱 1许四祥 1施宇翔 1杨利法1

作者信息

  • 1. 安徽工业大学机械工程学院,安徽马鞍山,243032
  • 折叠

摘要

Abstract

In order to address the problems of low efficiency and complex matching of traditional binocular vision detection algorithms,a continuous casting slab model positioning and measurement based on binocular vision and Transformer method was proposed in this paper.Firstly,a calibrated parallel binocular camera was used to collect images of the continuous casting slab model,which were used as datasets after correction and labeling.Then,with the proposed Transunet* as the backbone,a neural network was used to output the key point coordinates of the datasets.The network model adopted a multi-scale U-shape structure to offset the lower bound of theoretical error of Gaussian heatmap caused by the downsampling quantization.In order to improve the defect that convolutional neural networks only focus on local features,Transformer module was added to enhance the information exchange in each channel,and an optimized loss function calculation method was proposed to overcome the problem of the misproportion of positive and negative samples and accelerate network convergence.Finally,the network output was reconstructed with binocular vision to complete the distance measurement.The results show that the proposed algorithm outperforms other neural network methods in the detection accuracy of key points.Compared with the sub-optimal methods,the root-mean-square error and normalized mean error the proposed method are reduced by 17.24%and 18.58%,respectively.In the three-dimensional ranging,the accuracy of the proposed method is obviously superior to that of the traditional feature detection algorithm.Thus,the proposed method can meet the requirements of high precision and small environmental impact in industrial measurement and positioning.

关键词

双目视觉/Transformer/关键点检测/注意力机制

Key words

binocular vision/Transformer/landmark detection/attention mechanism

分类

信息技术与安全科学

引用本文复制引用

李同谱,许四祥,施宇翔,杨利法..基于双目视觉与Transformer的连铸坯模型定位与测量[J].中南大学学报(自然科学版),2024,55(4):1312-1322,11.

基金项目

国家自然科学基金资助项目(51374007) (51374007)

安徽高校自然科学研究重点项目(KJ2020A0259) (KJ2020A0259)

Project(51374007)supported by the National Natural Science Foundation of China (51374007)

Project(KJ2020A0259)supported by the Key Project of Natural Science Research of Anhui Educational Committee) (KJ2020A0259)

中南大学学报(自然科学版)

OA北大核心CSTPCD

1672-7207

访问量0
|
下载量0
段落导航相关论文