首页|期刊导航|南京林业大学学报（自然科学版）|基于自注意力机制的航空林火图像识别方法

基于自注意力机制的航空林火图像识别方法

王俊玲范习健杨绪兵业巧林符利勇

南京林业大学学报（自然科学版）2025，Vol.49Issue(2)：194-202,9.

南京林业大学学报（自然科学版）2025，Vol.49Issue(2)：194-202,9.DOI:10.12302/j.issn.1000-2006.202308021

基于自注意力机制的航空林火图像识别方法

Method for aerial forest fire image recognition based on self-attention mechanism

王俊玲 ¹范习健 ²杨绪兵 ²业巧林 ²符利勇³

作者信息

1. 南京林业大学信息科学技术学院,江苏南京 210037||南京理工大学紫金学院计算机与人工智能学院,江苏南京 210023
2. 南京林业大学信息科学技术学院,江苏南京 210037
3. 中国林业科学研究院资源信息研究所,北京 100091
折叠

摘要

Abstract

[Objective]This study aims to address the challenges of small fire point targets and complex environments in aerial forest fire images,we propose FireViT,a self-attention-based image recognition method.[Method]This method aims to enhance the accuracy and robustness of aerial forest fire image recognition.We used forest fire videos collected by drones in Chongli District,Zhangjiakou City,to construct a dataset through data preprocessing.A 10-layer vision transformer(ViT)was selected as the backbone network.Images were serialized using overlapping sliding windows,with embedded positional information fed into the first layer of ViT.The region selection modules,extracted from the preceding nine layers of ViT,were integrated into the tenth layer through multi-head self-attention and multi-layer perceptron mechanisms.This effectively amplified minor differences between subgraphs to capture features of small targets.Finally,a contrastive feature learning strategy was employed to construct an objective loss function for model prediction.We validated the model's effectiveness by establishing training and testing sets with sample ratios of 8∶2,7∶3,6∶4,and 4∶6,and compared its performance with five classical models.[Result]With the allocation ratio of four training and test sets,the model achieved a recognition rate of 100％and accuracy of 94.82％,95.05％,94.90％,and 94.80％,respectively,with an average accuracy of 94.89％.This performance surpassed that of the five comparison models.The model converged rapidly,maintained a high recognition accuracy rate,and demonstrated stability in subsequent iterations.It showed strong generalization ability.The recognition rates were 99.97％,99.89％,99.80％and 99.77％,also higher than the five comparison models.[Conclusion]This research employed a model that integrated a self-attention mechanism with weakly supervised learning to reveal distinct local feature variations in aerial forest fire images across various environments.The approach exhibited strong generalization capability and robustness,which was significant for improving the capacity,efficiency,and effectiveness of fire situation management and hazard response.It also played a crucial role in preventing forest wildfires.

关键词

航空林火图像/自注意力机制/细粒度分类/视觉变压器(ViT)/森林防火/无人机/张家口崇礼区

Key words

aerial forest fire image/self-attention mechanism/fine grained classification/vision transformer/forest fire prevention/unmanned aerial vehicles(UAV)/Chongli District of Zhangjiakou City

分类

信息技术与安全科学

引用本文复制引用

王俊玲,范习健,杨绪兵,业巧林,符利勇..基于自注意力机制的航空林火图像识别方法[J].南京林业大学学报（自然科学版）,2025,49(2):194-202,9.

基金项目

国家自然科学基金项目(62072246) （62072246）

张家口崇礼区森林智慧防火项目(DA2020001) （DA2020001）

南京市留学人员科技创新项目. （）

南京林业大学学报（自然科学版）

OA北大核心

ISSN：1000-2006

访问量0

下载量0

段落导航