地震学报2026,Vol.48Issue(2):355-373,19.DOI:10.11939/jass.20250042
基于泛化地震样本训练的全卷积神经网络在2016年熊本MW6.2地震监测中的应用
Application of fully convolutional neural network trained on generalized seismic samples in the 2016 Kumamoto MW6.2 earthquake monitoring
摘要
Abstract
Earthquake monitoring is one of the core tasks of seismology research.In recent years,significant progress has been made in earthquake monitoring and positioning methods based on neural networks.Among them,the neural network method based on waveform analy-sis has become a research hotspot due to its advantages in feature extraction and real-time pro-cessing;meanwhile,the method using travel time information also shows important potential in this rapidly developing field.However,most existing deep learning models still face a wide-spread and unresolved basic limitation:seriously insufficient generalization ability,which is usually confined to specific geographic areas and network configurations adopted in the training stage.This limitation significantly hinders the practical application of such methods in global or regional earthquake early warning systems,which have extremely high requirements for rapid deployment and operational flexibility.In response to this key challenge,this study systemati-cally implemented and rigorously evaluated a newly proposed general fully convolutional neur-al network(FCN)model.The model was trained on large-scale diversified generalized seismic samples,and the 2016 MW6.2 Kumamoto earthquake sequence—a case with high complexity and scientific value—w as used as a comprehensive verification object to focus on the quantita-tive evaluation of its early warning timeliness,multi-dimensional positioning accuracy,and cross-magnitude generalization ability in real operating scenarios.The complex FCN architec-ture is designed for real-time seismic data processing,and consists of three parallel dedicated sub-networks,which are responsible for event detection,source location and magnitude estima-tion simultaneously through end-to-end analysis of continuous three-component waveform data streams.The excellent generalization ability of the model is mainly due to its innovative train-ing paradigm,which uses advanced data reorganization technology to construct a massive syn-thetic data set covering a wide range of source-station configuration variability.This data gener-ation process effectively simulates various real station geometric layouts and source location scenarios,and strictly follows the basic physical laws of seismic wave propagation,thereby enabling the model to learn inherently transferable physical characteristics instead of merely memorizing specific network configurations.In the detailed experimental verification,we selected the mainshock-affected key area as the monitoring area,input the systematically pre-processed waveform data from 12 reasonably deployed stations into the pre-trained model,and successfully identified and located 69 obvious aftershocks within the critical first hour after the mainshock.The comprehensive analysis results show that the model can release reliable alarms within 4.4-6.4 seconds after the P wave arrives at the first trigger station without any transfer learning or region-specific parameter adjustment,and provide robust estimation of basic source parameters.The statistical results show that the root mean square error of the epicenter determ-ination of all successful positioning events is 3.409 km,and the root mean square error of depth estimation is 3.787 km.The system exhibits consistently excellent practicability in practical applications and maintains performance stability for seismic events of different magnitude ranges in complex sequences.However,critical assessment also reveals several limitations re-quiring further research:The complete system has an aftershock detection rate of 38.5%,with underreported events mainly concentrated in small-magnitude events or spatio-temporal cluster-ing sequences disturbed by waveforms;meanwhile,the accuracy of depth estimation remains lower than that of horizontal positioning,which is speculated to be associated with the simpli-fied one-dimensional velocity model adopted in the training stage.In summary,the generalized FCN method provides a promising and feasible technical path for the rapid deployment of com-plex earthquake early warning systems worldwide,and achieves a better balance between calcu-lation speed,operation accuracy and cross-tectonic environment generalization ability.Future research should focus on improving the detection sensitivity of small earthquakes and cluster events,introducing a more realistic three-dimensional velocity structure to enhance depth resol-ution,and optimizing the network architecture to meet the operational requirements in resource-constrained environments.关键词
地震预警/地震定位/全卷积神经网络/熊本MW6.2地震/深度学习Key words
earthquake early warning/earthquake location/fully convolutional neural net-works/Kumamoto MW6.2 earthquake/deep learning分类
天文与地球科学引用本文复制引用
韦必福,张雄,杨禹..基于泛化地震样本训练的全卷积神经网络在2016年熊本MW6.2地震监测中的应用[J].地震学报,2026,48(2):355-373,19.基金项目
国家自然科学联合基金(U2239204)、国家自然科学基金(42474092)及上海佘山地球物理国家野外科学观测研究站开放基金(SSOP202103)共同资助. (U2239204)