| 注册
首页|期刊导航|地震学报|基于数据与知识双驱动的TransSeisNet地震活动性预测

基于数据与知识双驱动的TransSeisNet地震活动性预测

王雪鉴 王自法 李兆焱 冯永成 周坤伯 王昭栋 张新元 Wang Jianming

地震学报2026,Vol.48Issue(2):339-354,16.
地震学报2026,Vol.48Issue(2):339-354,16.DOI:10.11939/jass.20250014

基于数据与知识双驱动的TransSeisNet地震活动性预测

Data and knowledge dual-driven TransSeisNet for seismicity prediction

王雪鉴 1王自法 1李兆焱 1冯永成 1周坤伯 1王昭栋 1张新元 1Wang Jianming1

作者信息

  • 1. 中国哈尔滨 150080 中国地震局工程力学研究所||中国哈尔滨 150080 中国地震局地震工程与工程振动重点实验室||中国哈尔滨 150080 地震灾害防治应急管理部重点实验室
  • 折叠

摘要

Abstract

Seismicity prediction remains a critical challenge in seismology,requiring a delicate balance between data-driven insights and domain-specific physical principles.Traditional stat-istical methods,such as the epidemic-type aftershock sequence(ETAS)model,have long served as fundamental tools for analyzing earthquake catalogs.However,these approaches struggle to fully utilize rapidly growing seismic data due to their reliance on simplified paramet-ric assumptions and limited adaptability to complex spatiotemporal patterns.On the other hand,purely data-driven machine learning models,while capable of processing high dimen-sional datasets,often produce predictions lacking physical interpretability that can violate estab-lished seismological laws.To bridge this gap,this study proposes TransSeisNet,a hybrid framework that synergizes the computational power of deep learning with the empirical rigor of statistical seismology.By directly embedding domain knowledge into the model architecture and optimization process,TransSeisNet achieves both high predictive accuracy and adherence to physical constraints,providing a robust solution for earthquake forecasting. Methodological framework The TransSeisNet architecture is based on the Transformer neural network paradigm,renowned for its ability to model long-term dependencies in sequential data through self-attention mechanisms.The model processes earthquake catalogs(continuous records of seismic events containing temporal,spatial,and magnitude information)to predict future seismic activ-ity.Key innovations include:① Physical constraint layer.The physical constraint layer is integrated into the output layer to enforce compliance with empirical seismological laws.For instance,the magnitude distribution of predicted events is normalized to follow the power-law relationship of the Gutenberg-Richter(GR)Law,ensuring model outputs conform to observed frequency-magnitude scaling.Additionally,temporal clustering patterns,such as the rapid aftershock decay described by the Omori-Utsu Law,are explicitly encoded to prevent non-physical predictions.② Knowledge-guided loss function.The training objective combines con-ventional negative log-likelihood terms with regularization terms derived from statistical seismo-logy.For example,deviations from the GR Law's b-value or violations of Omori-Utsu decay parameters are penalized during optimization.This dual-objective approach ensures simultan-eous optimization of data fidelity and physical consistency. Integration of domain knowledge TransSeisNet systematically incorporates three pillars of statistical seismology:① Guten-berg-Richter Law.It constrains the predicted event magnitude-frequency distribution to follow a power-law scaling,preventing unrealistic overprediction of large-magnitude events.② ETAS model characteristics.The self-attention mechanism implicitly captures the ETAS model's core premise—earthquakes can trigger subsequent events—by modeling temporal and spatial trig-gering probabilities.③ Omori-Utsu decay.It regularizes the temporal decay of aftershock producti-vity to align with empirically observed trends,ensuring predicted aftershock sequences decay at rates consistent with historical observations. Experimental results TransSeisNet was rigorously evaluated on two real-world earthquake catalogs and one syn-thetic catalog:① Southern California catalog,covering the San Jacinto fault zone(1981-2023),and containing over 12 000 events with magnitudes M≥2.0;② Japan Meteorological Agency catalog,covering the Tohoku and Kanto regions(1990-2023),and containing over 20 000 events,with emphasis on subduction zone seismicity;③ Synthetic catalog,generated using ETAS parameters to validate the model's ability to recover known triggering dynamics.Performance Highlights:① Superior accuracy.TransSeisNet consistently outperformed the benchmark ETAS model in predicting the timing,location,and magnitude of seismic events across all catalogs.For instance,in the Southern California catalog,the model demonstrated a 30%improvement in likelihood-based evaluation metrics compared to ETAS.② Enhanced sta-bility.By incorporating physical constraints,TransSeisNet exhibited reduced sensitivity to data noise and outliers,producing stable predictions even during periods of high seismic activity(e.g.,aftershock sequences following major earthquakes).③ Generalization capability.The model maintained robust performance across diverse tectonic settings,including strike-slip fault systems(San Jacinto)and subduction zones(Japan),highlighting its adaptability to varying seismogenic regimes. Model optimization and analysis Architectural depth:Comparative studies of model depth revealed that a 6-layer Trans-former configuration achieved optimal performance,balancing computational efficiency and predictive power.Shallower architectures(e.g.,four layers)exhibited underfitting on com-plex sequences,while deeper models(more than eight layers)showed diminishing returns.Ac-tivation function selection:Experiments with ReLU,ELU,and Swish activations indicated minimal performance differences,though ELU marginally improved training stability due to its smooth gradient properties.Comparative analysis with machine learning baselines:TransSeis-Net outperformed alternative machine learning architectures including LSTM and GRU net-works,which struggled to simultaneously capture long-term dependencies and enforce physical constraints. Conclusion TransSeisNet represents a significant advancement in seismicity prediction by unifying data-driven machine learning with empirical seismological principles.Its dual knowledge-data-driven framework addresses limitations of both traditional statistical methods(e.g.,rigid para-metric assumptions)and purely machine learning approaches(e.g.,lack of interpretabi-lity).The model's success underscores the value of integrating domain knowledge into neural network design,particularly in geophysical applications where physical plausibility is para-mount.Future work will focus on extending the framework to incorporate real-time geodetic data(e.g.,GNSS measurements)and multi-physics simulations,further enhancing its utility for operational earthquake forecasting and hazard assessment.

关键词

ETAS/Transformer/地震活动性/经验知识/地震目录

Key words

ETAS/Transformer/seismicity/empirical knowledge/earthquake catalog

分类

天文与地球科学

引用本文复制引用

王雪鉴,王自法,李兆焱,冯永成,周坤伯,王昭栋,张新元,Wang Jianming..基于数据与知识双驱动的TransSeisNet地震活动性预测[J].地震学报,2026,48(2):339-354,16.

基金项目

国家重点研发计划(2023YFC3805203)、中国地震局工程力学研究所基本科研业务费专项(2023A01)和国家自然科学基金面上项目(52378544,52378543)共同资助. (2023YFC3805203)

地震学报

0253-3782

访问量0
|
下载量0
段落导航相关论文