| 注册
首页|期刊导航|中国防汛抗旱|基于社交媒体与大模型的城市内涝灾情信息提取研究

基于社交媒体与大模型的城市内涝灾情信息提取研究

杨晨 李松涛 刘颖 张会 李硕

中国防汛抗旱2026,Vol.36Issue(5):7-14,8.
中国防汛抗旱2026,Vol.36Issue(5):7-14,8.DOI:10.16867/j.issn.1673-9264.2026189

基于社交媒体与大模型的城市内涝灾情信息提取研究

Urban waterlogging disaster information extraction based on social media and large language models

杨晨 1李松涛 2刘颖 3张会 1李硕1

作者信息

  • 1. 华北水利水电大学数字孪生水利高等研究院,郑州 450046
  • 2. 华北水利水电大学水利学院,郑州 450046
  • 3. 华北水利水电大学水资源学院,郑州 450046
  • 折叠

摘要

Abstract

Urban underlying surfaces are complex and infrastructure is inadequate,making cities prone to waterlogging disasters during heavy rainfall.Rapid identification and extraction of disaster information is crucial for improving urban flood response efficiency and disaster management capabilities.Social media,due to its real-time nature and high public participation,has become an important data source for obtaining urban waterlogging disaster information.To address challenges such as the large scale,semantic complexity,and difficulty in directly supporting decision-making of social media disaster text information,this study takes the Zhengzhou"7·20"rainstorm as an example and constructs a 15-category fine-grained classification system based on the urban waterlogging disaster chain.A large language model(LLM)-driven disaster information extraction method is proposed,with the BERT-TextCNN model introduced as a benchmark for comparative analysis.The results show that:①LLMs exhibit high few-shot learning efficiency.The LLMs DeepSeek,ChatGPT,and GLM can achieve disaster classification with small sample sizes,and their performance shows a positive correlation with data scale.Among them,ChatGPT performs best,achieving a macro-and micro-averaged F1 score of up to 0.89.②The deep learning benchmark model performs robustly with larger sample sizes.BERT-TextCNN demonstrates reliable classification capability with sufficient labeled data,achieving a weighted F1 score of 0.86.③LLMs have a significant data efficiency advantage.With only 450 labeled samples,DeepSeek and ChatGPT achieve comprehensive performance superior to the BERT-TextCNN model trained on 5 600 samples,significantly reducing the cost of manual annotation.This study can provide a reference for urban waterlogging disaster information extraction and emergency decision support.

关键词

城市内涝/社交媒体/信息提取/BERT-TextCNN/大语言模型

Key words

urban waterlogging/social media/information extraction/BERT-TextCNN/large language models

分类

天文与地球科学

引用本文复制引用

杨晨,李松涛,刘颖,张会,李硕..基于社交媒体与大模型的城市内涝灾情信息提取研究[J].中国防汛抗旱,2026,36(5):7-14,8.

基金项目

河南省科技攻关项目(252102321017) (252102321017)

河南省高校重点科研项目计划基础研究专项(26ZX019). (26ZX019)

中国防汛抗旱

1673-9264

访问量0
|
下载量0
段落导航相关论文