现代信息科技2025,Vol.9Issue(8):106-110,116,6.DOI:10.19850/j.cnki.2096-4706.2025.08.020
基于深度学习的网页内容解析方法
Web Content Parsing Method Based on Deep Learning
摘要
Abstract
In order to extract valuable information from Web pages efficiently and accurately,this paper proposes a Web content parsing method based on Deep Learning.This method aims to extract text information from complex Hyper Text Markup Language(HTML).This method combines the feature extraction ability of Deep Learning,Natural Language Processing technology and layout information in HTML documents to construct a Multi-Layer Neural Network model,so as to realize the recognition of Web content.The experimental results show that compared with the traditional Web content extraction method based on text density,this method has obvious advantages in accuracy,adaptability and robustness.关键词
网页内容解析/深度学习/神经网络/自适应性Key words
Web content parsing/Deep Learning/Neural Network/adaptability分类
信息技术与安全科学引用本文复制引用
袁公萍,谢红韬,舒珏淋,周维..基于深度学习的网页内容解析方法[J].现代信息科技,2025,9(8):106-110,116,6.基金项目
国家自然科学基金-面向公共安全的场景智能感知与异常行为预警(U20B2069) (U20B2069)