Analysis of distant-water fishing vessel safety issues based on text classification and knowledge mining

Liu Shuang, Ding Zhe, Lü Chao, Zhu Shanshan

Transactions of the Chinese Society of Agricultural Engineering, 2023, Vol. 39, Issue 24: 215-223 (9 pages). DOI: 10.11975/j.issn.1002-6819.202306095

Evaluating the safety of distant-water fishing vessels using text classification and knowledge mining

Liu Shuang (1), Ding Zhe (1), Lü Chao (2), Zhu Shanshan (1)

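Author information

  • 1. College of Engineering, Shanghai Ocean University, Shanghai 201306
  • 2. College of Economics and Management, Shanghai Ocean University, Shanghai 201306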

Abstract

Potential knowledge can be extracted from the safety texts of distant-water fishing (DWF) vessels. However, existing approaches are not yet well developed for fishing vessel safety texts, and challenges remain, such as low text classification accuracy and insufficient depth of knowledge extraction. In this study, an analytical approach combining text classification, knowledge mining, and co-occurrence network technology was proposed under the Cape Town Agreement (CTA) of 2012. Text data on DWF vessel safety were collected from fishery management organizations, associations, and over 20 fishery enterprises in eight Chinese coastal provinces and cities, including Zhejiang, Shanghai, and Fujian. The resulting DWF vessel safety corpus consisted of more than 5,000 valid questions and 100,000 characters. The analytical approach comprised three stages. Firstly, a hybrid deep learning model, bidirectional encoder representations from transformers-text convolutional neural network (BERT-TextCNN), was developed to suit the characteristics of DWF vessel safety text, such as diverse data types, sparse data features, and fuzzy boundaries. During text representation, BERT generated character vectors that capture the contextual semantic and deep syntactic information of the text. Multiple convolutional kernels of TextCNN then spatially modeled the generated character vectors and extracted local features for accurate classification of safety themes. Secondly, term frequency-inverse document frequency (TF-IDF) was employed to extract the key safety knowledge of fishing vessels, considering the importance and prevalence of knowledge within each safety theme. Finally, a co-occurrence network was constructed to visualize the safety knowledge of fishing vessels, including its distributional patterns and interconnections. The results show that the BERT-TextCNN model achieved an accuracy, macro-average recall, and macro-average F1 value of 98.20%, 98.02%, and 98.05%, respectively. It outperformed the other 17 comparison models, which combined three text representations (BERT, Word2vec, and character embedding) with six neural networks (TextCNN, Softmax, DPCNN, BiLSTM-Attention, RCNN, and Transformer). Meanwhile, the theme-based knowledge mining and analytical approach produced clear rankings of DWF vessel compliance and safety management knowledge, as well as relationship networks across ten safety knowledge themes of fishing vessels: provisions, structure, stability, electrical installations, fire protection, crew protection, life-saving equipment, emergency procedures, wireless communication, and shipborne navigation equipment. The approach yields intelligent safety knowledge services and decision-making tools for improving the compliance level and safety management efficiency in DWF. The findings can provide a strong reference for promoting the application and development of knowledge service systems and the smart fishing industry.
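
The second and third stages described in the abstract (TF-IDF keyword ranking per safety theme, then a co-occurrence network) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy English corpus, the function names `tfidf_by_theme`, `top_terms`, and `cooccurrence_edges`, the choice to treat each theme as one document, and the unsmoothed TF-IDF formula. The abstract does not state the authors' exact formulation, tokenization, or network construction, and the original corpus is Chinese text.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical toy corpus: safety theme -> tokenized safety questions.
corpus = {
    "fire protection": [
        ["fire", "extinguisher", "inspection"],
        ["fire", "extinguisher", "engine", "room"],
    ],
    "life-saving equipment": [
        ["lifejacket", "inspection"],
        ["liferaft", "lifejacket", "drill"],
    ],
}


def tfidf_by_theme(corpus):
    """Score terms per theme, treating each theme as one document.

    Assumed formula: tf = count / theme length, idf = ln(N / df).
    """
    theme_counts = {
        theme: Counter(tok for doc in docs for tok in doc)
        for theme, docs in corpus.items()
    }
    n_themes = len(theme_counts)
    # Document frequency: number of themes in which each term appears.
    df = Counter()
    for counts in theme_counts.values():
        df.update(counts.keys())
    scores = {}
    for theme, counts in theme_counts.items():
        total = sum(counts.values())
        scores[theme] = {
            term: (count / total) * math.log(n_themes / df[term])
            for term, count in counts.items()
        }
    return scores


def top_terms(theme_scores, k):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    return sorted(theme_scores, key=lambda t: (-theme_scores[t], t))[:k]


def cooccurrence_edges(docs):
    """Count how many questions contain each unordered pair of terms."""
    edges = Counter()
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            edges[(a, b)] += 1
    return edges


scores = tfidf_by_theme(corpus)
for theme in corpus:
    print(theme, top_terms(scores[theme], 2))
print(cooccurrence_edges(corpus["fire protection"]).most_common(3))
```

In this sketch a term shared by every theme, such as "inspection", scores zero, which is one way to read the abstract's point about weighing a term's importance within a theme against its prevalence. The edge counts could serve as weights when drawing a co-occurrence network.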

Key words

fishing vessels/safety/text classification/analytical approach/knowledge mining/Cape Town Agreement

Classification

Agricultural science and technology

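Cite this article

Liu Shuang, Ding Zhe, Lü Chao, Zhu Shanshan. Evaluating the safety of distant-water fishing vessels using text classification and knowledge mining[J]. Transactions of the Chinese Society of Agricultural Engineering, 2023, 39(24): 215-223.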

Funding

Ministry of Agriculture and Rural Affairs financial project (D8021210076)

Transactions of the Chinese Society of Agricultural Engineering

Open access; indexed in PKU Core, CSCD, CSTPCD

ISSN 1002-6819
