| 注册
首页|期刊导航|软件导刊|基于K-BERT-LDA的层级多标签招标标段分类方法

基于K-BERT-LDA的层级多标签招标标段分类方法

侯继辉 吴小忠 刘晖 夏卓群 梁涤青 邱涵 徐嘉慧

软件导刊2024,Vol.23Issue(12):66-74,9.
软件导刊2024,Vol.23Issue(12):66-74,9.DOI:10.11907/rjdk.241778

基于K-BERT-LDA的层级多标签招标标段分类方法

A Hierarchical Multi-Label Bidding Section Classification Method Based on K-BERT-LDA

侯继辉 1吴小忠 1刘晖 1夏卓群 2梁涤青 2邱涵 2徐嘉慧2

作者信息

  • 1. 湖南湘能创业项目管理有限公司,湖南 长沙 410221
  • 2. 长沙理工大学 计算机与通信工程学院,湖南 长沙 410000
  • 折叠

摘要

Abstract

Traditional manual bidding has low efficiency and accuracy in dividing bids.A hierarchical multi label text classification method based on K-BERT-LDA is proposed for material bidding texts with sparse semantic features and obvious hierarchical structure of labels.First-ly,text features are extracted through a hybrid model.The K-BERT model extracts text features with knowledge injection to compensate for se-mantic information gaps.The LDA topic model extracts topic distribution features and further enriches the text feature representation through feature fusion.Secondly,joint embedding of category labels,where the prediction results of upper level labels can guide lower level classifica-tion and fully utilize the tree structure relationship between labels to improve the accuracy of multi label text classification.Finally,an intelli-gent processing strategy based on text similarity algorithm is proposed to ensure the success rate of bidding and obtain bidding results by merg-ing sections with insufficient pre investment amounts.The experiment shows that the proposed method has better classification performance than other classification methods and single model,and the accuracy,precision and F1 value reach 95.45%,92.57%and 91.88%respectively,which can effectively and accurately achieve the goal of intelligent classification.

关键词

招标分标/层级多标签文本分类/知识注入/主题分布/特征融合/文本相似度

Key words

bidding division/hierarchical multi-label text classification/knowledge injection/topic distribution/feature fusion/text simi-larity

分类

信息技术与安全科学

引用本文复制引用

侯继辉,吴小忠,刘晖,夏卓群,梁涤青,邱涵,徐嘉慧..基于K-BERT-LDA的层级多标签招标标段分类方法[J].软件导刊,2024,23(12):66-74,9.

基金项目

湖南省自然科学基金项目(2023JJ30052) (2023JJ30052)

软件导刊

1672-7800

访问量0
|
下载量0
段落导航相关论文