| 注册
首页|期刊导航|计算机与现代化|面向下游任务优化的不平衡数据重采样

面向下游任务优化的不平衡数据重采样

郭华

计算机与现代化Issue(2):28-32,51,6.
计算机与现代化Issue(2):28-32,51,6.DOI:10.3969/j.issn.1006-2475.2025.02.004

面向下游任务优化的不平衡数据重采样

Resampling of Imbalanced Data for Optimizing Downstream Tasks

郭华1

作者信息

  • 1. 中国石油大学(华东)计算机科学与技术学院,山东 青岛 266580
  • 折叠

摘要

Abstract

Data resampling is a key method for correcting imbalanced dataset.Traditional methods construct balanced samples by minimizing geometric errors in the sample space,but they perform poorly in high-dimensional space with complex distribu-tion patterns.Moreover,relying on statistical features lacks specificity for downstream tasks.To address this issue,this paper presents Sampling for Optimizing Downstream Neural Network(SOD-NN),a neural network for data sampling.This approach utilizes the ability of neural networks for nonlinear processing to identify the distribution characteristics of high-dimensional samples.It combines with downstream tasks to create a two-stage network,enabling overall optimization,thereby enhancing the model's capability to meet the requirements of downstream tasks effectively.Specifically,the dataset is first divided spatially during sampling.Residual processing of sample subsets is then applied to prevent data degradation.Subsequently,a self-attention mechanism is utilized to construct global feature,ensuring consistency with the original sample distribution.Experimen-tal results indicate that the model proposed in this paper significantly improves the recognition performance of minority class samples in downstream classification tasks,enhancing the robustness of processing these tasks.

关键词

数据重采样/样本不平衡/自适应采样网络/自注意力机制

Key words

data resampling/imbalanced data/adaptive sampling network/self-attention mechanism

分类

计算机与自动化

引用本文复制引用

郭华..面向下游任务优化的不平衡数据重采样[J].计算机与现代化,2025,(2):28-32,51,6.

基金项目

山东省自然科学基金资助项目(ZR2020MF140) (ZR2020MF140)

计算机与现代化

1006-2475

访问量0
|
下载量0
段落导航相关论文