首页|期刊导航|现代电子技术|带有半渐进式分层提取机制的轻量化多任务模型

带有半渐进式分层提取机制的轻量化多任务模型

杨程车文刚

现代电子技术2024，Vol.47Issue(3)：18-24,7.

现代电子技术2024，Vol.47Issue(3)：18-24,7.DOI:10.16652/j.issn.1004-373x.2024.03.004

带有半渐进式分层提取机制的轻量化多任务模型

Lightweight multi-tasking learning model with semi-progressive layered extraction mechanism

杨程 ¹车文刚¹

作者信息

1. 昆明理工大学信息工程与自动化学院, 云南昆明 650500
折叠

摘要

Abstract

Currently,the multi-tasking learning model is widely used in many fields.However,most models with better effect consist of complex network layers and architectures,which makes it difficult for these multi-task learning models to be applied to resource-limited devices,for example,countries or regions with limited funds but large population bases carry out census prediction,portable devices carry out translation activities and other tasks.In view of the above,a lightweight multi-tasking learning model with semi-progressive layered extraction mechanism is proposed.In the model,the top-level Expert module of the specific task is pruned first and the work originally responsible for extracting the in-depth information of each specific task is entrusted to the Tower module of each task.This approach makes the model lightweight and,at the same time,retains the characteristics of separating the shared parameters of the task and the unique parameters of the task and extracting information hierarchically.Inspired by uncertainty to weigh losses,the dynamic joint loss is optimized to compensate for the decrease in the performance and accuracy of the model after pruning.The model can predict the importance of tasks continuously and adjust the weight of each task.Some hyperparameters are also tuned.The evaluation of the model on the UCI Census-Income public dataset finally proves that the model has the same performance as that before lightweight.

关键词

多任务学习/渐进式分层提取/轻量化/不确定性损失权重/联合损失优化/UCI

Key words

multi-tasking learning/progressive layered extraction/lightweight/uncertainty to weigh losses/joint loss optimization/UCI

分类

信息技术与安全科学

引用本文复制引用

杨程,车文刚..带有半渐进式分层提取机制的轻量化多任务模型[J].现代电子技术,2024,47(3):18-24,7.

现代电子技术

OA北大核心CSTPCD

ISSN：1004-373X

访问量0

下载量0

段落导航