首页|期刊导航|通信学报|模型互联网中基于自我效能的Token级多模型协作

模型互联网中基于自我效能的Token级多模型协作

王建辉李哲涛石伟凡王泽平郑智润李成新

通信学报2026，Vol.47Issue(2)：125-139,15.

通信学报2026，Vol.47Issue(2)：125-139,15.DOI:10.11959/j.issn.1000-436x.2026033

模型互联网中基于自我效能的Token级多模型协作

Token-level multi-model collaboration based on self-efficacy in AI-model network

王建辉 ¹李哲涛 ¹石伟凡 ¹王泽平 ¹郑智润 ²李成新³

作者信息

1. 暨南大学信息科学技术学院,广东广州 510632
2. 亚洲大学人工智能系,韩国水原 16499
3. 湘潭大学数学与计算科学学院,湖南湘潭 411105
折叠

摘要

Abstract

To address the trade-off between inference performance and cost in Token-level collaboration within the AI-model network,a self-efficacy-based Token-level multi-model collaboration method named ConfiPara was proposed.Firstly,a Token-level collaborative method with an exit mechanism was designed to mitigate the high overhead of exis-ting approaches.Secondly,a self-efficacy assessment algorithm integrating the base model's confidence and reliability was introduced to determine the optimal exit timing.By leveraging self-efficacy to guide the base model in switching to independent inference at appropriate moments,redundant collaboration was skipped,thereby maintaining accuracy while reducing Token overhead.Experimental results demonstrate that the proposed ConfiPara method achieves a substantial reduction in Token consumption and inference latency with only a minor accuracy loss.In a single collaborative model scenario,the method reduces Token cost by approximately 21%and cuts per-Token generation latency by up to 75%,at the cost of only a 2.5%drop in accuracy.

关键词

大模型/模型互联网/Token级模型协作/退出机制/自我效能

Key words

large model/AI-model network/Token-level model collaboration/exit mechanism/self-efficacy

分类

信息技术与安全科学

引用本文复制引用

王建辉,李哲涛,石伟凡,王泽平,郑智润,李成新..模型互联网中基于自我效能的Token级多模型协作[J].通信学报,2026,47(2):125-139,15.

基金项目

国家自然科学基金资助项目(No.W2411053,No.U23B2027) The National Natural Science Foundation of China(No.W2411053,No.U23B2027) （No.W2411053,No.U23B2027）

通信学报

ISSN：1000-436X

访问量0

下载量0

段落导航