Mini-InternVL: a flexible-transfer pocket multi-modal model with 5% parameters and 90% performance
Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang
Visual Intelligence, 2024, Vol. 2, Issue 1: pp. 392-408 (17 pages). DOI: 10.1007/s44267-024-00067-6
Abstract
Keywords
Lightweight multi-modal large language model / Vision-language model / Knowledge distillation / Visual instruction tuning
Classification
Information technology and security science
Cite this article
Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang. Mini-InternVL: a flexible-transfer pocket multi-modal model with 5% parameters and 90% performance [J]. Visual Intelligence, 2024, 2(1): 392-408, 17.
Funding
Supported by the National Key R&D Program of China (Nos. 2022ZD0160102 and 2022ZD0161300) and the National Natural Science Foundation of China (Nos. 62376134 and 62372223).