Computer and Modernization (计算机与现代化), Issue (7): 21-27, 7. DOI: 10.3969/j.issn.1006-2475.2025.07.004
Application of Multimodal Large Language Models in Diagnosis of Pigmented Skin Lesions
Abstract
Accurate diagnosis of pigmented skin lesions is a complex and challenging task. In contemporary medical practice, intelligent diagnostic tools can significantly enhance the precision of both diagnosis and treatment. This study proposes a multimodal large language model, SkinCPM-V, to address diagnostic challenges associated with textural patterns, hair artifacts, and vascular structures in dermoscopic images. SkinCPM-V is built on MiniCPM-V and customized for the characteristics of skin lesions. It was trained on publicly available dermatological datasets from Kaggle, using the LoRA technique for parameter-efficient fine-tuning. Evaluation shows that SkinCPM-V achieves BLEU-4, ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.8880, 0.9380, 0.9104, and 0.9349, respectively, indicating a high level of alignment between generated outputs and reference standards. Additionally, the model's effectiveness in real-world diagnostic tasks is validated by an F1 score of 0.9067, a precision of 0.9028, and a recall of 0.9444. Compared to other multimodal large language models, SkinCPM-V achieves superior results across all evaluation metrics. This highlights its ability to generate high-quality textual descriptions and underscores its potential for integration into clinical workflows. The findings validate the utility of SkinCPM-V in the diagnosis of pigmented skin lesions and pave the way for broader applications of multimodal large language models in medical domains.
Key words
pigmented skin lesions / automated diagnosis / multimodal large language model / dermoscopic diagnosis / parameter-efficient fine-tuning / model evaluation
Classification
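Information Technology and Security Science
Cite this article
Sun Kaijie (孙凯杰), Hu Jili (胡继礼). Application of Multimodal Large Language Models in Diagnosis of Pigmented Skin Lesions[J]. Computer and Modernization, 2025, (7): 21-27, 7.
Funding
Key Scientific Research Project of Anhui Provincial Universities (2024AH050917)
Anhui Provincial Teaching Research Project (2021jyxm0822)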