首页|期刊导航|中国医学装备|ChatGPT-4.0与DeepSeek-V3两种人工智能语言模型在回答近视问题的基准分析比较

ChatGPT-4.0与DeepSeek-V3两种人工智能语言模型在回答近视问题的基准分析比较

姚晶磊李露茜姜慧君 Sun Chen-Hsin 任骁方肖林

中国医学装备2026，Vol.23Issue(3)：86-89,4.

中国医学装备2026，Vol.23Issue(3)：86-89,4.DOI:10.3969/j.issn.1672-8270.2026.03.016

ChatGPT-4.0与DeepSeek-V3两种人工智能语言模型在回答近视问题的基准分析比较

Comparison of benchmarking analysis of ChatGPT-4.0 and DeepSeek-V3 of two kinds of AI language models in response to questions about myopia

姚晶磊 ¹李露茜 ¹姜慧君 ¹Sun Chen-Hsin ²任骁方 ³肖林⁴

作者信息

1. 北京京煤集团总医院眼科北京 102300
2. 新加坡国立大学医院眼科新加坡 119074
3. 首都儿科研究所附属儿童医院眼科北京 100020
4. 首都医科大学附属北京世纪坛医院眼科北京 100038
折叠

摘要

Abstract

Objective:To compare the difference of performance of ChatGPT-4.0 and DeepSeek-V3 of two kinds of artificial intelligence(AI)Chatbots in response to questions about myopia,so as to provide references for application of AI chatbot.Method:From October 2024 to March 2025,a comparative test about two kinds of AI chatbots,namely ChatGPT-4.0 and DeepSeek-V3 of large language model(LLM),was conducted on the responses to questions about myopia at the National University Hospital of Singapore(NUHS)and Beijing Jingmei Group General Hospital of China.The accuracy and comprehensiveness were detected and evaluated by specialists.The content of the myopia question and answer(Q&A)consisted of 30 myopia-related questions in ophthalmic clinical practice,covering six themes about myopia:the pathogenesis,clinical manifestations,diagnosis,treatment,prevention,and prognosis.The evaluation was conducted by storing two kinds of AI chatbots from two aspects including accuracy and comprehensiveness.Results:In terms of accuracy evaluation,11 results(36.7%)of the answers of ChatGPT-4.0 chatbot were detected and evaluated as"good",and 23 results(76.7%)of the answers of DeepSeek-V3 chatbot were detected and evaluated as"good",and the difference of the proportion between two groups was significant(x2=9.791,P<0.05).In terms of the evaluation for comprehensiveness,the comprehensive score of the ChatGPT-4.0 chatbot was(2.44±0.33)points in answering questions,and that of the DeepSeek chatbot was(2.63±0.17)points,and there was not statistically significant difference between them(P>0.05).Conclusion:AI chatbot can provide effective helps about consulting myopia for users.The accuracy of the DeepSeek-V3 chatbot in responding to questions about myopia is superior to that of the ChatGPT-4.0 chatbot.

关键词

近视/ChatGPT-4.0聊天机器人/DeepSeek-V3聊天机器人/大语言模型(LLM)

Key words

Myopia/ChatGPT-4.0 chatbot/DeepSeek-V3 chatbot/Large language model(LLM)

分类

医药卫生

引用本文复制引用

姚晶磊,李露茜,姜慧君,Sun Chen-Hsin,任骁方,肖林..ChatGPT-4.0与DeepSeek-V3两种人工智能语言模型在回答近视问题的基准分析比较[J].中国医学装备,2026,23(3):86-89,4.

基金项目

北京京煤集团总医院院级科研资助项目(ZZ2024-46) Hospital-level Scientific Research Support Project of Beijing Jingmei Group General Hospital(ZZ2024-46) （ZZ2024-46）

中国医学装备

ISSN：1672-8270

访问量0

下载量0

段落导航