| 注册
首页|期刊导航|网络安全与数据治理|一种针对垂类模型的综合成效评测框架

一种针对垂类模型的综合成效评测框架

宋元 张衎 任熠辉 黄晓鹏

网络安全与数据治理2025,Vol.44Issue(11):18-23,29,7.
网络安全与数据治理2025,Vol.44Issue(11):18-23,29,7.DOI:10.19358/j.issn.2097-1788.2025.11.004

一种针对垂类模型的综合成效评测框架

A comprehensive effectiveness evaluation framework for domain-specific models

宋元 1张衎 2任熠辉 1黄晓鹏1

作者信息

  • 1. 苏州市人工智能有限公司,江苏 苏州 215100
  • 2. 苏州市人工智能有限公司,江苏 苏州 215100||苏州国际发展集团有限公司,江苏 苏州 215007
  • 折叠

摘要

Abstract

This paper addresses the issues of single evaluation dimensions,lack of domain adaptability,and fragmented methods in the evaluation practice of domain-specific models,and proposes a comprehensive effectiveness evaluation framework.This study aims to address the"evaluation gap"between technology research and development and industrial application through standardized solutions,providing a scientific basis for the development,deployment,and supervision of domain-specific models.The research method includes constructing a multidimensional indicator system centered on security compliance,technical performance,and application value,and designing a supporting evaluation dataset construction strategy and a hybrid evaluation method.The latter integrates automated testing,manual evaluation,and large models as evaluation means.The research results form a structured e-valuation system that covers the classification of evaluation objects,indicator definition,and method implementation,which can a-chieve a comprehensive and comparable evaluation of different types of domain-specific models.The conclusion shows that the framework helps to improve the objectivity and operability of the evaluation and promote the trustworthy application of domain-spe-cific models in key areas.In the future,it will need to be verified in practice and dynamically optimized to adapt to technological development.

关键词

人工智能/垂类模型/模型评测

Key words

artificial intelligence/domain-specific model/model evaluation

分类

计算机与自动化

引用本文复制引用

宋元,张衎,任熠辉,黄晓鹏..一种针对垂类模型的综合成效评测框架[J].网络安全与数据治理,2025,44(11):18-23,29,7.

基金项目

中国博士后科学基金(2024M762322) (2024M762322)

网络安全与数据治理

2097-1788

访问量0
|
下载量0
段落导航相关论文