| 注册
首页|期刊导航|现代教育技术|大语言模型作文评价反馈质量的实证分析

大语言模型作文评价反馈质量的实证分析

瞿锦雯 叶丽新 孙潘懿 徐悦 吴蓓蕾

现代教育技术2026,Vol.36Issue(3):62-71,10.
现代教育技术2026,Vol.36Issue(3):62-71,10.DOI:10.3969/j.issn.1009-8097.2026.03.007

大语言模型作文评价反馈质量的实证分析

An Empirical Analysis on the Feedback Quality in Essay Evaluation by Large Language Models

瞿锦雯 1叶丽新 1孙潘懿 1徐悦 1吴蓓蕾1

作者信息

  • 1. 华东师范大学 中国语言文学系,上海 200241
  • 折叠

摘要

Abstract

At present,large language model(LLM)povide an optimized opportunity for intelligent essay evaluation with their with advanced text processing capabilities.However,their feedback quality and semantic analysis ability remain insufficient.Issues such as superficial theme and over-polishing often reveal that LLM feedback cannot yet meet the core evaluation requirements of Chinese essay focusing on theme and expression.Therefore,it is necessary to evaluate their feedback quality.Therefore,this paper first developed a quality evaluation standard for LLM essay evaluation feedback through literature review and expert consultation.Ten LLM were used to evaluate 50 narrative and 50 argumentative essays respectively according to the given essay scoring criteria.Then,the paper scored the LLM feedback based on the standard,calculated inter-rater reliability,and conducted descriptive analysis and qualitative summary of the results.The study found out that LLM's feedback had room for improvement in stylistic stability,richness and in-depth understanding;LLM's feedback lacked teacher-like abilities in guiding thinking,critical evaluation and judging linguistic literary grace;LLM was good at providing feedback on theme,material,structure,emotion and ideology;LLM's feedback showed a certain degree of disciplinary professionalism.Finally,suggestions were put forward based on the findings,aiming to provide a reference for promoting the in-depth integration of LLM and Chinese language essay evaluation.

关键词

大语言模型/作文评价/反馈质量/智能作文评价

Key words

large language model/essay evaluation/feedback quality/automated writing evaluation

分类

社会科学

引用本文复制引用

瞿锦雯,叶丽新,孙潘懿,徐悦,吴蓓蕾..大语言模型作文评价反馈质量的实证分析[J].现代教育技术,2026,36(3):62-71,10.

基金项目

本文为中央高校基本科研业务费项目华东师范大学哲学社会科学"数智化语文教育创新团队"项目(项目编号:2024QKT004)的阶段性研究成果. (项目编号:2024QKT004)

现代教育技术

1009-8097

访问量0
|
下载量0
段落导航相关论文