计算机工程2024,Vol.50Issue(7):42-52,11.DOI:10.19678/j.issn.1000-3428.0069593
基于大语言模型的个性化实验报告评语自动生成与应用
Personalized Experiment Report Comments Auto-Generation and Application Based on Large Language Models
摘要
Abstract
While reviewing computer experiment reports,assessment systems exhibit diversity and discrepancies.The rigid templates used for evaluation lack personalized content,and the results often fail to provide a basis for interpretability.To address these issues,this study proposes a personalized experiment report comments auto-generation framework based on large language models.The study employs a Theme-Evaluation Decisions-Integrated(T-ED-I)hint strategy to extract a unique evaluation system based on teachers'requirements regarding experiment and code quality.This strategy ultimately builds a shared library of assessment decision trees for computer software courses.It introduces a method for grading experiments and code-quality themes based on large language models and decision trees.By retrieving an evaluation decision tree from the library that matches a student's experiment report and integrating the report and code text,the proposed method auto-generation quantitative or qualitative grading results for the experiment and code quality,along with corresponding interpretative justifications.Finally,personalized evaluation comments are generated by integrating the students'completed experimental tasks,theme grading results,and evaluation bases into a experiment report template.The experimental results show that the decision trees generated using the T-ED-I hint strategy significantly outperform those generated from strategies without hints.Ablation studies confirm the effectiveness and rationality of each component of this strategy.Additionally,when comparing the auto-generation grading results with the original teacher evaluations,the match rate for software engineering,programming,and interdisciplinary courses exceeds 90%.Moreover,teachers'ratings on the auto-generation comments in terms of fluency,relevance,and rationality indicate a high level of quality across these dimensions.关键词
大语言模型/实验评估决策树/个性化/评语自动生成/代码质量评价Key words
large language models/decision trees of experimental evaluation/personalization/comments auto-generation/code quality assessment分类
计算机与自动化引用本文复制引用
翟洁,李艳豪,李彬彬,郭卫斌..基于大语言模型的个性化实验报告评语自动生成与应用[J].计算机工程,2024,50(7):42-52,11.基金项目
上海高校市级重点课程建设项目(沪教委高[2022]27号) (沪教委高[2022]27号)
上海市教育委员会课题项目 ()
教育部-华为"智能基座"产教融合协同育人基地一流课程项目. ()