现代教育技术2026,Vol.36Issue(5):16-26,11.DOI:10.3969/j.issn.1009-8097.2026.05.002
教育研究中应用AI合成数据的机遇与挑战
Opportunities and Challenges of Applying AI Synthetic Data in Educational Research
摘要
Abstract
With the prevalence of large language models(LLMs),AI-synthesized data has attracted extensive attention as an innovative tool reshaping the evidence base of educational research.However,this emerging practice,expanded from statistics to educational research,has sparked profound controversies over the changing nature of scientific evidence,with its application boundaries and potential risks remaining unclear.This paper reviewed the evolutionary trajectory of the synthetic data from statistical disclosure to LLM generation,analyzing how LLMs reshaped the generation logic of synthetic data through world models,theory-of-mind simulation and other mechanisms,and systematically explored its application forms across quantitative,qualitative,experimental simulation,evaluative research and other scenarios.Furthermore,it identified core challenges including representational distortion,cognitive mechanism discrepancies,inadequate ethical norms,and difficulties in quality assessment.This study highlighted the context dependency of the effective application of synthetic data,and called for constructing a new epistemological system adapted to human-machine collaborative research to promote the prudent and responsible application of this emerging tool.关键词
合成数据/教育研究/大语言模型/数据生成/伦理Key words
synthetic data/educational research/large language models/data generation/ethics分类
社会科学引用本文复制引用
褚乐阳,仇星月..教育研究中应用AI合成数据的机遇与挑战[J].现代教育技术,2026,36(5):16-26,11.基金项目
本文为2025年度广东省哲学社会科学规划青年项目"基于大语言模型的跨学科教学设计的理论与实践研究"(GD25YJY39)的阶段性研究成果. (GD25YJY39)