Application of Chinese large language models to the automatic labelling of subject knowledge graphs: A case study of morality and law and mathematics
Open access; indexed in the Peking University Core Journals list (北大核心) and CSTPCD
With the rapid development of artificial intelligence, large language models (LLMs) have demonstrated strong capabilities in natural language processing and a range of knowledge applications. This study examines the application of Chinese LLMs to the automatic labelling of knowledge graphs for primary and secondary school subjects, taking morality and law at the compulsory education stage and high school mathematics as examples. In education, knowledge graphs are important for organizing and systematizing subject knowledge; however, traditional construction methods suffer from low efficiency and high manual labor costs in data labelling. This study aims to address these problems with LLMs and thereby raise the level of automation and intelligence in knowledge graph construction. First, the research background and significance are discussed. Second, the development status of Chinese LLMs and of automatic labelling techniques for subject knowledge graphs is reviewed. In the methods and model section, an automatic labelling method based on Chinese LLMs is explored to improve their application to subject knowledge graphs; a manual labelling method for subject knowledge graphs is also described and serves as the baseline for evaluating the actual effect of automatic labelling. In the experiment and analysis section, automatic labelling experiments on morality and law and on mathematics show that automatic labelling achieved high accuracy and efficiency in both subjects; the results are compared in depth with the manual labelling results, yielding a series of conclusions and verifying the effectiveness and accuracy of the proposed method. Finally, future research directions are discussed. Overall, the study provides a new approach to the automatic labelling of subject knowledge graphs, which is expected to promote further development in related fields.
KOU Sijia (寇思佳); YAN Fengyun (闫凤云); MA Jing (马晶)
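Center for Educational Technology and Resource Development, Ministry of Education (National Center for Educational Technology), Beijing 100031
Yanqing District Educational Science Research Center, Beijing 102100
The Affiliated High School of Peking University, Beijing 100190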
Education
large language models (LLMs); knowledge graph; automatic labelling; morality and law; mathematics
Journal of East China Normal University (Natural Science), 2024, No. 5
pp. 81-92 (12 pages)
National Key Research and Development Program of China (2023YFC3341200)