中国科学数据(中英文网络版)2026,Vol.11Issue(1):82-96,15.DOI:10.11922/11-6035.csd.2024.0171.zh
面向甘肃旅游大模型的知识注入数据集
A dataset of knowledge injection for Gansu tourism large models
摘要
Abstract
With the digital transformation of the tourism industry,intelligent services have become an important part of modern travel.However,existing recommendation and Q&A systems remain limited in terms of knowledge coverage and reasoning capabilities,making it difficult to meet diverse user demands.To this end,this study constructs a high-quality knowledge-injection dataset for tourism-oriented large models,focusing on the Gansu region and covering the three core dimensions:attractions,hotels,and cuisine.The dataset was constructed through multi-platform data collection,manual curation and strict quality control procedures,including data source selection,consistency checking and manual review,to ensure the accuracy,comprehensiveness and relevance of the data.After processing,the final dataset comprises 810,793 triples,30,045 entities,and 14 relationship types,organized in the form of a knowledge graph.To facilitate intuitive display and exploration of multi-dimensional data associations,Neo4j is used as the storage and visualization tool.The release of this dataset provides strong data support for smart tourism and promote knowledge sharing and the development of intelligent systems in tourism domain.关键词
知识图谱/旅游大模型/数据集构建/旅游推荐/智能问答Key words
knowledge graph/tourism large model/dataset construction/tourism recommendation/intelligent Q&A引用本文复制引用
陈敏,朱登赟,万福成,于洪志,卢保青..面向甘肃旅游大模型的知识注入数据集[J].中国科学数据(中英文网络版),2026,11(1):82-96,15.基金项目
国家自然科学基金(62366046) (62366046)
甘肃省基础研究创新群体项目(24JRRA154) (24JRRA154)
2025年甘肃省高校研究生"创新之星"项目(2025CXZX-243). National Natural Science Foundation of China(62366046) (2025CXZX-243)
Gansu Province Basic Research Innovation Group Project(24JRRA154) (24JRRA154)
Gansu Provincial University Graduate Students'Innovation Star'Project in 2025(2025CXZX-243). (2025CXZX-243)