数据与计算发展前沿2021,Vol.3Issue(6):81-97,17.DOI:10.11871/jfdc.issn.2096-742X.2021.06.006
面向科学知识发现的造血干细胞知识图谱构建研究
Generating a Hematopoietic Stem Cell Knowledge Graph for Scientific Knowledge Discovery
摘要
Abstract
[Objective]The hematopoietic stem cell (HSC) is one kind of the most effective stem cells for clinical treatments.It is of great significance to discover important knowledge entities,knowledge relations,and knowledge paths by literature mining for HSC knowledge discovery.Knowledge graph (KG),which represents knowledge entities and their relations with more details in a simple manner is widely used in scientific knowledge discovery (SKD).[Methods]This paper proposes a framework of generating KG using Subject-Predicate-Object (SPO) triples from literature,which includes six processes:literature retrieval.SPO extracting,SPO cleanup,SPO ranking,discovery pattern integrating,and graph building.Then,an HSC KG was constructed based on the Ne04j graph database following the framework.Finally,three kinds of SKD scenarios using HSC KG are introduced by empirical analysis.[Results]The results show that HSC KG has the advantages of "using graph data structure","integrating discovery patterns","fusing native graph mining algorithms".and "easy to use",which can effectively support deep open discovery,close discovery,and topic discovery in HSC.关键词
知识图谱/SPO三元组/科学知识发现/文献挖掘/造血干细胞Key words
knowledge graph/SPO triple/scientific knowledge discovery/literature mining/hematopoietic stem cell引用本文复制引用
胡正银,刘蕾蕾,陈文杰,刘春江,钱力,宋亦兵..面向科学知识发现的造血干细胞知识图谱构建研究[J].数据与计算发展前沿,2021,3(6):81-97,17.基金项目
National Key Research and Development Program "Application demonstration of comprehensive science and technology services for typical industries in Pearl River Delta Urban Agglomeration" (Grant No:2018YFB1404205) (Grant No:2018YFB1404205)
the Ministry of Science and Technology Innovation Methods Special Project (Grant No:2019IM020100) (Grant No:2019IM020100)