
A Comprehensive Evaluation of the Plastid DNA Data Gaps of Vascular Plants in Species and Geographic Area

(Original title: 维管植物质体DNA数据在物种和区域上的空缺研究)

Deng Yan (邓言) 1, Lu Limin (鲁丽敏) 2, Zhang Qiang (张强) 3, Chen Zhiduan (陈之端) 2, Hu Haihua (胡海花) 2

Chinese Bulletin of Botany (植物学报), 2025, Vol. 60, Issue 1: 1-16. DOI: 10.11983/CBB24034

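Author information

  • 1. College of Life Sciences, Guangxi Normal University, Guilin 541006; State Key Laboratory of Plant Diversity and Specialty Crops, Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093
  • 2. State Key Laboratory of Plant Diversity and Specialty Crops, Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093; China National Botanical Garden, Beijing 100093
  • 3. Guangxi Key Laboratory of Plant Conservation and Restoration Ecology in Karst Terrain, Guangxi Institute of Botany, Guangxi Zhuang Autonomous Region and Chinese Academy of Sciences, Guilin 541006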

Abstract

INTRODUCTION: Molecular data are among the most important foundations for many biological studies, including phylogenetics, ecology, and biogeography. Incomplete sampling may lead to biased results and unsupported conclusions. However, few studies have comprehensively evaluated the current sampling density of sequenced DNA data. Plastid DNA sequences have been widely used in plant research because of their easy accessibility, uniparental inheritance, and moderate mutation rate. It is therefore essential to investigate the current sampling density of plastid DNA data across species and geographic areas so that researchers can make better use of these data.
RATIONALE: GenBank is the largest and most commonly used database of DNA sequence data. In this study, we investigated the gaps in plastid DNA data for vascular plants across species and geographic areas based on the GenBank database. First, the plastid DNA data of vascular plant species were downloaded from GenBank and cleaned. Second, species names were standardized according to the World Checklist of Vascular Plants (WCVP) database. Third, to evaluate the current sampling density, we counted the number of species with sequenced plastid DNA and calculated the proportion of missing data for lineages at the order and family levels. We also mapped the proportion of missing data in each region to evaluate sampling density geographically. To further investigate the factors that may influence the plastid DNA data gap, we calculated Spearman's correlations between the proportion of missing data and species diversity among major groups of vascular plants and among regions.
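
The counting and correlation steps described in the RATIONALE can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the toy checklist, and the placeholder family and species names are all hypothetical, and the rank correlation is written out in plain Python (Pearson correlation of average ranks) rather than taken from any particular statistics package.

```python
from collections import defaultdict


def missing_proportion(checklist, sequenced):
    """Proportion of species per lineage that lack a sequence record.

    checklist: iterable of (lineage, species) pairs from a standardized checklist.
    sequenced: set of species names that have at least one record.
    """
    total = defaultdict(int)
    missing = defaultdict(int)
    for lineage, species in checklist:
        total[lineage] += 1
        if species not in sequenced:
            missing[lineage] += 1
    return {lineage: missing[lineage] / total[lineage] for lineage in total}


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


# Toy data with placeholder names (hypothetical, not from the study).
checklist = [
    ("FamilyA", "a1"), ("FamilyA", "a2"), ("FamilyA", "a3"), ("FamilyA", "a4"),
    ("FamilyB", "b1"), ("FamilyB", "b2"),
    ("FamilyC", "c1"),
]
sequenced = {"a1", "b1", "c1"}

gaps = missing_proportion(checklist, sequenced)
richness = {"FamilyA": 4, "FamilyB": 2, "FamilyC": 1}
families = sorted(gaps)
rho = spearman([richness[f] for f in families], [gaps[f] for f in families])
```

In this toy example the most species-rich placeholder family also has the largest gap, so `rho` is positive. With real data the same two functions would be applied to the standardized species list, once grouped by order or family and once grouped by region.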
RESULTS: Only 33.75% of vascular plant species have at least one DNA record in GenBank, covering 139 005 vascular plant species (angiosperms: 131 220 species; gymnosperms: 1 154 species; pteridophytes: 6 631 species). Regarding the data gap across species, sequenced species were unevenly sampled among lineages, and the proportion of missing data was generally correlated with species richness within lineages. The three orders with the highest proportion of missing data were Paracryphiales, Piperales, and Dilleniales, and the three families with the highest proportion were Triuridaceae, Pentaphragmataceae, and Xyridaceae. Regarding the data gap across geographic areas, the proportion of missing plastid DNA data for vascular plant species showed a latitudinal gradient, decreasing from the equator to the poles. Regions with a high proportion of missing data usually have high biodiversity and include many biodiversity hotspots. In addition, endemic species generally had a high proportion of missing data in most regions.
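
The species counts reported in the RESULTS can be checked with a few lines of arithmetic. The sketch below confirms that the three group counts sum to the stated total. It also derives an implied size for the full species checklist. That implied figure is not stated in the abstract; it is only a back-of-the-envelope estimate that assumes the 33.75% coverage applies to the same checklist as the 139 005 sequenced species.

```python
# Counts of sequenced species per group, as reported in the abstract.
groups = {
    "angiosperms": 131_220,
    "gymnosperms": 1_154,
    "pteridophytes": 6_631,
}

# The group counts should add up to the reported total of 139 005 species.
sequenced_total = sum(groups.values())

# Reported share of vascular plant species with at least one record.
coverage = 0.3375

# Assumption: coverage = sequenced_total / checklist_size, so the checklist
# size can be estimated by division. This number is derived, not reported.
implied_checklist_size = sequenced_total / coverage

# Share of each group among the sequenced species.
shares = {name: count / sequenced_total for name, count in groups.items()}

print(sequenced_total)
print(round(implied_checklist_size))
for name, share in shares.items():
    print(f"{name}: {share:.2%}")
```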
CONCLUSION: Our research comprehensively evaluated the current sampling density of plastid DNA data across species and geographic areas. Our results suggest that about 140 000 vascular plant species have had their plastid DNA sequenced. However, large gaps remain in the plastid DNA data of vascular plants in three respects: (1) only about one third of vascular plant species have been sequenced; (2) the ratio of species with sequenced plastid DNA is uneven among lineages; (3) the proportion of missing data decreases from the equator to the poles, with greater deficiencies in biodiversity hotspots and among endemic species. Based on these results, we propose giving priority to the collection and sequencing of vascular plants in groups with a high proportion of missing data and in regions with high biodiversity, particularly endemic species. Our research indicates where the plastid DNA data gap should be filled and will benefit biodiversity protection.

Key words

plastid DNA / vascular plants / missing data / plant big data / GenBank

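Cite this article

Deng Yan, Lu Limin, Zhang Qiang, Chen Zhiduan, Hu Haihua. A Comprehensive Evaluation of the Plastid DNA Data Gaps of Vascular Plants in Species and Geographic Area [J]. Chinese Bulletin of Botany, 2025, 60(1): 1-16.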

Funding

National Natural Science Foundation of China (No. 32200190, No. 32122009)

Chinese Bulletin of Botany (植物学报)

Open access; Peking University Core Journal (北大核心)

ISSN 1674-3466
