中国水产科学2025,Vol.32Issue(6):753-765,13.DOI:10.12264/JFSC2025-0004
淡水定居型刀鲚无间隙基因组组装
Gap-free genome assembly of freshwater resident Coilia nasus
摘要
Abstract
As one of the ecotypes of anadromous Coilia nasus,the freshwater resident Coilia nasus taihuensis is unable to conduct anadromous behavior due to the existence of physical barriers.C.nasus taihuensis can grow and reproduce independently in freshwater habitats such as Lake Taihu.Genetic differences in the genome have been demonstrated between anadromous C.nasus and non-anadromous C.nasus taihuensis due to geographical isolation and the significant difference in living environment.The genetic mechanisms related to environmental adaptation of C.nasus taihuensis were hindered by the lack of reference genomes.To conduct comprehensive and systematic analysis of the genomic characteristics of C.nasus taihuensis,a freshwater resident individual from Lake Taihu was selected as the experimental object.The first complete,gap-free reference genome of C.nasus taihuensis was constructed using a combination of PacBio(Pacific Biosciences)high-fidelity reads and ONT(Oxford Nanopore Technologies)ultra-long reads and Hi-C datasets.Based on a K-mer value of 21,we estimated the genome size as 612.76 Mb with a heterozygosity rate of 1.31%.The results of genome assembly showed a gap-free C.nasus taihuensis genome with an assembly size of 834.09 Mb.A contig N50 of 35.45 Mb was obtained through library construction,sequencing,assembly,chromosome mounting and gaps filling.Compared with the genome of cultured C.nasus,our gap-free genome substantially improved contiguity and completeness,where contig N50 increased from 1.6 Mb to 35.46 Mb.The integrity also increased from 87.1%to 91.9%,representing the highest quality genome.By Hi-C data,the assembled sequences were anchored and oriented onto all the 24 chromosomes with a total length of 829.28 Mb,covering 99.83%of the scaffold-level genome.After gap filling using ONT ultra-long reads,all 24 chromosomes were assembled without gaps,representing the highest assembly quality.BUSCO analysis based on the actinopterygii_odb 10 database showed that 91.9%of the expected actinopterygii_odb10 genes(single-copy genes:90%;duplicated genes:1.9%)were identified as complete,suggesting that the assembled C.nasus taihuensis genome is highly complete.Furthermore,a total of 382.393 Mb of repetitive sequence,accounted for 45.85%of the genome.Using a combination of de novo prediction,protein homology and RNA-seq annotation,a total of 21730 protein-coding genes were identified.Approximately 99.71%(21666 genes)of the total predicted genes were assigned with at least functional annotation,showing a more complete annotation.Furthermore,the conservation synteny between C.nasus taihuensis and C.nasus was compared to validate the chromosome assembly,and the results showed that the genomic sequences of C.nasus and C.nasus taihuensis were highly consistent(96.95%).Such highly conserved synteny and strict correspondence of chromosome assignment indicated a close genetic relationship between the two ecotypes.The high-quality gap-free assembled genome of C.nasus taihuensis can provide material for studying the freshwater adaptability of C.nasus and accumulate basic data for further studies on the population genetics of C.nasus.关键词
淡水定居型/刀鲚/无间隙基因组/基因注释/共线性分析Key words
freshwater resident/Coilia nasus/gap-free genome/gene annotation/collinear analysis分类
农业科技引用本文复制引用
马凤娇,王慧,任泷,刘凯..淡水定居型刀鲚无间隙基因组组装[J].中国水产科学,2025,32(6):753-765,13.基金项目
中国水产科学研究院中央级公益性科研院所基本科研业务费专项资金项目(2023TD11,2023TD65) (2023TD11,2023TD65)
江苏省农业农村厅渔业生态与资源监测专项 ()
江苏省研究生科研与实践创新计划项目(KYCX23_0757). (KYCX23_0757)