Journal of Agricultural Big Data ›› 2025, Vol. 7 ›› Issue (2): 227-237.doi: 10.19788/j.issn.2096-6369.100040

Previous Articles     Next Articles

Variant Site Dataset of 99 Durio zibethinus Germplasm Resources

JI XiaoHao1,2(), ZHENG DaoJun3, XIE ShengHua3, SHI Meng2, ZHONG YiWang3, WANG YingYing2, WANG XiaoDi2, LIU FengZhi2, FENG XueJie3,*(), WANG HaiBo1,2,*()   

  1. 1. National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, Hainan, China
    2. Research Institute of Pomology, Chinese Academy of Agricultural Sciences/Key Laboratory of Horticultural Crops Germplasm Resources Utilization, Ministry of Agriculture and Rural Affairs, Xingcheng 125100, Liaoning, China
    3. Sanya Institute, Hainan Academy of Agricultural Sciences / Institute of Tropical Fruit Trees, Hainan Academy of Agricultural Sciences / Key Laboratory of Genetic Resources Evaluation and Utilization of Tropical Fruits and Vegetables (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs / Key Laboratory of Tropical Fruit Tree Biology of Hainan Province / Haikou Scientific Observation and Experimental Station for Tropical Fruit Trees, Ministry of Agriculture and Rural Affairs/ Hainan Field Scientific Observation and Research Station for Tropical Fruit Trees, Haikou 571100, Hainan, China
  • Received:2024-06-27 Accepted:2024-08-13 Online:2025-06-26 Published:2025-06-23
  • Contact: FENG XueJie, WANG HaiBo

Abstract:

Durian has high economic and nutritional value. In China, the durian industry is highly dependent on imports. The durian industry in Hainan Province is in its infancy, characterized by limited acreage, low yield, complete reliance on introduced varieties, lack of self-sufficiency, and insufficient supporting cultivation techniques. These issues lead to a stark contrast between high market demand and a weak industry. There is an urgent need for the collection, identification, and evaluation of durian germplasm resources. In this study, DNA was extracted from 99 durian germplasm resources. Libraries were constructed, and second-generation whole-genome sequencing was performed. Bioinformatic analyses, including quality control of sequencing data, variant site discovery and annotation, and population evolution studies, were conducted on the sequencing data. The total amount of sequencing data was 1.62 Tb, yielding 54,974,697 variant sites, including SNPs, insertions (INS), and deletions (DEL), with SNPs being the most prevalent. On average, there is one variant site per 13 bases in the durian genome. These variant sites are mainly located in intergenic regions, with fewer in gene exons and introns. The 99 durian resources can be divided into three subgroups. The distance at which the LD coefficient decays to half its maximum value is only 0.1-0.2 kb, indicating rich genetic diversity. This study provides genome sequencing data and variant site information for 99 durian germplasm resources, offering fundamental data support for durian genetics, breeding methods, and breeding theory research. This will aid in the selection and breeding of durian varieties in Hainan and worldwide.

Data summary:

Items Description
Name of dataset Variant Site Dataset of 99 Durio zibethinus Germplasm Resources
Specific subject area Agronomy, biology
Research topic Genetic variation of durian germplasm resources
Time range 2022 - 2023
Temporal resolution one year
Geographical scope Sanya City, Hainan Province, China
Data types and technical formats .XLSX, VCF
Dataset structure This dataset consists of one table and one VCF file, primarily including the quality control results of WGS sequencing data, alignment information, and variant site information.
Volume of dataset 143.36 GB
Data accessibility CSTR:17058.11.sciencedb.agriculture.00077;https://cstr.cn/17058.11.sciencedb.agriculture.00077
DOI:10.57760/sciencedb.agriculture.00077; https://doi.org/10.57760/sciencedb.agriculture.00077
Financial support Nanfan Special Project of the Chinese Academy of Agricultural Sciences(SWAQ09); The Agricultural Science and Technology Innovation Program of the Chinese Academy of Agricultural Sciences (CAAS-ASTIP-2021-RIP-02).

Key words: durian, variant sites, SNP, population evolution