Journal of Agricultural Big Data ›› 2025, Vol. 7 ›› Issue (2): 261-268.doi: 10.19788/j.issn.2096-6369.100042

Construction Data Set of Knowledge Map of Main Crops Approved Varieties in Guangdong Province from 2016 to 2023

GAO ZhuoJun1, ZHANG DanDan2,*, CHEN RongYu3

  1. Institute of Agricultural Economics and Information, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
  2. Agricultural Information Institute of CAAS / Key Laboratory of Knowledge Mining and Knowledge Service for Agricultural Convergence Publishing, Beijing 100081, China
  3. Haifeng County Agricultural Science Research Institute, Shanwei 516499, Guangdong, China
  • Received: 2024-08-15 Accepted: 2024-09-29 Online: 2025-06-26 Published: 2025-06-23
  • Contact: ZHANG DanDan

Abstract:

This study combines data on approved crop varieties in Guangdong Province with knowledge map (knowledge graph) technology. The seed industry is the first link of the agricultural industrial chain and an important pillar of national food security and economic development. As an important innovative resource in this link, approved varieties are released for extension only after strict testing and objective evaluation, which protects and makes use of germplasm resources and promotes the high-quality development of the seed industry. With the advance of agricultural informatization, the volume of agricultural data has grown dramatically, and modern information technologies such as big data and artificial intelligence play a prominent role in improving agricultural production efficiency and optimizing resource allocation. Knowledge maps, an important branch of artificial intelligence and semantic networks, have been widely applied in many fields; in agriculture, however, knowledge map research has focused mainly on crop cultivation, water and fertilizer management, and pest control. Taking the reliability, practicability, and continuity of the data into account, this study collected eight years of crop variety data for Guangdong Province (2016 to 2023) from information publicly released by the Guangdong Provincial Department of Agriculture and Rural Affairs. The source data were stored in .doc format and contained a large amount of text and extraneous characters. To facilitate machine recognition and subsequent knowledge map construction, the study removed this noise through data cleaning and extracted common attributes from the characteristics and yield performance of the varieties. In total, 823 germplasm resource records for approved varieties of three crops (rice, corn, and soybean) were sorted, merged, and stored as structured data in .xlsx and .json formats. To verify the validity of the data, a knowledge map of approved varieties of the main crops in Guangdong Province was constructed using the graph database Neo4j. Research and production units can use this dataset to build an expert knowledge base of approved crop varieties and, through database expansion and multi-source data fusion, develop intelligent services such as question answering, management decision support, and information recommendation for specific agricultural tasks.
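The workflow described in the abstract (cleaning noisy text, extracting attributes, and loading the result into Neo4j) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the field names (`name`, `crop`, `year`), the node labels `Variety` and `Crop`, the relationship type `BELONGS_TO`, and the sample record are all hypothetical, and the sketch only generates Cypher text rather than connecting to a database.

```python
import re

# Control characters other than tab, newline, and carriage return.
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def clean_text(raw: str) -> str:
    """Drop stray control characters and collapse runs of whitespace."""
    without_controls = _CONTROL_CHARS.sub("", raw)
    return re.sub(r"\s+", " ", without_controls).strip()


def cypher_string(value: str) -> str:
    """Quote a Python string as a single-quoted Cypher string literal."""
    escaped = value.replace("\\", "\\\\").replace("'", "\\'")
    return "'" + escaped + "'"


def variety_to_cypher(record: dict) -> str:
    """Build one Cypher statement linking a variety node to its crop node.

    The labels, property names, and relationship type are hypothetical.
    """
    name = cypher_string(clean_text(record["name"]))
    crop = cypher_string(clean_text(record["crop"]))
    year = int(record["year"])
    return (
        f"MERGE (c:Crop {{name: {crop}}}) "
        f"MERGE (v:Variety {{name: {name}}}) "
        f"SET v.approval_year = {year} "
        f"MERGE (v)-[:BELONGS_TO]->(c)"
    )


# Hypothetical record with export noise (tab, control character, padding).
sample = {"name": "  Example\tVariety \x07No. 1 ", "crop": "rice", "year": 2016}
statement = variety_to_cypher(sample)
print(statement)
```

`MERGE` is used instead of `CREATE` so that re-running the statement does not duplicate nodes. In practice, passing values as query parameters through a database driver would be preferable to building string literals by hand.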

Data summary:

Item: Description
Dataset name: Construction Data Set of Knowledge Map of Main Crops Approved Varieties in Guangdong Province from 2016 to 2023
Specific subject area: Other disciplines of agriculture
Research topic: Crops; Agricultural knowledge map; Data mining
Time range: 2016-2023
Temporal resolution: Year
Geographical scope: Guangdong Province
Data types and technical formats: .xlsx, .json
Dataset structure: The dataset consists of one tabular file and three text files. The tabular file contains 823 germplasm resource records for three crops (rice, corn, and soybean) in Guangdong Province from 2016 to 2023; the text files contain common high-frequency attribute data extracted for rice, corn, and soybean according to their characteristics and yield performance.
Volume of dataset: 4.18 MB
Key index in dataset: Crop category, variety name, variety source, growth period, planting time, morphological characteristics, disease resistance, yield performance, average yield per mu, planting area, etc.
Data accessibility: CSTR: 17058.11.sciencedb.agriculture.00117 (https://cstr.cn/17058.11.sciencedb.agriculture.00117); DOI: 10.57760/sciencedb.agriculture.00117 (https://doi.org/10.57760/sciencedb.agriculture.00117)
Financial support: Guangdong Provincial Lingnan Characteristic Agriculture Science Data Center (2021B1212100005); Research on knowledge fusion and shared services of crop seed industry data resources (2023KMKS04)
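
The key indexes listed in the data summary suggest a simple per-variety record. As a hedged sketch of how a user might check a JSON record for expected fields and convert average yield per mu to yield per hectare (1 mu = 1/15 ha), consider the following. The JSON key names and the sample values are hypothetical and are not taken from the published files.

```python
import json

MU_PER_HECTARE = 15  # 1 mu = 1/15 hectare

# Hypothetical key names; the published files may use different ones.
REQUIRED_KEYS = {"crop_category", "variety_name", "variety_source", "yield_kg_per_mu"}


def missing_keys(record: dict) -> set:
    """Return the required keys that are absent from a record."""
    return REQUIRED_KEYS - set(record)


def yield_per_hectare(yield_kg_per_mu: float) -> float:
    """Convert a yield in kg per mu to kg per hectare."""
    return yield_kg_per_mu * MU_PER_HECTARE


# Hypothetical sample data: one complete record and one incomplete record.
raw = """
[
  {"crop_category": "rice", "variety_name": "Example No. 1",
   "variety_source": "A x B", "yield_kg_per_mu": 450.0},
  {"crop_category": "soybean", "variety_name": "Example No. 2"}
]
"""
records = json.loads(raw)

for rec in records:
    gaps = missing_keys(rec)
    if gaps:
        print(rec["variety_name"], "is missing:", sorted(gaps))
    else:
        print(rec["variety_name"], yield_per_hectare(rec["yield_kg_per_mu"]), "kg/ha")
```

Reporting incomplete records instead of silently skipping them makes gaps visible before the data are loaded into a graph database.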

Key words: crops, approved varieties, characteristics, knowledge map, germplasm resources