Journal of Agricultural Big Data ›› 2023, Vol. 5 ›› Issue (4): 47-55.doi: 10.19788/j.issn.2096-6369.230405

Previous Articles     Next Articles

A Training Dataset for Deep Neural Network Model Recognition of Common Cotton Diseases

ZHAO HongXin1,2(), SHAO MingYue1,2, PAN Pan1,2, WANG ZhiAo1,2, MU Qiang1,2, HE ZiKang1,2, ZHANG JianHua1,2,*()   

  1. 1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    2. Hainan National Breeding and Multiplication Institute at Sanya, Chinese Academy of Agricultural Sciences, Sanya 572024, Hainan, China
  • Received:2023-08-18 Accepted:2023-09-05 Online:2023-12-26 Published:2024-01-05

Abstract:

In the realm of cotton disease identification, the Deep Neural Network emerges as a pivotal paradigm. Progress in this sphere hinges on the availability of a comprehensive repository of scientific data, encapsulating a broader spectrum of diseases, variegated soil profiles, and multifaceted environmental attributes. Currently, this dearth of data serves as the principal bottleneck, impeding the advancement of state-of-the-art models and algorithms.Within this scholarly exposition, we present a meticulously curated cotton disease dataset, poised to bridge this knowledge chasm. This dataset comprehensively encompasses four prevalent cotton diseases: anthracnose, bacterial blight, brown spot, and wilt disease. These maladies' exemplars were meticulously gleaned from cotton fields situated in the Potianyang High-standard Farmland Demonstration Base, nestled serenely in Sanya, Hainan Province, China.The dataset unfolds as a magnum opus, comprising 3 453 high-resolution images. These vivid snapshots provide a panoramic view, capturing the pristine vitality of healthy leaves, juxtaposed with leaves beset by disease at various growth stages. The data acquisition, executed with precision, leveraged field random sampling methodologies, ensuring a faithful reflection of the natural complexity in real-world cotton plantations.Every image underwent meticulous scrutiny, with ten seasoned mavens in cotton pathology meticulously overseeing the annotation. An additional cohort of twenty annotators conducted a second round of annotations on randomly selected image subsets, fortifying the dataset's integrity and precision. The Vision Transformer model was employed to guarantee the dataset's resilience and accuracy.This hallowed dataset was meticulously gathered amidst the complexity of field environments, encapsulating the nuances of major cotton diseases in their native habitat. Its high image resolution, akin to an opulent tapestry of visual data, renders it an invaluable resource for pioneering research, astute training, and the relentless validation of astute, intelligent cotton disease recognition models and algorithms. This opulent repository caters to the discriminating tastes of researchers, practitioners, and sagacious decision-makers, furnishing them with a comprehensive and crystalline understanding of the multifaceted tapestry of cotton diseases and their intricate management.

Data summary:

Item Description
Dataset name A Training Dataset for Deep Neural Network Model Recognition of Common Cotton Diseases
Specific subject area Agricultural Science, Computer Science
Time range December, 2021-August, 2023
Geographical scope This dataset covers the plain planting area of Potianyang Base in Sanya City, Hainan Province, with a central latitude and longitude of (109.165497,18.3931609999999)
Data types and technical formats Cotton Image Format *. jpg, Cotton Disease Classification Standard Format *. txt
Dataset structure The dataset consists of 3453 image files and one text file. The image files belong to a folder named Cotton Disease Data, all of which are *. JPG files. The folder where the text files belong is named the Cotton Disease Dataset, where all files are *. TXT
Volume of data 2.74 GB
Data accessibility CSTR:17058.11.sciencedb.agriculture.00029
DOI:10.57760/sciencedb.agriculture.00029
Financial support National Key R&D Plan (2022YFF0711805); Science and Technology Special Fund for Sanya Yazhou Bay Science and Technology City (SCKJ-JYRC-2023-45);Innovation Engineering of the Chinese Academy of Agricultural Sciences (CAAS - ASTIP - 2023 - AII, ZDXM23011); Special funds for basic research business of central level public welfare research institutes (Y2022XK24, Y2022QC17, JBYW - AII - 2022 - 14, JBYW - AII - 2023 - 06);
Sanya Chinese Academy of Agricultural Sciences National South Breeding Research Institute South Breeding Special Project (YDLH01, YDLH07, YBXM10, ZDXM23011, YBXM2312)

Key words: deep learning, cotton anthracnose, cotton angular leaf spot, cotton brown spot, Fusarium wilt, image recognition technology