Journal of Agricultural Big Data >
A Training Dataset for Deep Neural Network Model Recognition of Common Cotton Diseases
Received date: 2023-08-18
Accepted date: 2023-09-05
Online published: 2024-01-05
In the realm of cotton disease identification, the Deep Neural Network emerges as a pivotal paradigm. Progress in this sphere hinges on the availability of a comprehensive repository of scientific data, encapsulating a broader spectrum of diseases, variegated soil profiles, and multifaceted environmental attributes. Currently, this dearth of data serves as the principal bottleneck, impeding the advancement of state-of-the-art models and algorithms.Within this scholarly exposition, we present a meticulously curated cotton disease dataset, poised to bridge this knowledge chasm. This dataset comprehensively encompasses four prevalent cotton diseases: anthracnose, bacterial blight, brown spot, and wilt disease. These maladies' exemplars were meticulously gleaned from cotton fields situated in the Potianyang High-standard Farmland Demonstration Base, nestled serenely in Sanya, Hainan Province, China.The dataset unfolds as a magnum opus, comprising 3 453 high-resolution images. These vivid snapshots provide a panoramic view, capturing the pristine vitality of healthy leaves, juxtaposed with leaves beset by disease at various growth stages. The data acquisition, executed with precision, leveraged field random sampling methodologies, ensuring a faithful reflection of the natural complexity in real-world cotton plantations.Every image underwent meticulous scrutiny, with ten seasoned mavens in cotton pathology meticulously overseeing the annotation. An additional cohort of twenty annotators conducted a second round of annotations on randomly selected image subsets, fortifying the dataset's integrity and precision. The Vision Transformer model was employed to guarantee the dataset's resilience and accuracy.This hallowed dataset was meticulously gathered amidst the complexity of field environments, encapsulating the nuances of major cotton diseases in their native habitat. Its high image resolution, akin to an opulent tapestry of visual data, renders it an invaluable resource for pioneering research, astute training, and the relentless validation of astute, intelligent cotton disease recognition models and algorithms. This opulent repository caters to the discriminating tastes of researchers, practitioners, and sagacious decision-makers, furnishing them with a comprehensive and crystalline understanding of the multifaceted tapestry of cotton diseases and their intricate management.
Data summary:
| Item | Description |
|---|---|
| Dataset name | A Training Dataset for Deep Neural Network Model Recognition of Common Cotton Diseases |
| Specific subject area | Agricultural Science, Computer Science |
| Time range | December, 2021-August, 2023 |
| Geographical scope | This dataset covers the plain planting area of Potianyang Base in Sanya City, Hainan Province, with a central latitude and longitude of (109.165497,18.3931609999999) |
| Data types and technical formats | Cotton Image Format *. jpg, Cotton Disease Classification Standard Format *. txt |
| Dataset structure | The dataset consists of 3453 image files and one text file. The image files belong to a folder named Cotton Disease Data, all of which are *. JPG files. The folder where the text files belong is named the Cotton Disease Dataset, where all files are *. TXT |
| Volume of data | 2.74 GB |
| Data accessibility | CSTR:17058.11.sciencedb.agriculture.00029 DOI:10.57760/sciencedb.agriculture.00029 |
| Financial support | National Key R&D Plan (2022YFF0711805); Science and Technology Special Fund for Sanya Yazhou Bay Science and Technology City (SCKJ-JYRC-2023-45);Innovation Engineering of the Chinese Academy of Agricultural Sciences (CAAS - ASTIP - 2023 - AII, ZDXM23011); Special funds for basic research business of central level public welfare research institutes (Y2022XK24, Y2022QC17, JBYW - AII - 2022 - 14, JBYW - AII - 2023 - 06); Sanya Chinese Academy of Agricultural Sciences National South Breeding Research Institute South Breeding Special Project (YDLH01, YDLH07, YBXM10, ZDXM23011, YBXM2312) |
ZHAO HongXin, SHAO MingYue, PAN Pan, WANG ZhiAo, MU Qiang, HE ZiKang, ZHANG JianHua . A Training Dataset for Deep Neural Network Model Recognition of Common Cotton Diseases[J]. Journal of Agricultural Big Data, 2023 , 5(4) : 47 -55 . DOI: 10.19788/j.issn.2096-6369.230405
| [1] | ZHANG J H, KONG F T, WU, J Z, et al. Automatic image segmentation method for cotton leaves with disease under natural environment[J]. Journal of Integrative Agriculture, 2018, 17(8):1800-1814. DOI:10. 1016/S2095- 3119(18)61915-X. |
| [2] | PAN P, GUO W, ZHENG X, et al. Xoo-YOLO: a detection method for wild rice bacterial blight in the field from the perspective of unmanned aerial vehicles[J]. Frontiers in Plant Science, 2023, 14: 1256545. DOI:10.3389/fpls.2023.1256545. |
| [3] | ZHANG J, KONG F, ZHAI Z, et al. Robust image segmentation method for cotton leaf under natural conditions based on immune algorithm and PCNN algorithm[J]. International Journal of Pattern Recognition and Artificial Intelligence, 2018, 32(5): 1854011. DOI:10.1142/ S0218001418540113. |
| [4] | 张建华, 孔繁涛, 吴建寨, 等. 基于改进VGG卷积神经网络的棉花病害识别模型[J]. 中国农业大学学报, 2018, 23(11):161-171. |
| [5] | 邵明月, 张建华, 冯全, 等. 深度学习在植物叶部病害检测与识别的研究进展[J]. 智慧农业(中英文), 2022, 4(1):29-46. |
| [6] | SHAO M, HE P, ZHANG Y, et al. Identification method of cotton leaf diseases based on Bilinear Coordinate Attention Enhancement Module[J]. Agronomy, 2023, 13(1):88. DOI:10.3390/agronomy13010088. |
| [7] | 周国民. 我国农业大数据应用进展综述[J]. 农业大数据学报, 2019, 1(1):16-23.doi:10.19788/j.issn.2096-6369.190102. |
| [8] | VASILEIADIS S, PUGLISI E, ARENA M, et al. Soil bacterial diversity screening using single 16S rRNA gene V regions coupled with Multi-million read generating sequencing technologies[J]. Plos One, 2012, 8(7): e42671. |
| [9] | ZHANG K, SHI Y, CUI X, et al. Salinity is a key determinant for soil microbial communities in a desert ecosystem[J]. Msystems, 2019, 4(1): 225-243. |
| [10] | VASILEIADIS S, PUGLISI E, ARENA M, et al. Soil bacterial diversity screening using single 16S rRNA gene V regions coupled with Multi-million read generating sequencing technologies[J]. Plos One, 2012, 8(7): e42671. |
| [11] | 闫靖昆, 黄毓贤, 秦伟森, 等. 棉田复杂背景下棉花黄萎病病斑分割算法研究[J]. 南京师大学报(自然科学版), 2021, 44(4):127-134. |
| [12] | 王曾龙, 蒋勇勇, 彭海峰, 等. 一种基于Faster-RCNN的棉花虫害识别与统计方法[J]. 大众科技, 2023, 25(5):5-7+12. |
| [13] | 伍维模, 吕双庆, 赵长巍, 等. 基于小型卷积神经网络的南疆棉花图像分类[J]. 智慧农业导刊, 2023, 3(8):17-23. |
| [14] | 张嘉镐, 杨济东, 赵俊杰, 等. 基于改进的YOLOv5对棉花枯萎病的识别算法[J]. 电脑知识与技术, 2023, 19(20):51-53+56. |
| [15] | 陈洛轩, 林成创, 郑招良, 等. Transformer在计算机视觉场景下的研究综述[J]. 计算机科学, 2023, 9(14):1-5. |
| [16] | 潘攀, 张建华, 郑晓明, 等. 深度学习在作物及其近缘种抗病性智能鉴定上的研究进展[J]. 浙江农业学报, 2023, 35(8):1993-2012. |
| [17] | BAI Y, QIAN J M, ZHOU J M, et al. Crop microbiome: break‐ through technology for agriculture[J]. Proceedings of the Chinese Academy of Sciences, 2017, 32(3):260-265. |
| [18] | 吕新, 梁斌, 张立福, 等. 新疆生产建设兵团棉花生产大数据平台建设与探索[J]. 农业大数据学报, 2020, 2(1):70-78. |
| [19] | 梁斌, 仵晓娟, 李继玲, 等. 林果大数据分析应用平台设计研究—以新疆生产建设兵团为例[J]. 中南林业科技大学学报, 2020, 40(9): 173-182. |
| [20] | 张萌, 董伟, 钱蓉, 等. 安徽省植保大数据平台建设与应用展望[J]. 农业大数据学报, 2020, 2(1):36-44. |
| [21] | ZHANG J J, KOBERT K, FLOURI T A, et al. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics, 2014, 30(5): 614-620. |
/
| 〈 |
|
〉 |