Image-Text Multi-Modal Dataset of Corn Leaf Diseases based on Manual Annotation and Contrast Generation Model

  • WANG YanFang ,
  • XIAN GuoJian ,
  • ZHAO RuiXue
Expand
  • 1. Agricultural Information Institute of CAAS, Beijing 100081, China
    2. Key Laboratory of Knowledge Mining and Knowledge Services in Agricultural Converging Publishing, National Press and Publication Administration, Beijing 100081, China
    3. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081, China

Received date: 2025-04-16

  Accepted date: 2025-05-20

  Online published: 2025-09-28

Abstract

Accurately identifying corn leaf diseases is an important part of intelligent agricultural management. The existing maize disease data sets have problems such as uneven quality, fuzzy label categories, and lack of multimodal data, especially the scarcity of disease description data in the Chinese context. This data set integrates the image data of corn disease from open source platforms such as AI Challenger, PlantVillage and OpenDataLab, and complements the high-definition disease images collected in the field. A Chinese multimodal data set containing 1653 images is constructed. Each image has its corresponding diagnostic text description, covering key information such as disease type, disease characteristics and severity. At the same time, the cn-clip and CPT2 Chinese large model are combined to achieve image description generation, which provides a method for automatic annotation. This data set can provide high-quality data support for the development of an intelligent diagnosis model of corn disease, the generation of Chinese image description and the construction of an agricultural multimodal knowledge map.

Data summary:

Item Description
Dataset name Image-Text Multi-Modal Dataset of Corn Leaf Diseases based on Manual Annotation and Contrast Generation Model
Specific subject area Agricultural Science, Computer Science
Research topic Computer vision, Cross-modal retrieval, Image captioning
Data types and technical formats .jpg
Dataset structure The dataset is composed of two parts: image data of corn leaf disease and corresponding text description data, including: 1. the original image data set of leaf disease, including 9 typical disease image data, including large leaf spot, small leaf spot, brown spot, Curvularia leaf spot, common rust, southern rust, gray spot, round spot and dwarf mosaic, with a total of 1653 pieces; 2. the diagnostic text description corresponding to the image has an average length of about 32 characters, a total of 1653.
Volume of dataset 3.87 GB
Key index in dataset Image and its corresponding description text
Data accessibility CSTR:17058.11.sciencedb.agriculture.00226; https://cstr.cn/17058.11.sciencedb.agriculture.00226
DOI:10.57760/sciencedb.agriculture.00226; https://doi.org/10.57760/sciencedb.agriculture.00226
Financial support National Science and Technology Major Project(2021ZD0113705).

Cite this article

WANG YanFang , XIAN GuoJian , ZHAO RuiXue . Image-Text Multi-Modal Dataset of Corn Leaf Diseases based on Manual Annotation and Contrast Generation Model[J]. Journal of Agricultural Big Data, 2025 , 7(3) : 371 -378 . DOI: 10.19788/j.issn.2096-6369.100060

References

[1] The Freezone Channel. FAO says plant diseases robs global economy of $220bn. (2021-06-03)[2025-04-08]. https://thefreezonechannel.com/2021/06/03/fao-says-plant-diseases-robs-global-economy-of-220bn/.
[2] 王晓杰, 甘鹏飞, 汤春蕾, 等. 植物抗病性与病害绿色防控:主要科学问题及未来研究方向. 中国科学基金, 2020, 34(4):381-392.
  WANG X, GAN P, TANG C, et al. Plant disease resistance and disease green prevention and control:major scientific issues and future research directions. Bulletin of National Natural Science Foundation of China, 2020, 34(4):381-392.
[3] HUGHES D P, SALATHE M. An open access repository of images on plant health to enable the development of mobile disease diagnostics. Computer Science, 2015.DOI:10.48550/arXiv.1511.08060.
[4] WANG X, CAO W. GACN: Generative adversarial classified network for balancing plant disease dataset and plant disease recognition. Sensors, 2023, 23(15).DOI:10.3390/s23156844.
[5] AHMAD A, SARASWAT D, GAMAL A E, et al. CD&S Dataset: Handheld imagery dataset acquired under field conditions for corn disease identification and severity estimation. arXiv, 2021. DOI:10.48550/arXiv.2110.12084.
[6] ZHOU H, HU Y, LIU S, et al. A precise framework for rice leaf disease image-text retrieval using FHTW-Net. Plant Phenomics, 2024, 6:0168. DOI: 10.34133/plantphenomics.0168.
[7] GU J, MENG X, LU G, et al. Wukong: 100 Million Large-scale Chinese cross-modal pre-training dataset and a foundation framework. arXiv,2022:2202.06767. https://arxiv.org/abs/2202.06767.
[8] LI X R, XU C X, WANG X X, et al. COCO-CN for cross-lingual image tagging, captioning and retrieval. IEEE Transactions on Multimedia, 2019, 21(9):2347-2360.DOI:10.1109/TMM.2019.2896494.
[9] LI X, LAN W, DONG J, et al. Adding Chinese captions to images// Acm on International Conference on Multimedia Retrieval.ACM, 2016. DOI:10.1145/2911996.2912049.
[10] 高鸿生, 王凤葵. 病虫害田间诊断口袋书-玉米病虫害诊治图册. 北京: 机械工业出版社, 2017.
  GAO H, WANG F. Field Diagnosis Pocket Book of Diseases And Insect Pests - Atlas of diagnosis and treatment of corn diseases and insect pests. Beijing: China Machine Press, 2017.
[11] 农业学术服务平台-作物病虫草害数据. (2019-07-08)[2025-04-08]. https://agri.nais.net.cn/specialtyresources/list19-1.html.
  AgriScholar - Crop Diseases,Pests and Weeds Dataset. (2019-07-08) [2025-04-08]. https://agri.nais.net.cn/specialtyresources/list19-1.html.
[12] YANG A, PAN J, LIN J, et al. Chinese CLIP: Contrastive vision- language pretraining in Chinese. arXiv, 2023. DOI: 10.48550/arXiv.2211.01335.
Outlines

/