农业大数据学报

• •    下一篇

大田作物病虫害图像数据集综述

赵小丹1,胡林1,2*,刘婷婷1,2*   

  1. 1.中国农业科学院农业信息研究所,北京100081;2. 国家农业科学数据中心,北京100081

Review of Image Datasets for Field Crop Pest and Disease Management

ZHAO XiaoDan1, HU Lin1,2*, LIU TingTing1,2*   

  1. 1.Institute of Agricultural Information, Chinese Academy of Agricultural Sciences, Beijing 100081, China; 2.National Agricultural Science Data Center, Beijing 100081, China

摘要: 粮食安全是国家安定与经济发展的重要基石,而大田作物病虫害严重威胁粮食生产,亟需高效精准的监测和防控手段。近年来,基于深度学习的病虫害图像识别技术崭露头角,其发展高度依赖高质量的病虫害图像数据集。然而,目前公开数据集在规模、覆盖范围和质量等方面存在不足,限制了研究与实际应用的进一步突破。本文系统梳理了现有水稻、小麦、玉米、马铃薯等主要大田作物病虫害图像数据集的资源来源、典型特征和应用场景,分析其在数据量、多样性和标准化方面的主要挑战。研究发现,数据集在采集时间、地点和病虫害种类覆盖方面具有一定代表性,但仍需进一步提升类别均衡性、多样性和跨领域共享性。对相关数据集资源的归纳整理,可为推动精准农业发展和粮食安全保障提供技术支持和理论依据。

关键词: 病虫害图像, 大田作物, 数据集

Abstract: Food security is a critical foundation for national stability and economic development, while pest and disease outbreaks in field crops pose a severe threat to grain production, necessitating efficient and precise monitoring and control measures. In recent years, deep learning-based pest and disease image recognition technologies have gained prominence, relying heavily on high-quality image datasets. However, the currently available public datasets face limitations in scale, coverage, and quality, hindering further breakthroughs in research and practical applications. This paper systematically reviews existing image datasets of major field crops, including rice, wheat, maize, and potato, focusing on their sources, key characteristics, and application scenarios. It also analyzes major challenges in data volume, diversity, and standardization. The findings reveal that while datasets exhibit a certain representativeness regarding collection time, location, and pest and disease types, improvements are needed in class balance, diversity, and cross-domain sharing. Summarizing and organizing these datasets provides technical support and theoretical insights to advance precision agriculture and ensure food security.

Key words: pest and disease images, field crops, datasets