农业大数据学报 ›› 2026, Vol. 8 ›› Issue (2): 258-265.doi: 10.19788/j.issn.2096-6369.100072

• 数据资源 • 上一篇    下一篇

地黄叶片虫害孔洞图像数据集(RPHD)的构建与基准评估

许琳娜(), 黄婷, 郑丽萍*(), 费选   

  1. 河南工业大学人工智能与大数据学院郑州 450001
  • 收稿日期:2026-03-07 接受日期:2026-04-20 出版日期:2026-06-26 发布日期:2026-06-26
  • 通讯作者: 郑丽萍,E-mail:pinglizheng@haut.edu.cn
  • 作者简介:许琳娜,E-mail:3098138162@qq.com

Construction and Benchmark Evaluation of the Rehmannia Leaf Pest-induced Hole Image Dataset (RPHD)

XU LinNa(), HUANG Ting, ZHENG LiPing*(), FEI Xuan   

  1. School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou 450001, China
  • Received:2026-03-07 Accepted:2026-04-20 Published:2026-06-26 Online:2026-06-26

摘要:

地黄是我国重要的道地中药材,规模化种植的地黄常受多种害虫侵袭,导致叶片孔洞频现,严重影响地黄的产量与品质。然而,当前基于深度学习的虫害孔洞检测方法面临田间环境复杂、目标尺度小、形态不规则等挑战。本研究构建了面向田间复杂环境的地黄叶片虫害孔洞图像数据集(Rehmannia Pest-induced Hole Dataset, RPHD)。该数据集包含初始采集版本(298张图像,5 059个标注)与精校增强版本(291张图像,2 678个高质量标注)。通过系统化的图像预处理流程,包括基于内容感知的裁剪与分辨率统一(1024×1024)、自适应双边滤波去噪与光照均衡化,并结合精细化人工标注与多轮交叉校验质量控制机制,显著提升了数据集的目标判别性。实验采用YOLOv10n、YOLOv11n和YOLO12n三种主流目标检测模型对数据集进行基准评估,对比两个版本在不同模型上的性能表现。结果表明,在三个模型上,使用精校增强数据集训练均取得显著性能提升:YOLOv10n的mAP@0.5从40.3%提升至87.2%,YOLOv11n从48.9%提升至92.1%,YOLO12n从51.2%提升至93.0%。该数据集为地黄小目标病虫害检测算法研究提供了标准化、高质量的基准数据支持,可供其他农作物相关研究借鉴,有助于推动农业视觉检测技术的实用化与智能化发展。

数据摘要:

项目 内容
数据集名称 地黄叶片虫害孔洞图像数据集(RPHD)
所属学科 计算机科学;农业科学
研究主题 虫害孔洞检测,图像数据集,目标检测,地黄
数据时间范围 2024年6月—2024年9月
时间分辨率 不适用
数据地理空间覆盖 河南省温县核心种植区(东经112.95°-113.15°,北纬34.88°-35.00°)
空间分辨率 不适用
数据类型与技术格式 原始田间图像(JPEG格式);预处理后图像(JPEG格式,1024×1024像素);目标检测标注文件(YOLO格式.txt)
数据库(集)组成 原始版本(298张图像,5 059个标注);优化版本(291张图像,2 678个标注)
数据量 约848 MB
主要数据指标 虫害孔洞边界框坐标(x_center, y_center, width, height),类别ID=0
数据可用性 DOI:10.57760/sciencedb.29865; https://www.scidb.cn/detail?dataSetId=8954e40ec0eb4f3fb7dad889e982547f
CSTR: 31253.11.sciencedb.29865; https://www.scidb.cn/s/RF7J3q

关键词: 地黄, 虫害孔洞检测, 图像数据集, 数据标注, YOLO, 农业人工智能

Abstract:

Rehmannia glutinosa is an important traditional Chinese medicinal herb. Its large-scale cultivation is often affected by various pests, leading to frequent leaf pest-induced holes that severely impact yield and quality. However, current deep learning-based Pest-induced Hole detection methods face challenges such as complex field environments, small target scales, and irregular morphological characteristics. To address these issues, this paper presents the first Rehmannia Leaf Pest-induced Hole Dataset (RPHD) specifically designed for complex field environments. The dataset comprises an initial version (298 images with 5,059 annotations) and a refined version (291 images with 2,678 high-quality annotations). Through a systematic image preprocessing pipeline, including content-aware cropping and resolution normalization (1024×1024), adaptive bilateral filtering for noise reduction, and illumination equalization, combined with meticulous manual annotation and multiple rounds of cross-validation quality control mechanisms, the annotation consistency and target discriminability of the dataset were significantly improved. Three mainstream object detection models—YOLOv10n, YOLOv11n, and YOLOv12n—were employed to evaluate the dataset, comparing the performance of the two versions across different models. Experimental results show that training on the refined dataset yields substantial performance improvements across all models:YOLOv10n: mAP@0.5 increased from 40.3% to 87.2%;YOLOv11n: mAP@0.5 increased from 48.9% to 92.1%;YOLO12n: mAP@0.5 increased from 51.2% to 93.0%. This dataset provides standardized, high-quality benchmark data for research on small-target pest and disease detection algorithms for Rehmannia and other crops, contributing to the practical and intelligent development of agricultural vision detection technologies.

Data summary:

Items Description
Dataset name Rehmannia Leaf Pest-induced Hole Image Dataset (RPHD)
Specific subject area Computer science; Agricultural science
Research topic Pest-induced hole detection, Image dataset, Object detection, Rehmannia glutinosa
Time range June 2024 - September 2024
Temporal resolution Not applicable
Geographical scope Core planting area of Wen County, Henan Province, China (112.95°E-113.15°E, 34.88°N-35.00°N)
Spatial resolution Not applicable
Data types and technical formats Raw field images (JPEG); Preprocessed images (JPEG, 1024×1024 pixels); Object detection annotation files (YOLO format.txt)
Dataset structure Raw version: 298 images, 5,059 annotations; Enhanced version: 291 images, 2,678 annotations.
Volume of dataset Approx. 848 MB
Key index in dataset Pest-induced hole bounding box coordinates (x_center, y_center, width, height), class ID = 0
Data accessibility DOI:10.57760/sciencedb.29865; https://www.scidb.cn/detail?dataSetId=8954e40ec0eb4f3fb7dad889e982547f
CSTR: 31253.11.sciencedb.29865; https://www.scidb.cn/s/RF7J3q
Financial support None

Key words: Rehmannia glutinosa, Pest-induced Hole detection, image dataset, data annotation, YOLO, agricultural artificial intelligence