应用于农业场景视觉解析任务的番茄数据集

周玲莉, 任妮, 张文翔, 程雅雯, 陈诚, 易中懿

doi:10.19788/j.issn.2096-6369.210408

农业大数据学报 >

2021 , Vol. 3 >Issue 4: 70 - 76

DOI: https://doi.org/10.19788/j.issn.2096-6369.210408

数据论文

应用于农业场景视觉解析任务的番茄数据集

展开

江苏省农业科学院信息中心，南京 210014

[1] 周玲莉|周玲莉，女，硕士，研究方向：计算机视觉; E-mail: 20210054@jaas.ac.cn|任妮，张文翔，等. 应用于农业场景视觉解析任务的番茄数据集[DB/OL]. 国家农业科学数据中心. DOI: 10.12205/E0292.20220113.36.cs.1926.|任妮，张文翔，等. 应用于农业场景视觉解析任务的番茄数据集[DB/OL]. 国家农业科学数据中心. DOI: 10.12205/E0292.20220113.36.cs.1926.

收稿日期: 2021-11-12

网络出版日期: 2022-01-28

基金资助

江苏省重点研发（现代农业）项目——基于数智融合的设施栽培物联网关键技术及装备研发(BE2021379)

收起

Tomato Dataset for Agricultural Scene Visual-Parsing Tasks

Expand

Information Center of Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China

Received date: 2021-11-12

Online published: 2022-01-28

Fold

摘要

农业机器人是农业现代化发展中的一项重要组成部分，计算机视觉技术通过对作物和环境的感知与解析，有效促进其在农业领域的落地与应用。但由于农业场景的复杂性、多样性，当前先进计算机视觉方法所需要的详细且注释的大规模图像数据集在农业领域十分稀缺，这也是阻碍计算机视觉技术在农业领域发展的一个主要瓶颈。针对这一痛点，该文提供了一个可用于图像语义分割、图像实例分割、目标检测任务的大规模番茄图像数据集。该数据集由两部分组成，分别为合成部分和经验部分。其中合成部分使用Wageningen大学等人提出的数据合成方法，生成3250张合成番茄图像以及对应的像素级别语义分割标签图；经验部分由RGB相机拍摄的750张单目图像和400张双目图像构成，并人工对它们部分进行了包括实例分割、目标检测等在内的精细标注。研究旨在从多个角度丰富该数据集，包括数据集的大小、标注信息的多维度、场景的复杂性等方面，为今后利用计算机视觉技术解决农业领域问题提供数据基础。

关键词： 番茄; 图像; 农业领域; 场景解析

本文引用格式

周玲莉, 任妮, 张文翔, 程雅雯, 陈诚, 易中懿 . 应用于农业场景视觉解析任务的番茄数据集[J]. 农业大数据学报, 2021 , 3(4) : 70 -76 . DOI: 10.19788/j.issn.2096-6369.210408

Abstract

Agricultural robots are an important part of the development of agricultural modernization, and computer vision technology effectively promotes their application in the field of agriculture by perceiving and analyzing crops and the environment. However, because of the complexity and diversity of agricultural scenes, the detailed and annotated large-scale image datasets required by advanced computer vision methods are scarce in the field of agriculture. This lack of datasets is the main challenge in the development of computer vision technology in the field. To solve this problem, this paper presents a large-scale tomato image dataset that can be used for semantic image segmentation, instance segmentation, target detection, and other tasks. The dataset consists of synthetic and real images. The synthetic images include 3250 synthetic tomato images and the corresponding pixel-level semantic segmentation label images; the real images consist of 750 monocular images and 400 binocular images taken by RGB cameras, some of which have detailed manual labels for instance segmentation and target detection. This research aims to enrich many aspects of the dataset, including its capacity, the dimensionality of the annotation information, and the complexity of the scene, and to provide data support for solving future problems in the field of agriculture using computer vision technology.

Key words： tomato; image; agricultural field; scene parsing

参考文献

1	Li L, Qian B, Lian J, et al. Traffic Scene Segmentation Based on RGB-D Image and Deep Learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2017:1-6.
2	郑阳.医疗人工智能的关键技术及应用[J].医学信息,2021,34(02):19-22.
2	Zheng Y. Key Technology and Application of Medical Artificial Intelligence[J]. Medical information, 2021, 34(02):19-22
3	LeCun, Yann, Bengio. Deep learning[J]. Nature, 2015, 521(7553):436.
4	李优,穆林平.基于迁移学习的垃圾图像分类模型研究[J].电脑与信息技术,2021,29(04):17-21.
4	Li Y, Mu L P. Research on Garbage Image Classification Model Based on Transfer Learning[J]. Computer and Information Technology, 2021, 29(04):17-21
5	罗浩.深度视频物体检测中的时空上下文及时序调度算法研究[D].武汉:华中科技大学, 2019.
5	Luo H. Research on Spatiotemporal Context and Time Sequence Scheduling Algorithm in Deep Video Object Detection[D]. Wuhan: Huazhong University of Science and Technology, 2019.
6	Badrinarayanan V, Kendall A, Cipolla R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017:1-1.
7	Liu W, Rabinovich A, Berg A C. ParseNet: Looking Wider to See Better[J]. Computer Science, 2015.
8	Brostow G J, Fauqueur J, Cipolla R. Semantic Object Classes in Video: A High-definition Ground Truth Database[J]. Pattern Recognition Letters, 2008, 30(2) :88-97.
9	Geiger A, Lenz P, Stiller C, et al. Vision Meets Robotics: The KITTI dataset[J]. The International Journal of Robotics Research, 2013, 32(11).
10	Cordts M, Omran M, Ramos S, et al. The Cityscapes Dataset for Semantic Urban Scene Understanding[C] 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016.
11	Jia W, Jia Y, Luo R, et al. Detection and Segmentation of Overlapped Fruits Based on Optimized Mask R-CNN Application in Apple Harvesting Robot[J]. Computers and Electronics in Agriculture, 2020, 172.
12	Yu Y, Zhang K, Yang L, et al. Fruit Detection for Strawberry Harvesting Robot in Non-structural Environment Based on Mask-RCNN[J]. Computers and Electronics in Agriculture, 2019, 163.
13	Kang H, Chen C. Fruit Detection and Segmentation for Apple Harvesting Using Visual Sensor in Orchards[J]. Sensors (Basel, Switzerland), 2019, 19(20).
14	Zhang T, Huang Z, You W, et al. An Autonomous Fruit and Vegetable Harvester with a Low-Cost Gripper Using a 3D Sensor[J]. Sensors (Basel, Switzerland), 2020, 20(1).
15	高飞,王晓丽,刘婷婷,等.中国苹果叶片病虫害2015年光谱和图像数据集[J].农业大数据学报,2020,02(04):120-124.
15	Gao F, Wang X L, Liu T T, et al. Spectral and Imaging Datasets of Apple Leaf Disease and Insect Pests in China in 2015[J]. Journal of Agricultural Big Data, 2020, 02(04):120-124.
16	王晓丽,胡乾浩,樊景超,等.辽北苹果叶片氮含量、近红外光谱与图像数据集[J].农业大数据学报,2020,02(04):113-119.
16	Wang X L, Hu Q H, Fan J C, et al. Near-Infrared Spectral and Imaging Datasets of Fruit Tree Blooming in China in 2016[J]. Journal of Agricultural Big Data, 2020, 02(04):113-119.
17	Barth R, IJsselmuiden J, Hemming J, et al. Henten. Data Synthesis Methods for Semantic Segmentation in Agriculture: A Capsicum Annuum Dataset[J]. Computers and Electronics in Agriculture, 2018, 144:284-296.
18	Redmon J, Divvala S, Girshick R, et al. You Only Look Once: Unified, Real-Time Object Detection[C]. 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016.
19	He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[C]. 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016.
20	Chen L C, Papandreou G, Kokkinos I, et al. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848.

Options

文章导航

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献