小麦纹枯病防治领域本体构建
收稿日期: 2023-12-28
录用日期: 2024-03-03
网络出版日期: 2024-12-02
基金资助
国家重点研发计划项目子课题“农业及资源环境领域知识对象深度挖掘技术研究”(2022YFF0711902-01);现代农业产业技术体系北京市创新团队建设项目(BAIC10-2023-E10);NSTL下一代开放知识服务平台关键技术优化集成与系统研发“词表工具优化及智能参考咨询优化对接”(2023XM42-04)
Ontology Construction in the Field of Wheat Sharp Eyespot Control
Received date: 2023-12-28
Accepted date: 2024-03-03
Online published: 2024-12-02
小麦纹枯病是我国麦区常发的土传真菌病害,在小麦整个生育期均可发生,对我国小麦产量和品质影响极大。通过构建小麦纹枯病防治领域本体,对领域知识进行建模,以期整合和共享小麦纹枯病防治领域的知识,为农业决策、病害防治等方面提供重要的支持和指导。本研究以小麦纹枯病防治领域文献为数据源,采用KeyBERT关键词提取算法挖掘本体核心概念,通过层次聚类提取本体概念间层次关系。最后利用Protégé对本体概念、概念间关系进行可视化表达。本研究提出了小麦纹枯病防治领域本体构建方法,阐述了通过构建小麦纹枯病语料库来构建本体的基本方法,给出了领域本体构建的流程框架,详细阐述了构建中所使用的算法和构建工具。本研究数据源主要为科技文献,未来可进一步扩充数据源,对本体进行扩展。本体评估部分目前主要依靠领域专家评估,未来可增加量化评估。本研究所构建的小麦纹枯病防治领域本体包含了较完整的小麦纹枯病概念体系,符合本体评价标准和本体构建需求,可为领域本体的构建提供借鉴与参考,为小麦纹枯病防治领域的知识发现与智能问答、智能推荐等下游应用提供有力支持。
刘珂艺, 崔运鹏, 谷钢, 王末 . 小麦纹枯病防治领域本体构建[J]. 农业大数据学报, 2024 , 6(4) : 485 -496 . DOI: 10.19788/j.issn.2096-6369.000011
Wheat Sharp Eyespot is a soil-borne fungal disease commonly found in China's wheat areas, which can occur throughout the entire reproductive period of wheat and has a great impact on the yield and quality of wheat in China. By constructing a Wheat Sharp Eyespot control domain ontology and modeling domain knowledge, we aim to integrate and share the knowledge in the field of Wheat Sharp Eyespot control to provide important support and guidance for agricultural decision-making and disease control. The ontology construction process for Wheat Sharp Eyespot control is proposed to meet the actual needs of Wheat Sharp Eyespot control. For the problems of low efficiency and limited expert knowledge in constructing ontologies by manual methods, this study will explore new methods for ontology construction. Special attention will be paid to the methodology of mining core concepts of the ontology to reduce the subjectivity and limitations in the construction process, so that the ontology will have a wider application potential.In this study, used the literature in the field of Wheat Sharp Eyespot control as a data source, KeyBERT keyword extraction algorithm was used to mine the core concepts of ontology, and BERT embedding and cosine similarity were used to find out the subphrases in the document that were most similar to the document itself. Hierarchical relationships between ontology concepts were extracted by hierarchical clustering, topic modeling was performed using BERTopic, Transformer and c-TF-IDF were used to create dense clusters.Finally, Protégé was used to visualize and express the ontology concepts and inter-concept relationships.In this study, the results of thematic and hierarchical clustering were analyzed and condensed to classify the ontology of Wheat Sharp Eyespot control into eight parent concepts, which were pathogenicity pattern, wheat growth period, etiology of the disease, disease area, disease extent, symptoms and control measures. According to the characteristics of the Wheat Sharp Eyespot control domain, 11 object attributes, 16 first-level data attributes, and 8 second-level data attributes were defined for the Wheat Sharp Eyespot control ontology by organizing and analyzing the associations among the parent concepts. Finally, Protégé was used to visualize and express the ontology concepts and inter-concept relationships. This study proposed a method for constructing a domain ontology for Wheat Sharp Eyespot control, described the basic method for constructing an ontology by building a corpus of Wheat Sharp Eyespot, gived a process framework for constructing a domain ontology, and described in detail the algorithms and construction tools used in the construction. The data source of this study was mainly scientific and technical literature, and the ontology can be extended in the future by further expanding the data source. The assessment part of the ontology mainly relied on the assessment of domain experts at present, and quantitative assessment can be added in the future.The Wheat Sharp Eyespot control domain ontology constructed in this study contained a more complete conceptual system of Wheat Sharp Eyespot, meeting the ontology evaluation criteria and ontology construction requirements, and can provide reference for the construction of domain ontology, and provide powerful support for knowledge discovery and downstream applications in the field of Wheat Sharp Eyespot prevention and control, such as intelligent Q&A, intelligent recommendation, and so on.
| [1] | BOEREMA G H, VERHOEVEN A A. Check-list for scientific names of common parasitic fungi. Series 2b: Fungi on field crops: Cereals and grasses[J]. Netherlands Journal of Plant Pathology, 1977, 83(5):165-204. DOI:10.1007/BF01976647. |
| [2] | 贾廷祥, 吴桂本, 刘传德. 我国小麦根腐性病害研究现状及防治对策[J]. 中国农业科学, 1995, 28(3):41-48. |
| [3] | 潘灏. 小麦纹枯病的发生及其防治措施[J]. 河南农业, 2016(7):39. |
| [4] | 姚丽瑾, 王琦, 付学池, 等. 小麦纹枯病生防芽孢杆菌的筛选及鉴定[J]. 中国生物防治, 2008(1):53-57. |
| [5] | 郑泳智, 朱定局, 吴惠粦, 等. 知识图谱问答领域综述[J]. 计算机系统应用, 2022, 31(4):1-13. |
| [6] | 李景. 领域本体的构建方法与应用研究[D]. 北京: 中国农业科学院, 2009. |
| [7] | CONSORTIUM T P O. The plant ontology consortium and plant ontologies[J]. Comp Funct Genomics, 2002, 3(2):137-142. DOI:10.1002/cfg.154. |
| [8] | 常春. 联合国粮食与农业组织AOS项目[J]. 农业图书情报学刊, 2003(2):14-15+24. |
| [9] | 戴才萍, 黄义德, 钱平, 等. 水稻病虫草害本体的构建研究[J]. 广东农业科学, 2011, 38(1):191-194. |
| [10] | 于合龙, 沈金梦, 毕春光, 等. 基于知识图谱的水稻病虫害智能诊断系统[J]. 华南农业大学学报, 2021, 42(5):105-116. |
| [11] | 齐红, 官莹莹, 刘亚波. 面向中文文本的玉米病虫害本体学习研究[J]. 计算机工程与应用, 2011, 47(20):206-209. |
| [12] | 张伶子, 段青玲, 李道亮. 玉米病虫害诊治本体构建技术研究[J]. 农机化研究, 2012, 34(1):41-45. |
| [13] | 刘桂锋, 杨倩, 刘琼. 农业科学数据集的本体构建与可视化研究——以“棉花病害防治”领域为例[J]. 情报杂志, 2022, 41(9):143-149+175. |
| [14] | 任妮, 孙艺伟, 鲍彤, 等. 农业领域本体构建方法研究——以番茄病虫害为例[J]. 情报探索, 2021(4):51-57. |
| [15] | DEEPA R, VIGNESHWARI S. An effective automated ontology construction based on the agriculture domain[J]. ETRI Journal, 2022, 44(4): 573-587. DOI:10.4218/etrij.2020-0439. |
| [16] | 许多, 鲁旺平, 许瑞清, 等. 基于农业时空多模态知识图谱的水稻精准施肥决策方法[J]. 华中农业大学学报, 2023, 42(3):281-292. |
| [17] | 赵应秋. 基于本体的语义检索模型的研究[D]. 重庆: 重庆大学, 2010. |
| [18] | 李恒杰, 李明. 基于本体的Web分类技术研究[J]. 微计算机信息, 2006(21):215-217. |
| [19] | 徐亚军. 基于本体的软件构件描述及其检索研究[D]. 上海: 东华大学, 2010. |
| [20] | 索俊锋, 刘勇, 邹松兵. 基于地理本体的综合语义相似度算法[J]. 兰州大学学报(自然科学版), 2017, 53(1):19-27. |
| [21] | USCHOLD M, GRUNINGER M. Ontologies: Principles, methods and applications[J]. Knowledge Engineering Review, 1996, 11 (2): 14-17. |
| [22] | GRüNINGER M, FOX M S. Methodology for the design and evaluation of ontologies[EB/OL]. [2011-08-09]. http://stl.mie.utoron-to.ca/publications/gruniger-ijcai95.pdf. |
| [23] | FERNDANDEZ M, GORMEZ-PEREZ A, JURISTO N. Methontology: From ontological art towards ontological engineering[C]// Proceedings of the AAA197 Spring Symposium on Ontological Engineering. Stanford University, March 24 - 26th, 1997:33 - 40. |
| [24] | MAEDCHE A, STAAB S. Ontology learning for the semantic Web LM. Boston: Kluwer Academic Publishers, 2001:8 - 13. |
| [25] | NOY N F, McGUINNESS D L. Ontology development 101 : A guide to creating your first ontology[EB/OL].[2010-12-20]. http://www.sfu.ca/marques/federation/pdf/Noy-Ontology101.pdf.2001. |
| [26] | 张磊. 基于叙词表和文献数据库的农业领域本体构建方法研究[D]. 北京: 中国农业科学院, 2011. |
| [27] | 李景. 本体理论及在农业文献检索系统中的应用研究——以花卉学本体建模为例[D]. 北京: 中国农业科学院, 2004. |
| [28] | CARLOS GUZMáN. Wheat ontology[EB/OL]. (2019)[2023]. https://cropontology.org/term/CO_321:ROOT. |
| [29] | Planteome. org. Repository for the plant disease ontology. https://github.com/Planteome/plant-disease-ont-ology,2016. |
| [30] | Ontology-based Research Group@IITM. https://sites.google.com/site/ontoworks/ontologies. [Online; accessed 8-2023]. |
| [31] | Ontology for Plant Protection. https://sites.google.com/site/ppontology/home. [Online; accessed 8-2023]. |
| [32] | 杜军平, 王岳, 薛哲, 等. 中文学术关键词抽取方法, 装置和存储介质:CN202110814460.8[P].CN113268995A[2023-10-06]. |
| [33] | 杨洁. 基于本体的柑橘病虫害知识建模及推理研究[D]. 武汉: 华中师范大学, 2014. |
| [34] | 李景, 孟连生. 构建知识本体方法体系的比较研究[J]. 现代图书情报技术, 2004(7):17-22. |
| [35] | 宋锦文. 基于知识蒸馏的农业知识图谱构建研究[D]. 哈尔滨: 哈尔滨工业大学, 2022. |
| [36] | 贾君枝, 刘艳玲. 本体评估指标体系的构建研究[J]. 图书情报工作, 2009, 53(12):100-103. |
| [37] | CAUFIELD J H, HEGDE H, EMONET V, et al. Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning[OL]. 2023. ArXiv:2304.02711. |
| [38] | JOACHIMIAK M P, CAUFIELD J H, HARRIS N L, et al. Gene set summarization using large language models[OL]. 2023. arXiv.2305. 13338. |
| [39] | CORCHO ó, GóMEZ-PéREZ A, GONZáLEZ-CABERO R, et al. ODEval: A Tool for Evaluating RDF(S), DAML+OIL, and OWL Concept Taxonomies[C]// IFIP International Federation for Information Processing, vol154. Springer, Boston,MA. https://doi.org/10.1007/1-4020-8151-0_32. |
| [40] | TARTIR S, ARPINAR I B, MOORE M, et al. OntoQA: Metric-based ontology quality analysis[C]// IEEE ICDM 2005 Workshop on Knowledge Acquisition from Distributed, Autonomous, Semantically Heterogeneous Data and Knowledge Sources At: Houston, Texas. https://api.semanticscholar.org/CorpusID:10753368. |
| [41] | FERNáNDEZ M, CANTADOR I, CASTELLS P. CORE: A tool for Collaborative Ontology Reuse and Evaluation[OL]. 2006. |
| [42] | LJILJANA STOJANOVIC, NENAD STOJANOVIC, JORGE GONZALEZ, et al. OntoManager-A System for the Usage-Based Ontology Management[C]// Springer, Berlin, Heidelberg. Lecture Notes in Computer Science (LNCS, volume 2888), 2003:858-875. |
| [43] | 贾君枝, 牛雅楠. 本体评估工具的比较分析[J]. 图书情报工作, 2010, 54(6):87-90. |
/
| 〈 |
|
〉 |