数据处理与分析

基于图数据库的农业多本体解析导入方法

  • 陈晓静 ,
  • 李威 ,
  • 樊景超 ,
  • 闫燊 ,
  • 张建华 ,
  • 周国民
展开
  • 1.中国农业科学院农业信息研究所/农业农村部农业大数据重点实验室/国家农业科学数据中心北京 100081
    2.中国农业科学院国家南繁研究院海南三亚 572024
    3.中国农业科学院作物科学研究所北京 100081
    4.农业农村部南京农业机械化研究所 南京 210014
    5.中国农业科学院西部农业研究中心新疆昌吉 831100
陈晓静,E-mail:82101225580@caas.cn
李威,E-mail:yjzltd@qq.com
周国民,E-mail:zhouguomin@caas.cn
张建华,E-mail:zhangjianhua@caas.cn

收稿日期: 2025-07-29

  修回日期: 2025-10-20

  网络出版日期: 2025-12-26

基金资助

国家重点研发计划(2022YFF0711800);中国农业科学院国家南繁研究院南繁专项(YBXM2448);中国农业科学院国家南繁研究院南繁专项(YBXM2340);中国农业科学院国家南繁研究院南繁专项(YBXM2409);中国农业科学院国家南繁研究院南繁专项(YBXM2410);中国农业科学院国家南繁研究院南繁专项(YBXM2430);中国农业科学院国家南繁研究院南繁专项(YBXM2508);中国农业科学院国家南繁研究院南繁专项(YBXM2509);中央级公益性科研院所基本科研业务费专项(JBYW-AII-2025-05);中央级公益性科研院所基本科研业务费专项(Y2025YC90);国家农业科学数据中心项目(NASDC2025XM11)

A Method for Parsing and Importing Agricultural Multi-Ontologies Based on Graph Databases

  • CHEN XiaoJing ,
  • LI Wei ,
  • FAN JingChao ,
  • YAN Shen ,
  • ZHANG JianHua ,
  • ZHOU GuoMin
Expand
  • 1. Agricultural Information Institute of CAAS / Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs / National Agriculture Science Data Center, Beijing 100081, China
    2. National Nanfan Research Institute of CAAS, Sanya 572024, Hainan, China
    3. Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    4. Nanjing Research Institute of Agricultural Mechanization, Ministry of Agriculture and Rural Affairs, Nanjing 210014, China
    5. Western Research Institute, CAAS, Changji 831100, Xinjiang, China

Received date: 2025-07-29

  Revised date: 2025-10-20

  Online published: 2025-12-26

摘要

统一组织结构复杂、规模庞大的农业本体,对于消除平台之间的数据孤岛、优化农业知识的标准化表达以及提高信息检索效率具有重要意义。本研究利用图数据库在存储本体方面的天然结构优势,创新性提出一种将面向OBO和OWL两种格式的大规模农业本体数据导入图数据库的方法。该方法首先按语义信息拆分解析OBO本体,同时通过消除冗余概念和前缀资源解析OWL本体,其次,在减少存储压力的需求下,进一步设计了编码方案和基于共现次数的属性关系筛选,最后智能化建模和映射,将本体存储进图数据库中,完成具备167 887个实体和249 603条关系的农业多本体数据库的构建。实体和关系对比分析结果表明,该方法在保留本体内部结构的同时,也保留了广泛的本体间知识链接,案例分析结果证明,多本体解析融合机制可以直观有效地构建跨本体知识交互。该方法有助于进一步推动农业本体的重用和共享,有效提升了农业信息资源标准化程度,所构建的农业多本体集成知识库为农业语义搜索、知识深度挖掘和智能化农业管理决策奠定了坚实的数据基础。

本文引用格式

陈晓静 , 李威 , 樊景超 , 闫燊 , 张建华 , 周国民 . 基于图数据库的农业多本体解析导入方法[J]. 农业大数据学报, 2025 , 7(4) : 431 -445 . DOI: 10.19788/j.issn.2096-6369.000125

Abstract

Integrating complex and large-scale agricultural ontologies into a unified framework is crucial for eliminating data silos across platforms, optimizing the standardization of agricultural knowledge representation, and enhancing information retrieval efficiency. Leveraging the inherent structural advantages of graph databases in ontology storage, this study proposes an innovative method for importing large-scale agricultural ontology data in both OBO and OWL formats into a graph database. The method first involves semantically parsing and splitting OBO ontologies, while simultaneously processing OWL ontologies through the elimination of redundant concepts and resolution of prefixed resources. To reduce storage overhead, an encoding scheme and a co-occurrence frequency-based attribute-relation filtering strategy are further designed. Finally, intelligent modeling and mapping are performed to store the ontologies within the graph database, resulting in the construction of an agricultural multi-ontology database comprising 167,887 entities and 249,603 relationships. Comparative analysis of entities and relationships demonstrates that the proposed method effectively preserves both internal ontological structures and extensive inter-ontological knowledge links. Case studies confirm that the multi-ontology parsing and integration mechanism enables intuitive and effective cross-ontology knowledge interaction. This approach facilitates the reuse and sharing of agricultural ontologies, significantly improving the standardization of agricultural information resources. The constructed integrated agricultural multi-ontology knowledge base provides a robust data foundation for semantic search, deep knowledge mining, and intelligent decision-making in agriculture.

参考文献

[1] 陈宝发, 任妮. 面向农业学者领域的本体构建及可视化研究. 江苏农业科学, 2023, 51(18): 191-200. DOI:10.15889/j.issn.1002-1302.2023.18.028.
  CHEN B F, REN N. Ontology construction and visualization in the field of agricultural scholars. Jiangsu Agricultural Sciences, 2023, 51(18): 191-200. DOI:10.15889/j.issn.1002-1302.2023.18.028.
[2] GOLDSTEIN A, FINK L, RAVID G. A framework for evaluating agricultural ontologies. Sustainability, 2021, 13(11): 6387.
[3] ZHENG Y L, HE Q Y, QIAN P, LI Z. Construction of the ontology-based agricultural knowledge management system. Journal of Integrative Agriculture, 2012, 11(5): 700-709. DOI:10.1016/S2095-3119(12)60059-8.
[4] FONOU-DOMBEU J V, NAIDOO N, RAMNANAN M, et al. OntoCSA: A climate-smart agriculture ontology. International Journal of Agricultural and Environmental Information Systems (IJAEIS), 2021, 12(4): 1-20.
[5] 徐勇, 安祥生, 王志强. 基于农业资源分类的农业资源本体架构设计. 农业网络信息, 2009 (10): 8-12+27.
  XU Y, AN X S, WANG Z Q. Design of agricultural resource ontology architecture based on agricultural resource classification. Agriculture Network Information, 2009(10): 8-12+27.
[6] 张善庄, 刘怀亮, 赵舰波, 等. 领域顶层本体研究:模型与构建方法. 情报杂志, 2024, 43(7): 112-121.
  ZHANG S Z, LIU H L, ZHAO J B, et al. Research on domain upper ontology: Model and construction method. Journal of Intelligence, 2024, 43(7): 112-121.
[7] 苏玉宁, 姜艺, 陈贺胜, 等. 基于Ontology的农业科学领域知识库构建. 江苏农业科学, 2018, 46(5): 194-198. DOI:10.15889/j.issn.1002-1302.2018.05.052.
  SU Y N, JIANG Y, CHEN H S, et al. Construction of knowledge base in agricultural science field based on ontology. Jiangsu Agricultural Sciences, 2018, 46(5): 194-198. DOI:10.15889/j.issn.1002-1302.2018.05.052.
[8] 黄奇, 钱韵洁, 袁勤俭, 等. 基于图形数据库的OWL本体存储模型研究. 情报学报, 2019, 38(3): 310-321.
  HUANG Q, QIAN Y J, YUAN Q J, et al. Research on OWL ontology storage model based on graph database. Journal of the China Society for Scientific and Technical Information, 2019, 38(3): 310-321.
[9] 侯琛, 牛培宇. 农业知识图谱技术研究现状与展望. 农业机械学报, 2024, 55(6): 1-17.
  HOU C, NIU P Y. Research status and prospect of agricultural knowledge graph technology. Transactions of the Chinese Society for Agricultural Machinery, 2024, 55(6): 1-17.
[10] 张慧, 侯霞, 李宁. 本体存储方法研究. 北京信息科技大学学报(自然科学版), 2016, 31(3): 59-63.
  ZHANG H, HOU X, LI N. A survey of research on ontology storage methods. Journal of Beijing Information Science & Technology University (Natural Science Edition), 2016, 31(3): 59-63.
[11] QI C L, SONG Q, ZHANG P Z, YUAN H. Cn-MAKG: China Meteorology and Agriculture Knowledge Graph Construction Based on Semi-Structured Data// In 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS). IEEE, 2018: 692-696. DOI: 10.1109/ICIS.2018.8466485.
[12] SHI Y X, ZHANG B K, WANG Y X, et al. Constructing crop portraits based on graph databases is essential to agricultural data mining. Information, 2021, 12(6): 227. DOI:10.3390/info12060227.
[13] AYDIN S, AYDIN M N. Ontology-based data acquisition model development for agricultural open data platforms and implementation of OWL2MVC tool. Computers and Electronics in Agriculture, 2020, 175: 105589.
[14] POKORNY J. Graph Databases: Their Power and Limitations// IFIP International Conference on Computer Information Systems and Industrial Management. Springer International Publishing, Cham, 2015: 58-69.
[15] LOPEZ-VEYNA J I, CASTILLO-ZU?IGA I, ORTIZ-GARCIA M. A Review of Graph Databases // International Conference on Software Process Improvement. Springer International Publishing, Cham, 2022: 180-195.
[16] BHATTACHARYYA A, CHAKRAVARTY D. Graph Database: A Survey // 2020 International Conference on Computer, Electrical & Communication Engineering (ICCECE). IEEE, 2020: 1-8.
[17] ANGLES R. A Comparison of Current Graph Database Models // 2012 IEEE 28th International Conference on Data Engineering Workshops. IEEE, 2012: 171-177.
[18] RUBIN D L, NOY N F, MUSEN M A. Protégé: A tool for managing and using terminology in radiology applications. Journal of Digital Imaging, 2007, 20(Suppl 1): 34-46.
文章导航

/