农业大数据学报 ›› 2026, Vol. 8 ›› Issue (1): 98-112.doi: 10.19788/j.issn.2096-6369.000151

• 数据资源 • 上一篇    下一篇

大豆多组学数据资源全景导航

曹永荣1,2,3,5,#(), 任思伟1,2,3,5,#, 谢海霞1,2,3,5,#, 邵洲秦1,2,3,4,5,#, 田东梅1,2,3,*(), 宋述慧1,2,3,4,5,*()   

  1. 1 国家生物信息中心国家基因组科学数据中心北京 100101
    2 国家生物信息中心生物大数据智能治理与应用北京市重点实验室北京 100101
    3 中国科学院北京基因组研究所北京 100101
    4 中国科学院大学中丹学院北京 100049
    5 中国科学院大学北京 100049
  • 收稿日期:2026-01-07 接受日期:2026-03-18 出版日期:2026-03-26 发布日期:2026-04-01
  • 通讯作者: *宋述慧,E-mail: songshh@big.ac.cn
    田东梅,E-mail: tiandm@big.ac.cn
  • 作者简介:曹永荣,E-mail: caoyongrong@big.ac.cn

    曹永荣、任思伟、谢海霞、邵洲秦对该文有同等贡献。

  • 基金资助:
    国家重点研发计划[2025YFF1207901];中国科学院战略部署先导A类专项[XDA0460405];中国科学院信息化专项(CAS-WX2024GC-0602)

A Panoramic Guide to Multi-Omics Data Resources for Soybean

CAO YongRong1,2,3,5,#(), REN SiWei1,2,3,5,#, XIE HaiXia1,2,3,5,#, SHAO ZhouQin1,2,3,4,5,#, TIAN DongMei1,2,3,*(), SONG ShuHui1,2,3,4,5,*()   

  1. 1 National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
    2 Beijing Key Laboratory of Intelligent Governance and Application of Biological Big Data, China National Center for Bioinformation, Beijing 100101, China
    3 Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
    4 Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, China
    5 University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2026-01-07 Accepted:2026-03-18 Published:2026-03-26 Online:2026-04-01
  • Contact: *Corresponding authors.
  • About author:

    #These authors contributed equally to this work

摘要:

随着测序技术和高通量表型采集手段的迅速发展,大豆研究进入多组学数据快速积累的阶段,涵盖基因组、转录组、表观遗传及表型组等多维度信息,并催生了SoyBase、SoyOD、SoyMD、SoyOmics等一系列专业数据库,为功能基因解析与分子育种应用提供了丰富的数据资源。本综述系统梳理现有大豆多组学数据资源与数据库平台,归纳其数据类型、组织逻辑与功能定位,分析各平台间的互补关系,并总结多组学整合应用的最新进展。本综述旨在为研究人员提供系统、清晰且具有可操作性的大豆数据资源指引,促进各类组学数据的高效整合与利用,助力精准育种及性状遗传机制的深入研究。

关键词: 大豆, 数据库, 多组学, 基因组, 变异组, 转录组, 蛋白组, 代谢组, 表观组, 表型组

Abstract:

With the rapid development of sequencing technologies and high-throughput phenotyping approaches, soybean research has entered an era of rapid accumulation of multi-omics data. These data encompass multiple dimensions, including genomics, transcriptomics, epigenomics, and phenomics, and have driven the establishment of a series of specialized databases such as SoyBase, SoyOD, SoyMD, and SoyOmics. Together, these resources provide a solid data foundation for functional gene discovery and molecular breeding applications. In this review, we systematically summarize currently available soybean multi-omics data resources and database platforms, highlighting their data types, organizational frameworks, and functional characteristics. We further analyze the complementarity among these platforms and review recent advances in the integrative application of multi-omics data. This review aims to provide researchers with a systematic, clear, and practical guide to soybean data resources, facilitating the efficient integration and utilization of diverse omics datasets and supporting precision breeding and in-depth studies of the genetic mechanisms underlying complex traits.

Key words: soybean, database, multi-omics, genome, variome, transcriptome, proteome, metabolome, epigenome, phenome