农业大数据学报 ›› 2024, Vol. 6 ›› Issue (3): 325-332.doi: 10.19788/j.issn.2096-6369.000053

• “面向高质量共享的科学数据安全”专刊(下) • 上一篇    下一篇

生命组学大数据安全管理实践

王彦青1,2(), 陈婷婷1,2(), 张思思1,2(), 朱军伟1,2, 陈焕新1,2, 肖景发1,2,3, 宋述慧1,2,3, 章张1,2,3, 赵文明1,2,3,*(), 鲍一明1,2,3,*()   

  1. 1.国家生物信息中心,国家基因组科学数据中心, 北京 100101
    2.中国科学院北京基因组研究所, 北京 100101
    3.中国科学院大学, 北京 100049
  • 收稿日期:2024-06-13 接受日期:2024-09-13 出版日期:2024-09-26 发布日期:2024-10-01
  • 通讯作者: 赵文明,E-mail:zhaowm@big.ac.cn
    鲍一明,E-mail:baoym@big.ac.cn
  • 作者简介:王彦青,E-mail:wangyanqing@big.ac.cn
    陈婷婷,E-mail:chentt@big.ac.cn
    张思思,E-mail:zhangss@big.ac.cn
    第一联系人:王彦青、陈婷婷、张思思为同等贡献作者。
  • 基金资助:
    国家重点研发计划(2023YFC2605700);国家重点研发计划(2023YFC2604400);国家重点研发计划(2021YFF0703704);中国科学院基因组科学数据中心运行维护(CAS-WX2022SDC-XK05)

Practice of Security Management of Omics Big Data in Life Sciences

WANG YanQing1,2(), CHEN TingTing1,2(), ZHANG SiSi1,2(), ZHU JunWei1,2, CHEN HuanXin1,2, XIAO JingFa1,2,3, SONG ShuHui1,2,3, ZHANG Zhang1,2,3, ZHAO WenMing1,2,3,*(), BAO YiMing1,2,3,*()   

  1. 1. National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China
    2. Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
    3. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2024-06-13 Accepted:2024-09-13 Published:2024-09-26 Online:2024-10-01

摘要:

生命组学大数据是国家重要基础性、战略性资源,对支撑生命科学基础研究和应用创新、推动生物经济创新发展、维护国家安全具有重要意义。随着数据规模的不断增长,生命组学大数据的安全管理问题逐渐凸显。国家基因组科学数据中心(National Genomics Data Center, NGDC)面向我国人口健康和社会可持续发展的重大战略需求,建立了生命与健康大数据汇交存储、安全管理、开放共享与整合挖掘研究体系,形成了一系列数据安全管理的制度和措施。本文聚焦于生命组学大数据全生命周期的安全管理问题,探讨生命组学大数据安全管理框架,全面分析在数据汇交、存储、管理、共享全生命周期中涉及的安全管理内容,并总结了NGDC在生命组学大数据安全管理方面的成效。最后,本文展望了生命组学大数据安全管理的发展方向,包括完善数据分级分类制度、提升数据分级安全管理技术和加强数据异地灾备建设,以期实现生命组学大数据的安全管理与可持续发展。

关键词: 生命组学大数据, 数据汇交, 数据共享, 安全管理

Abstract:

Omics big data is a significant foundational and strategic resource for the country, which plays an important role in supporting the basic research and application innovation of life sciences, promoting the innovative development of bioeconomy, and maintaining national security. With the rapid accumulation of omics data, the security of data management has become increasingly prominent. Facing the major strategic needs of China's population health and sustainable social development, the National Genomics Data Center (NGDC) has established a comprehensive research architecture for collecting, storing, managing, sharing, and mining of big data in omics, forming a series of practices and measures for the security management of the data. This paper delves into the issues of security management of omics big data throughout its lifecycle, elaborating on NGDC's security management measures implemented in the collecting, storing, managing and sharing of the data. Furthermore, it summarizes NGDC’s achievements in the security management of omics big data. Finally, this paper envisions the future directions for the security management of omics big data, including enhancing the data classification and categorization system, enhancing data hierarchical security management technologies and strengthening the construction of off-site disaster recovery, in order to achieve the security management and sustainable development of omics big data in life sciences.

Key words: omics big data, data archive, data sharing, security management