Journal of Agricultural Big Data
Practice of Security Management of Omics Big Data in Life Sciences
Received date: 2024-06-13
Accepted date: 2024-09-13
Online published: 2024-10-01
Omics big data is a foundational and strategic national resource that supports basic research and applied innovation in the life sciences, promotes the innovative development of the bioeconomy, and helps maintain national security. With the rapid accumulation of omics data, the security of data management has become an increasingly prominent concern. To meet China's major strategic needs in population health and sustainable social development, the National Genomics Data Center (NGDC) has established a comprehensive research architecture for collecting, storing, managing, sharing, and mining omics big data, and has developed a series of practices and measures for managing these data securely. This paper examines security management issues across the lifecycle of omics big data and describes the security measures NGDC has implemented for data collection, storage, management, and sharing. It then summarizes NGDC's achievements in the security management of omics big data. Finally, it outlines future directions, including improving the data classification and categorization system, advancing hierarchical data security management technologies, and strengthening off-site disaster recovery, in order to achieve secure management and sustainable development of omics big data in the life sciences.
Key words: omics big data; data archive; data sharing; security management
WANG YanQing, CHEN TingTing, ZHANG SiSi, ZHU JunWei, CHEN HuanXin, XIAO JingFa, SONG ShuHui, ZHANG Zhang, ZHAO WenMing, BAO YiMing. Practice of Security Management of Omics Big Data in Life Sciences[J]. Journal of Agricultural Big Data, 2024, 6(3): 325-332. DOI: 10.19788/j.issn.2096-6369.000053