“面向高质量共享的科学数据安全”专刊(上)

地学领域科学数据处理与分析软件自主性分析

展开
  • 1.中国科学院地理科学与资源研究所 资源与环境信息系统国家重点实验室,北京 100101
    2.中国科学院大学,北京 100049
    3.江苏省地理信息资源开发与利用协同创新中心,南京 210023
    4.中国科学院科技战略咨询研究院,北京 100190
王卷乐,E-mail:wangjl@igsnrr.ac.cn
*

收稿日期: 2024-04-08

  录用日期: 2024-05-07

  网络出版日期: 2024-07-03

基金资助

国家重点研发计划项目(2022YFF0711600);国家科技基础条件平台委托任务(2023WT14)

Analysis of Autonomy in Geosciences Data Processing and Analysis Software

Expand
  • 1. State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
    2. College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China
    3. Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China
    4. Institutes of Science and Development, Chinese Academy of Sciences, Beijing 100190, China

Received date: 2024-04-08

  Accepted date: 2024-05-07

  Online published: 2024-07-03

摘要

科学数据的重要性已得到广泛共识,随着科学数据的不断积累,其数据处理分析软件的能力水平将成为科学数据能否高效发挥作用的关键和瓶颈。地球科学领域研究涉及多尺度、多类型、多来源的数据,其数据处理与分析软件的需求极为强烈。本研究针对地球科学领域的特点,分析其主要数据处理和分析软件的现况,辨识我国的软件自主程度。调研分析涉及地理、海洋、地质、大气、生态、灾害、农业等16个专题,选取了177个主流软件/工具,重点获取其软件/工具名称、概况、主要功能、应用服务/典型案例、优缺点、对标产品等指标。分析发现地学科学数据处理与分析领域软件/工具完全开放(开源)的占三分之二,商业的、限制性的或开放程度不明确的占三分之一。主要软件/工具开发国家有美国、中国、加拿大、英国等以及一些国际组织。从专题分布看,主要体现在土地退化、人口社会经济、知识图谱、遥感处理等。自主保障风险较高的软件主要在空间化、大气、野火、冻土等专题。约三分之一的专业软件/工具可以直接应用到国家科学数据中心,且可与云平台结合。结合人工智能时代和“数据要素×”的发展,从五个方面提出加强我国自主科学数据处理软件/工具的开发应用建议。

本文引用格式

王卷乐, 李凯, 段博文, 苏娜 . 地学领域科学数据处理与分析软件自主性分析[J]. 农业大数据学报, 2024 , 6(2) : 230 -240 . DOI: 10.19788/j.issn.2096-6369.000046

Abstract

The importance of scientific data has been widely recognized, and as scientific data continues to accumulate, the capability of its data processing software will become a key bottleneck in determining whether scientific data can be effectively utilized. The field of Earth science involves multi-scale, multi-type, and multi-source data in research, leading to a strong demand for data processing and analysis software. This study, aimed at the characteristics of the earth science field, analyzes the current state of its main data processing and analysis software, identifies the degree of software autonomy in China, and expects to propose corresponding development suggestions. The survey covers 16 topics including geography, oceanography, geology, atmospheric sciences, ecology, disasters, agriculture, etc., and selects 177 mainstream software/tools, focusing on obtaining indicators such as software/tool names, summaries, main functions, application services/typical cases, advantages and disadvantages, and benchmarking software. The analysis found that these software/tools in the field of geoscience data processing and analysis are completely open (open source) accounting for two-thirds, the last one-thirds are commercial, restrictive, or unknown openness. The main software/tools are developed in countries such as the United States, China, Canada, the United Kingdom, and some international organizations. From the perspective of topic distribution, this is mainly reflected in the following areas: land degradation, socio-economic demographics, knowledge graphs, and remote sensing big data processing. From the perspective of autonomy, the main high-risk software packages are mainly distributed in fields such as spatialization, atmosphere, wildfires, and permafrost. Among the surveyed software/tools, about one-third of the professional software/tool can be applied to the National Science Data Center, and can be used in Cloud Platform. Combining the era of artificial intelligence and the development of "Data Element X", the future should strongly enhance the development and deployment application of China's autonomous scientific data processing software/tool from 5 perspectives.

参考文献

[1] 王卷乐, 王明明, 石蕾, 等. 科学数据管理态势及其对我国地球科学领域的启示.地球科学进展[J], 2019, 34(3):306-315.
[2] 王卷乐, 石蕾, 王淑强. 国际科学数据资源管理概述[M]. 北京: 科学技术文献出版社, 2021.
[3] 张玉娥, 王永珍. 欧盟科研数据管理与开放获取政策及其启示——以“欧盟地平线2020”计划为例[J]. 图书情报工作, 2017, 61(13): 70-76.
[4] 黄雨婷, 傅文奇. 日本政府数据开放的政策保障及其启示[J]. 数字图书馆论坛, 2020(9):9-17.
[5] 科技部财政部关于发布国家科技资源共享服务平台优化调整名单的通知(国科发基〔2019〕194 号)[EB/OL]. http://www.most.gov.cn/xxgk/xinxifenlei/fdzdgknr/qtwj/qtwj2019/201906/t20190610_147031.html.
[6] 石蕾, 高孟绪, 徐波, 等. 欧美建设发展科学数据中心的经验及对我国的启示[J]. 中国科技资源导刊, 2022, 54(3):31-36+110.
[7] 国家数据局等部门关于印发《“数据要素×”三年行动计划(2024—2026年)》的通知(国数政策〔2023〕11号). [EB/OL]. https://mp.weixin.qq.com/s/YyhLQo4lZIFNMiyupdvO1A.
[8] Oluwaseun O O, Uzochukwu B, Aderemi A. Data Analytics in Nigeria: Challenges, Opportunities and Future Directions[C]// Proceedings of the 2019 2nd International Conference on Computational Intelligence and Intelligent Systems (CIIS 2019) (pp. 110-114). Association for Computing Machinery.
[9] 廖方宇, 胡良霖, 王健, 等. 科学数据安全标准研究与工作建议[J]. 科学通报, 2024, 69(9):1142-1148.
[10] 中华人民共和国网络安全法[EB/OL]. https://www.cac.gov.cn/2016-11/07/c_1119867116.htm?eqid=ac15f20d0001418a00000006648a8b25.
[11] 中华人民共和国数据安全法[EB/OL]. https://www.gov.cn/xinwen/2021-06/11/content_5616919.htm.
[12] 中华人民共和国个人信息保护法[EB/OL]. https://www.gov.cn/xinwen/2021-08/20/content_5632486.htm?eqid=898a0e2f0002ace800000003648fb2c2.
[13] 王珺, 周小林, 沈云怡, 等. 科学数据跨境流动治理的现状、思考与展望[J/OL]. 科学通报, 2024:1-11.
[14] 国家科技基础条件平台中心. 中国科学数据资源发展研究报告[M]. 北京: 科学技术文献出版社, 2022.
文章导航

/