
Theoretical Research on Data Diversity

Abstract: Diversity is an essential attribute of data, especially scientific data. Against the background of rapidly developing information technology and the era of open research data, the diversity of data has become increasingly apparent. This paper first elaborates on the internal and external manifestations of data diversity. The internal manifestations are the different objects involved in the scientific data production process, the trinity of data publishing, and the different data formats that different disciplines use when collecting and temporarily storing data. The external manifestations are that the data lifecycle accelerates data diversity, that the research lifecycle increases data diversity, and that further diversity arises as data are shaped in specific applications. The paper then briefly describes the common features and influencing factors of data diversity, and introduces the application representation of data diversity from three aspects. For libraries and librarians, recognizing data diversity can, to some extent, help researchers cope with data deposit tasks and the pressure of data disclosure, making data reuse simpler and consistent with an ideal data ecosystem. A data librarian therefore needs data management skills and knowledge of the laws, regulations, policies, and agreements relating to data ethics, in order to provide researchers with value-added data services.

Version history

[V1] 2021-11-24 13:50:08 chinaXiv:202111.00029V1
Peer review status: pending review