Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

Social Science Data Repositories in Data Deluge: A Case Study at ICPSR Workflow and Practices

Jeng, Wei and He, Daqing and Chi, Yu (2017) Social Science Data Repositories in Data Deluge: A Case Study at ICPSR Workflow and Practices. The Electronic Library. (In Press)

Available under License : See the attached licence file.

Download (2MB) | Preview
[img] Plain Text (licence)
Available under License : See the attached licence file.

Download (1kB)


Design/methodology/approach: We conducted two focus group sessions and one individual interview with eight employees at the world’s largest social science data repository, the Interuniversity Consortium for Political and Social Research (ICPSR). By examining their current actions (activities regarding their work responsibilities) and IT practices, we studied the barriers and challenges of archiving and curating qualitative data at ICPSR. Purpose: Due to the recent surge of interest in the age of the data deluge, the importance of researching data infrastructures is increasing. The Open Archival Information System (OAIS) model has been widely adopted as a framework for creating and maintaining digital repositories. Considering that OAIS is a reference model that requires customization for actual practice, this study examines how the current practices in a data repository map to the OAIS environment and functional components. Findings: We observed that the OAIS model is robust and reliable in actual service processes for data curation and data archives. In addition, a data repository’s workflow resembles digital archives or even digital libraries. On the other hand, we find that: 1) the cost of preventing disclosure risk and 2) a lack of agreement on the standards of text data files are the most apparent obstacles for data curation professionals to handle qualitative data; 3) the maturation of data metrics seems to be a promising solution to several challenges in social science data sharing. Original value: We evaluated the gap between a research data repository’s current practices and the adoption of the OAIS model. We also identified answers to questions such as how current technological infrastructure in a leading data repository such as ICPSR supports their daily operations, what the ideal technologies in those data repositories would be, and the associated challenges that accompany these ideal technologies. Most importantly, we helped to prioritize challenges and barriers from the data curator’s perspective, and contribute implications of data sharing and reuse in social sciences.


Social Networking:
Share |


Item Type: Article
Status: In Press
CreatorsEmailPitt UsernameORCID
Jeng, Weiwej9@pitt.eduWEJ9
He, Daqingdah44@pitt.eduDAH44
Chi, Yuyuc73@pitt.eduYUC73
Date: 2017
Date Type: Publication
Journal or Publication Title: The Electronic Library
Publisher: Emerald
Schools and Programs: School of Information Sciences > Library and Information Science
Refereed: Yes
Article Type: Research Article
Date Deposited: 12 May 2017 17:42
Last Modified: 25 Aug 2017 04:55


Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

View Item View Item