

Amirpour Amraii, Saman (2018) HUMAN-DATA INTERACTION IN LARGE AND HIGH-DIMENSIONAL DATA. Doctoral Dissertation, University of Pittsburgh. (Unpublished)



Human-Data Interaction (HDI) is an emerging field that studies how humans make sense of large and complex data. Visual analytics tools are a central component of this sensemaking process. However, the growth of big data has degraded their performance, resulting in interaction latency or long query-response times, both of which impair the user's ability to discover knowledge. To address these challenges, a new paradigm of data exploration has appeared in which a rapid but inaccurate initial result is followed by a succession of gradually more accurate answers. As the primary objective of this thesis, we investigated how this incremental latency affects the quantity and quality of knowledge discovery in an HDI system. We developed a big data visualization tool and studied 40 participants in a think-aloud experiment as they used the tool to explore a large, high-dimensional dataset. Our findings indicate that although incremental latency reduces the rate at which discoveries are generated, it does not affect the chance of making a discovery per generated visualization, and it does not affect the correctness of those discoveries. However, in the presence of latency, using contextual layers such as a map results in fewer mistakes, whereas exploring higher-dimensional visualizations leads to more incorrect discoveries. As the secondary objective, we investigated which strategies improved a subject's performance. Our observations suggest that successful participants explore the data methodically, first examining simple and familiar concepts and then gradually adding complexity to the visualizations, until they build a correct mental model of the inner workings of the tool. With this model, they generate several discovery patterns, each acting as a blueprint for forming new insights. Ultimately, some participants combined their discovery patterns to create multifaceted data-driven stories.
Based on these observations, we propose design guidelines for developing HDI platforms for large and high-dimensional data.




Item Type: University of Pittsburgh ETD
Status: Unpublished
Creators:
Creators | Email | Pitt Username | ORCID
Amirpour Amraii, Saman | saa78@pitt.edu | saa78 |
ETD Committee:
Title | Member | Email Address | Pitt Username | ORCID
Committee Chair | Lewis, Michael | cmlewis@pitt.edu | cmlewis |
Committee CoChair | Nourbakhsh, | | |
Committee Member | Schunn, Christian D. | schunn@pitt.edu | SCHUNN |
Committee Member | Lin, Yu-Ru | yurulin@pitt.edu | YURULIN |
Committee Member | Sargent, | | |
Date: 31 January 2018
Date Type: Publication
Defense Date: 27 November 2017
Approval Date: 31 January 2018
Submission Date: 7 December 2017
Access Restriction: No restriction; Release the ETD for access worldwide immediately.
Number of Pages: 191
Institution: University of Pittsburgh
Schools and Programs: Dietrich School of Arts and Sciences > Intelligent Systems
Degree: PhD - Doctor of Philosophy
Thesis Type: Doctoral Dissertation
Refereed: Yes
Uncontrolled Keywords: Human-Data Interaction, Visual Analytics, Knowledge Discovery, Big Data, Visualization, Human-Computer Interaction
Date Deposited: 31 Jan 2018 14:15
Last Modified: 31 Jan 2018 14:15

