Data Reliability Assessment based on subjective opinionsZhang, Danchen (2021) Data Reliability Assessment based on subjective opinions. Doctoral Dissertation, University of Pittsburgh. (Unpublished) This is the latest version of this item.
AbstractIn the big data era, numerous data fluctuates society and people's life. These data come from diverse sources, and various information can be inferred and extracted. However, data quality usually cannot be guaranteed, and hence decision making with such unreliable data may lead to considerable losses. Accurate data reliability assessment mechanisms can help recognize the distrustful information and then filter unreliable data out. In this work, I consider a novel approach to assess data reliability based on subjective opinions. I structure the data propagation model in terms of data sources producing and evaluating different statements. Next, I explore data history labels, value conflicts, and uncertainty. For different combinations of those parameters, I consider common scenarios, including handling fake news, truth discovery, data cleaning, as well as discovering cancer-driving genes. In my dissertation, I explore how to accurately assess data reliability and how to make a decision based on evaluated reliability. I propose a series of subjective opinion based models to assess each scenario's reliability and compare them with state-of-art models through experiments on real-world data. Share
Details
Available Versions of this ItemMetricsMonthly Views for the past 3 yearsPlum AnalyticsActions (login required)
|