
H-FUSE: Efficient fusion of aggregated historical data

Liu, Z and Song, HA and Zadorozhny, V and Faloutsos, C and Sidiropoulos, N (2017) H-FUSE: Efficient fusion of aggregated historical data. In: Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017.


Abstract

In this paper, we address the challenge of recovering a time sequence of counts from aggregated historical data. For example, given a mixture of monthly and weekly sums, how can we find the daily counts of people infected with flu? In general, what is the best way to recover historical counts from aggregated, possibly overlapping historical reports, in the presence of missing values? Equally importantly, how much should we trust this reconstruction? We propose H-FUSE, a novel method that solves the above problems by allowing injection of domain knowledge in a principled way, and turning the task into a well-defined optimization problem. H-FUSE has the following desirable properties: (a) Effectiveness, recovering historical data from aggregated reports with high accuracy; (b) Self-awareness, providing an assessment of when the recovery is not reliable; (c) Scalability, computationally linear in the size of the input data. Experiments on real data (epidemiology counts from the Tycho project [13]) demonstrate that H-FUSE reconstructs the original data 30-81% better than the least squares method.
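To make the problem setup concrete, the following is a minimal sketch of recovering daily counts from weekly and monthly sums. It is not the H-FUSE algorithm itself: the abstract does not say which domain knowledge is injected, so the sketch assumes smoothness (neighbouring days have similar counts) as one plausible choice. The function names, the `smooth_weight` parameter, and the synthetic 28-day series are all hypothetical, and the sketch does not enforce non-negative counts.

```python
import numpy as np

def aggregation_matrix(n_days, reports):
    """Build A so that A @ daily_counts gives the reported sums.

    Each report is a (start_day, length) pair; reports may overlap.
    """
    A = np.zeros((len(reports), n_days))
    for row, (start, length) in enumerate(reports):
        A[row, start:start + length] = 1.0
    return A

def reconstruct_smooth(A, y, smooth_weight=1e-2):
    """Minimise ||A x - y||^2 + smooth_weight * ||D x||^2.

    D is the first-difference operator, (D x)[i] = x[i+1] - x[i].
    The smoothness penalty is an ASSUMED form of domain knowledge.
    """
    n = A.shape[1]
    D = np.diff(np.eye(n), axis=0)
    # Stack both terms into one ordinary least-squares problem.
    stacked_A = np.vstack([A, np.sqrt(smooth_weight) * D])
    stacked_y = np.concatenate([y, np.zeros(n - 1)])
    x, *_ = np.linalg.lstsq(stacked_A, stacked_y, rcond=None)
    return x

# Synthetic ground truth (illustrative only): a slowly rising daily count.
n_days = 28
truth = 10.0 + np.arange(n_days)

# Four weekly reports plus one report covering all 28 days.
reports = [(0, 7), (7, 7), (14, 7), (21, 7), (0, 28)]
A = aggregation_matrix(n_days, reports)
y = A @ truth

# Baseline: plain least squares. With more unknowns than reports, lstsq
# returns the minimum-norm solution, which is flat within each week.
x_plain, *_ = np.linalg.lstsq(A, y, rcond=None)

# Same data, with the assumed smoothness penalty added.
x_smooth = reconstruct_smooth(A, y)

def rmse(estimate):
    """Root-mean-square error against the synthetic ground truth."""
    return float(np.sqrt(np.mean((estimate - truth) ** 2)))

print("plain least squares RMSE:", rmse(x_plain))
print("smoothness-regularised RMSE:", rmse(x_smooth))
```

Because the reports alone leave the daily values underdetermined, the plain least squares baseline can only spread each weekly total evenly across its days. The extra penalty is what lets the reconstruction follow a trend within a week, which illustrates why the abstract frames injected domain knowledge as the key ingredient.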



Details

Item Type: Conference or Workshop Item (UNSPECIFIED)
Status: Published
Creators/Authors:
Liu, Z
Song, HA
Zadorozhny, V
Faloutsos, C
Sidiropoulos, N
Date: 1 January 2017
Date Type: Publication
Journal or Publication Title: Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
Page Range: 786 - 794
Event Type: Conference
DOI or Unique Handle: 10.1137/1.9781611974973.88
Schools and Programs: School of Information Sciences > Information Science
Refereed: Yes
ISBN: 9781611974874
Date Deposited: 30 Jun 2017 14:46
Last Modified: 28 Jul 2022 19:27
URI: http://d-scholarship.pitt.edu/id/eprint/32613
