Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

HomeRun: Scalable Sparse-Spectrum Reconstruction of Aggregated Historical Data.

Almutairi, Faisal and Yang, Fan and Song, Hyun Ah and Faloutsos, Christos and Sidiropoulos, Nicholas and Zadorozhny, Vladimir (2018) HomeRun: Scalable Sparse-Spectrum Reconstruction of Aggregated Historical Data. Journal Proceedings of the VLDB Endowment (PVLDB). (In Press)

Abstract

Recovering a time sequence of events from multiple aggregated and possibly overlapping reports is a major challenge in historical data fusion. The goal is to reconstruct a higher resolution event
sequence from a mixture of lower resolution samples as accurately as possible. For example, we may aim to disaggregate overlapping monthly counts of people infected with measles into weekly counts.
In this paper, we propose a novel data disaggregation method, called HOMERUN, that exploits an alternative representation of the sequence and finds the spectrum of the target sequence. More specifically, we formulate the problem as so-called basis pursuit using the Discrete Cosine Transform (DCT) as a sparsifying dictionary and impose non-negativity and smoothness constraints. HOMERUN utilizes the energy compaction feature of the DCT by finding the sparsest spectral representation of the target sequence that contains the largest (most important) coefficients. We leverage the Alternating Direction Method of Multipliers to solve the resulting optimization problem with scalable and memory efficient steps. Experiments using real epidemiological data show that our method considerably outperforms the state-of-the-art techniques, especially when the DCT of the sequence has a high degree of energy compaction.


Share

Citation/Export:
Social Networking:
Share |

Details

Item Type: Article
Status: In Press
Creators/Authors:
CreatorsEmailPitt UsernameORCID
Almutairi, Faisal
Yang, FanFAY28@pitt.eduFAY28
Song, Hyun Ah
Faloutsos, Christos
Sidiropoulos, Nicholas
Zadorozhny, Vladimirviz@pitt.eduviz0000-0001-6420-1926
Date: 2018
Date Type: Publication
Journal or Publication Title: Journal Proceedings of the VLDB Endowment (PVLDB)
Schools and Programs: School of Computing and Information > Information Science
Refereed: Yes
Article Type: Research Article
Date Deposited: 05 Jul 2018 18:42
Last Modified: 05 Jul 2018 18:42
URI: http://d-scholarship.pitt.edu/id/eprint/34840

Metrics

Monthly Views for the past 3 years

Plum Analytics


Actions (login required)

View Item View Item