Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

Evaluation of Rules for Coping with Insufficient Data in Constraint-based Search Algorithms

de Jongh, Martijn and Druzdzel, Marek J. (2014) Evaluation of Rules for Coping with Insufficient Data in Constraint-based Search Algorithms. In: Probabilistic Graphical Models. Springer Lecture Notes in Computer Science, 8754 . Springer International Publishing, Heidelberg, 238 - 253. ISBN UNSPECIFIED

[img] Plain Text (licence)
Available under License : See the attached license file.

Download (1kB)

Abstract

A fundamental step in the PC causal discovery algorithm consists of testing for (conditional) independence. When the number of data records is very small, a classical statistical independence test is typically unable to reject the (null) independence hypothesis. In this paper, we are comparing two conflicting pieces of advice in the literature that in case of too few data records recommend (1) assuming dependence and (2) assuming independence. Our results show that assuming independence is a safer strategy in minimizing the structural distance between the causal structure that has generated the data and the discovered structure. We also propose a simple improvement on the PC algorithm that we call blacklisting. We demonstrate that blacklisting can lead to orders of magnitude savings in computation by avoiding unnecessary independence tests.


Share

Citation/Export:
Social Networking:
Share |

Details

Item Type: Book Section
Status: Published
Creators/Authors:
CreatorsEmailPitt UsernameORCID
de Jongh, Martijn
Druzdzel, Marek J.druzdzel@pitt.eduDRUZDZEL
Date: 2014
Date Type: Publication
Series Name: Springer Lecture Notes in Computer Science
Volume: 8754
Publisher: Springer International Publishing
Place of Publication: Heidelberg
Page Range: 238 - 253
DOI or Unique Handle: 10.1007/978-3-319-11433-0_13
Schools and Programs: School of Information Sciences > Information Science
Refereed: Yes
Title of Book: Probabilistic Graphical Models
Editors:
EditorsEmailPitt UsernameORCID
van der Gaag, Linda C.UNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Feelders, Ad J.UNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Official URL: http://link.springer.com/chapter/10.1007/978-3-319...
Date Deposited: 02 Jul 2015 14:32
Last Modified: 01 Nov 2017 12:58
URI: http://d-scholarship.pitt.edu/id/eprint/25526

Metrics

Monthly Views for the past 3 years

Plum Analytics

Altmetric.com


Actions (login required)

View Item View Item