Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

Improving ChIP-seq peak-calling for functional co-regulator binding by integrating multiple sources of biological information

Osmanbeyoglu, HU and Hartmaier, RJ and Oesterreich, S and Lu, X (2012) Improving ChIP-seq peak-calling for functional co-regulator binding by integrating multiple sources of biological information. Series on Advances in Bioinformatics and Computational Biology, 13. ISSN 1751-6404

[img]
Preview
PDF
Published Version
Available under License : See the attached license file.

Download (992kB) | Preview
[img] Plain Text (licence)
Available under License : See the attached license file.

Download (1kB)

Abstract

© 2012 Osmanbeyoglu et al.; licensee BioMed Central Ltd. Background: Chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-seq) is increasingly being applied to study genome-wide binding sites of transcription factors. There is an increasing interest in understanding the mechanism of action of co-regulator proteins, which do not bind DNA directly, but exert their effects by binding to transcription factors such as the estrogen receptor (ER). However, due to the nature of detecting indirect protein-DNA interaction, ChIP-seq signals from co-regulators can be relatively weak and thus biologically meaningful interactions remain difficult to identify.Results: In this study, we investigated and compared different statistical and machine learning approaches including unsupervised, supervised, and semi-supervised classification (self-training) approaches to integrate multiple types of genomic and transcriptomic information derived from our experiments and public database to overcome difficulty of identifying functional DNA binding sites of the co-regulator SRC-1 in the context of estrogen response. Our results indicate that supervised learning with naïve Bayes algorithm significantly enhances peak calling of weak ChIP-seq signals and outperforms other machine learning algorithms. Our integrative approach revealed many potential ERα/SRC-1 DNA binding sites that would otherwise be missed by conventional peak calling algorithms with default settings.Conclusions: Our results indicate that a supervised classification approach enables one to utilize limited amounts of prior knowledge together with multiple types of biological data to enhance the sensitivity and specificity of the identification of DNA binding sites from co-regulator proteins.


Share

Citation/Export:
Social Networking:
Share |

Details

Item Type: Article
Status: Published
Creators/Authors:
CreatorsEmailPitt UsernameORCID
Osmanbeyoglu, HUosmanbeyogluhu@pitt.eduHUO30000-0002-3175-1777
Hartmaier, RJrjh70@pitt.eduRJH70orcid.org/0000-0001-7416-6036#sthash.wHE891bE.dpuf
Oesterreich, Ssto16@pitt.eduSTO16
Lu, Xxinghua@pitt.eduXINGHUA
Date: 1 January 2012
Date Type: Publication
Journal or Publication Title: Series on Advances in Bioinformatics and Computational Biology
Volume: 13
DOI or Unique Handle: 10.1186/1471-2164-13-s1-s1
Schools and Programs: School of Medicine > Biomedical Informatics
School of Medicine > Pharmacology and Chemical Biology
Refereed: Yes
ISSN: 1751-6404
Date Deposited: 01 Nov 2016 18:11
Last Modified: 05 Feb 2019 03:55
URI: http://d-scholarship.pitt.edu/id/eprint/29970

Metrics

Monthly Views for the past 3 years

Plum Analytics

Altmetric.com


Actions (login required)

View Item View Item