Incorporation of Knowledge for Network-based Candidate Gene Prioritization

Kimmel, Chad (2012) Incorporation of Knowledge for Network-based Candidate Gene Prioritization. Doctoral Dissertation, University of Pittsburgh. (Unpublished)

Preview

PDF
Primary Text
Download (1MB) | Preview

Abstract

In order to identify the genes associated with a given disease, a number of different high-throughput techniques are available such as gene expression profiles. However, these high-throughput approaches often result in hundreds of different candidate genes, and it is thus very difficult for biomedical researchers to narrow their focus to a few candidate genes when studying a given disease. In order to assist in this challenge, a process called gene prioritization can be utilized. Gene prioritization is the process of identifying and ranking new genes as being associated with a given disease. Candidate genes which rank high are deemed more likely to be associated with the disease than those that rank low. This dissertation focuses on a specific kind of gene prioritization method called network-based gene prioritization. Network-based methods utilize a biological network such as a protein-protein interaction network to rank the candidate genes. In a biological network, a node represents a protein (or gene), and a link represents a biological relationship between two proteins such as a physical interaction.
The purpose of this dissertation was to investigate if the incorporation of biological knowledge into the network-based gene prioritization process can provide a significant benefit. The biological knowledge consisted of a variety of information about a given gene including gene ontology (GO) functional terms, MEDLINE articles, gene co-expression measurements, and protein domains to name just a few. The biological knowledge was incorporated into the network’s links and nodes as link and node knowledge respectively. An example of link knowledge is the degree of functional similarity between two proteins, and an example of node knowledge is the number of GO terms associated with a given protein. Since there were no existing network-based inference algorithms which could incorporate node knowledge, I developed a new network-based inference algorithm to incorporate both link and node knowledge called the Knowledge Network Gene Prioritization (KNGP) algorithm.
The results showed that the incorporation of biological knowledge via link and node knowledge can provide a significant benefit for network-based gene prioritization. The KNGP algorithm was utilized to combine the link and node knowledge.

Citation/Export:
Social Networking:	Share \|

Details

Item Type:

University of Pittsburgh ETD

Status:

Unpublished

Creators/Authors:

Creators	Email	Pitt Username	ORCID
Kimmel, Chad	cpk5@pitt.edu	CPK5

ETD Committee:

Member	Email Address	Pitt Username
Visweswaran, Shyam	shv3@pitt.edu	SHV3
Gopalakrishnan, Vanathi	vanathi@pitt.edu	VANATHI
Ganapathiraju, Madhavi	madhavi@pitt.edu	MADHAVI
Kaminski, Naftali	nak38@pitt.edu	NAK38

Date:

30 August 2012

Date Type:

Publication

Defense Date:

10 July 2012

Approval Date:

30 August 2012

Submission Date:

29 August 2012

Access Restriction:

1 year -- Restrict access to University of Pittsburgh for a period of 1 year.

Number of Pages:

111

Institution:

University of Pittsburgh

Schools and Programs:

School of Medicine > Biomedical Informatics

Degree:

PhD - Doctor of Philosophy

Thesis Type:

Doctoral Dissertation

Refereed:

Yes

Uncontrolled Keywords:

gene prioritization, network-based gene prioritization, knowledge networks

Date Deposited:

30 Aug 2012 11:47

Last Modified:

15 Nov 2016 14:02

URI:

http://d-scholarship.pitt.edu/id/eprint/13866

Metrics

Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

View Item

My Account

Search

Browse

Information

Incorporation of Knowledge for Network-based Candidate Gene Prioritization

Abstract

Share

Details

Metrics

Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

Connect with us

Send Comments or Questions

Feeds