Use of artificial genomes in assessing methods for atypical gene detection

Azad, RK and Lawrence, JG (2005) Use of artificial genomes in assessing methods for atypical gene detection. PLoS Computational Biology, 1 (6). 0461 - 0473. ISSN 1553-734X

Preview

PDF
Published Version
Available under License : See the attached license file.
Download (480kB) | Preview

Plain Text (licence)
Available under License : See the attached license file.
Download (1kB)

Abstract

Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods - as well as the evaluation and proper implementation of existing methods - relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, "core" genes - those displaying patterns of mutational biases shared among large numbers of genes - are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple "core" gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes - representing those having experienced lateral gene transfer - were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying "atypical" genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently - i.e., they had different sets of strengths and weaknesses - when identifying atypical genes within chimeric artificial genomes. © 2005 Azad and Lawrence.

Citation/Export:
Social Networking:	Share \|

Details

Item Type:

Article

Status:

Published

Creators/Authors:

Creators	Email	Pitt Username	ORCID
Azad, RK
Lawrence, JG	jlawrenc@pitt.edu	JLAWRENC

Contributors:

Contribution	Contributors Name	Email	Pitt Username	ORCID
Editor	Borodovsky, Mark	UNSPECIFIED	UNSPECIFIED	UNSPECIFIED

Date:

1 December 2005

Date Type:

Publication

Journal or Publication Title:

PLoS Computational Biology

Volume:

Number:

Page Range:

0461 - 0473

DOI or Unique Handle:

10.1371/journal.pcbi.0010056

Schools and Programs:

Dietrich School of Arts and Sciences > Biological Sciences

Refereed:

Yes

ISSN:

1553-734X

PubMed ID:

16292353

Date Deposited:

11 Jul 2012 18:04

Last Modified:

23 Jan 2019 23:55

URI:

http://d-scholarship.pitt.edu/id/eprint/12836

Metrics

Monthly Views for the past 3 years

Plum Analytics

Altmetric.com

Actions (login required)

View Item

My Account

Search

Browse

Information

Use of artificial genomes in assessing methods for atypical gene detection

Abstract

Share

Details

Metrics

Monthly Views for the past 3 years

Plum Analytics

Altmetric.com

Actions (login required)

Connect with us

Send Comments or Questions

Feeds