Link to the University of Pittsburgh Homepage
Link to the University Library System Homepage Link to the Contact Us Form

Statistical Methods for Recovering GWAS Data

Ozbek, Umut (2013) Statistical Methods for Recovering GWAS Data. Doctoral Dissertation, University of Pittsburgh. (Unpublished)

This is the latest version of this item.

Primary Text

Download (1MB) | Preview


Genome-wide association studies (GWAS) are used to investigate associations between genetic variants and health and disease. GWAS often use a “chip” to genotype single nucleotide polymorphisms (SNPs) spanning the genome and test for association between genotype and phenotype at each SNP. The topic of this dissertation is methods for recovering some of the markers that are typically discarded or not analyzed in a GWAS. During data cleaning, prior to the statistical analysis, many genetic markers are discarded because they fail to meet standard quality control criteria. In addition, some analysis results are filtered out because they are considered unreliable. However, there are some discarded data that could be recovered and used in the analysis. For instance, markers that fail to meet a cutoff p-value for the Hardy-Weinberg equilibrium (HWE) test are typically discarded, as are markers with minor allele frequency below some arbitrary cutoff. In addition, markers on the X-chromosome are often not analyzed because sex chromosome analyses are not as straightforward as autosomal analyses, and the statistical methods for testing association on X-chromosome markers are not well established or well tested. In order to make use of more information from any given GWAS, the standard quality control criteria could be modified and more flexible and data specific analysis methods could be developed. This should have the potential to increase power.
Genetic variation and environmental/behavioral factors interact to cause almost all human diseases with great public health significance. Refinements in data analysis will help improve the process of identifying the genetic factors and their interactions. These are important in order to provide better prevention and treatment for human diseases so to maintain public health. The methods in this dissertation are proposed and investigated to help with providing a better public health.


Social Networking:
Share |


Item Type: University of Pittsburgh ETD
Status: Unpublished
CreatorsEmailPitt UsernameORCID
ETD Committee:
TitleMemberEmail AddressPitt UsernameORCID
Committee ChairFeingold, Eleanorfeingold@pitt.eduFEINGOLD
Committee CoChairWeeks, Daniel E.weeks@pitt.eduWEEKS0000-0001-9410-7228
Committee MemberArena, Vincentarena@pitt.eduARENA
Committee MemberLin,; yal2005@gmail.comYAL14
Chen, Weiwei.chen@chp.eduWEC47
Date: 18 March 2013
Date Type: Publication
Defense Date: 6 February 2013
Approval Date: 18 March 2013
Submission Date: 8 March 2013
Access Restriction: 5 year -- Restrict access to University of Pittsburgh for a period of 5 years.
Number of Pages: 79
Institution: University of Pittsburgh
Schools and Programs: School of Public Health > Biostatistics
Degree: PhD - Doctor of Philosophy
Thesis Type: Doctoral Dissertation
Refereed: Yes
Uncontrolled Keywords: GWAS, null allele, X chromosome association
Date Deposited: 18 Mar 2013 15:29
Last Modified: 30 Jun 2022 16:18

Available Versions of this Item

  • Statistical Methods for Recovering GWAS Data. (deposited 18 Mar 2013 15:29) [Currently Displayed]


Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

View Item View Item