Approaches to Handling Incomplete Data in Family-based Association Testing



The high throughput of data arising from the complete sequence of the human genome has left statistical geneticists with a rich and extensive information source. The wide availability of software and the increase in computing power have made it easier to access and process such data. One problem is incompleteness of the data: unobserved or partially observed data points due to technical reasons or reasons associated with the patient's status or erroneous measurements of phenotype or genotype, to name a few. When not properly accounted for, these sources of incompleteness may seriously jeopardize the credibility of results from analyses.

In this paper we provide some perspectives on the occurrence and analysis of different forms of incomplete data in family-based genetic association testing.