Student and Faculty Publications
Publication Date
12-6-2022
Journal
Gigascience
Abstract
In the recent biobank era of genetics, the problem of identical-by-descent (IBD) segment detection received renewed interest, as IBD segments in large cohorts offer unprecedented opportunities in the study of population and genealogical history, as well as genetic association of long haplotypes. While a new generation of efficient methods for IBD segment detection becomes available, direct comparison of these methods is difficult: existing benchmarks were often evaluated in different datasets, with some not openly accessible; methods benchmarked were run under suboptimal parameters; and benchmark performance metrics were not defined consistently. Here, we developed a comprehensive and completely open-source evaluation of the power, accuracy, and resource consumption of these IBD segment detection methods using realistic population genetic simulations with various settings. Our results pave the road for fair evaluation of IBD segment detection methods and provide an practical guide for users.
Keywords
Humans, Biological Specimen Banks, identical-by-descent, biobank-scale data, IBD segment detection tools, benchmarking
Comments
This article has been corrected. See Gigascience. 2022 December 29; 12: giac129.
Supplementary Materials
PMID: 36472573