Identity by descent

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

A DNA segment is identical by state (IBS) in two or more individuals if they have identical nucleotide sequences in this segment. An IBS segment is identical by descent (IBD) in two or more individuals if they have inherited it from a common ancestor without recombination, that is, the segment has the same ancestral origin in these individuals. DNA segments that are IBD are IBS per definition, but segments that are not IBD can still be IBS due to the same mutations in different individuals or recombinations that do not alter the segment.

The origin of IBD segments is depicted via a pedigree.
The origin of IBD segments is depicted via a pedigree.

Theory

All individuals in a finite population are related if traced back long enough and will, therefore, share segments of their genomes IBD. During meiosis segments of IBD are broken up by recombination. Therefore, the expected length of an IBD segment depends on the number of generations since the most recent common ancestor at the locus of the segment. The length of IBD segments that result from a common ancestor n generations in the past (therefore involving 2n meiosis) is exponentially distributed with mean 1/(2n) Morgans (M).[1] The expected number of IBD segments decreases with the number of generations since the common ancestor at this locus. For a specific DNA segment, the probability of being IBD decreases as 2−2n since in each meiosis the probability of transmitting this segment is 1/2.[2]

Applications

Identified IBD segments can be used for a wide range of purposes. As noted above the amount (length and number) of IBD sharing depends on the familial relationships between the tested individuals. Therefore one application of IBD segment detection is to quantify relatedness.[3][4][5][6] Measurement of relatedness can be used in forensic genetics,[7] but can also increase information in genetic linkage mapping [3][8] and help to decrease bias by undocumented relationships in standard association studies.[6][9] Another application of IBD is genotype imputation and haplotype phase inference.[10][11][12] Long shared segments of IBD, which are broken up by short regions may be indicative for phasing errors.[5][13]:SI

IBD mapping

IBD mapping [3] is similar to linkage analysis, but can be performed without a known pedigree on a cohort of unrelated individuals. IBD mapping can be seen as a new form of association analysis that increases the power to map genes or genomic regions containing multiple rare disease susceptibility variants.[6][14]

Using simulated data, Browning and Thompson showed that IBD mapping has higher power than association testing when multiple rare variants within a gene contribute to disease susceptibility.[14] Via IBD mapping, genome-wide significant regions in isolated populations as well as outbred populations were found while standard association tests failed.[11][15] Houwen et al. used IBD sharing to identify the chromosomal location of a gene responsible for benign recurrent intrahepatic cholestasis in an isolated fishing population.[16] Kenny et al. also used an isolated population to fine-map a signal found by a genome-wide association study (GWAS) of plasma plant sterol (PPS) levels, a surrogate measure of cholesterol absorption from the intestine.[17] Francks et al. was able to identify a potential susceptibility locus for schizophrenia and bipolar disorder with genotype data of case-control samples.[18] Lin et al. found a genome-wide significant linkage signal in a dataset of multiple sclerosis patients.[19] Letouzé et al. used IBD mapping to look for founder mutations in cancer samples.[20]

Error creating thumbnail: File with dimensions greater than 25 MP
An IBD segment identified by HapFABIA in Asian genomes. Rare single nucleotide variants (SNVs) that tag the IBD segment are coloured purple. Below the turquoise bar, the IBD segment in ancient genomes is displayed.

IBD in population genetics

Detection of natural selection in the human genome is also possible via detected IBD segments. Selection will usually tend to increase the number of IBD segments among individuals in a population. By scanning for regions with excess IBD sharing, regions in the human genome that have been under strong, very recent selection can be identified.[21][22]

In addition to that, IBD segments can be useful for measuring and identifying other influences on population structure.[6][23][24][25][26] Gusev et al. showed that IBD segments can be used with additional modeling to estimate demographic history including bottlenecks and admixture.[24] Using similar models Palamara et al. and Carmi et al. reconstructed the demographic history of Ashkenazi Jewish and Kenyan Maasai individuals.[25][26][27] Botigué et al. investigated differences in African ancestry among European populations.[28] Ralph and Coop used IBD detection to quantify the common ancestry of different European populations [29] and Gravel et al. similarly tried to draw conclusions of the genetic history of populations in the Americas.[30] Using the 1000 Genomes data Hochreiter found differences in IBD sharing between African, Asian and European populations as well as IBD segments that are shared with ancient genomes like the Neanderthal or Denisova.[13]

Methods and Software

Programs for the detection of IBD segments in unrelated individuals:

  • Parente: identifies IBD segments between pairs of individuals in unphased genotype data [31]
  • BEAGLE/fastIBD: finds segments of IBD between pairs of individuals in genome-wide SNP data [32]
  • BEAGLE/RefinedIBD: finds IBD segments in pairs of individuals using a hashing method and evaluates their significance via a likelihood ratio [33]
  • IBDseq: detects pairwise IBD segments in sequencing data [34]
  • GERMLINE: discovers in linear-time IBD segments in pairs of individuals [5]
  • DASH: builds upon pairwise IBD segments to infer clusters of individuals likely to be sharing a single haplotype [15]
  • PLINK: is a tool set for whole genome association and population-based linkage analyses including a method for pairwise IBD segment detection [6]
  • Relate: estimates the probability of IBD between pairs of individuals at a specific locus using SNPs [3]
  • MCMC_IBDfinder: is based on Markov Chain Monte Carlo (MCMC) for finding IBD segments in multiple individuals [35]
  • IBD-Groupon: detects group-wise IBD segments based on pairwise IBD relationships [36]
  • HapFABIA: identifies very short IBD segments characterized by rare variants in large sequencing data simultaneously in multiple individuals [13]

See also

References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. 3.0 3.1 3.2 3.3 Lua error in package.lua at line 80: module 'strict' not found.
  4. Lua error in package.lua at line 80: module 'strict' not found.
  5. 5.0 5.1 5.2 Lua error in package.lua at line 80: module 'strict' not found.
  6. 6.0 6.1 6.2 6.3 6.4 Lua error in package.lua at line 80: module 'strict' not found.
  7. Lua error in package.lua at line 80: module 'strict' not found.
  8. Lua error in package.lua at line 80: module 'strict' not found.
  9. Lua error in package.lua at line 80: module 'strict' not found.
  10. Lua error in package.lua at line 80: module 'strict' not found.
  11. 11.0 11.1 Lua error in package.lua at line 80: module 'strict' not found.
  12. Lua error in package.lua at line 80: module 'strict' not found.
  13. 13.0 13.1 13.2 Lua error in package.lua at line 80: module 'strict' not found.
  14. 14.0 14.1 Lua error in package.lua at line 80: module 'strict' not found.
  15. 15.0 15.1 Lua error in package.lua at line 80: module 'strict' not found.
  16. Lua error in package.lua at line 80: module 'strict' not found.
  17. Lua error in package.lua at line 80: module 'strict' not found.
  18. Lua error in package.lua at line 80: module 'strict' not found.
  19. Lua error in package.lua at line 80: module 'strict' not found.
  20. Lua error in package.lua at line 80: module 'strict' not found.
  21. Lua error in package.lua at line 80: module 'strict' not found.
  22. Lua error in package.lua at line 80: module 'strict' not found.
  23. Lua error in package.lua at line 80: module 'strict' not found.
  24. 24.0 24.1 Lua error in package.lua at line 80: module 'strict' not found.
  25. 25.0 25.1 Lua error in package.lua at line 80: module 'strict' not found.
  26. 26.0 26.1 Lua error in package.lua at line 80: module 'strict' not found.
  27. Lua error in package.lua at line 80: module 'strict' not found.
  28. Lua error in package.lua at line 80: module 'strict' not found.
  29. Lua error in package.lua at line 80: module 'strict' not found.
  30. Lua error in package.lua at line 80: module 'strict' not found.
  31. Rodriguez JM, Batzoglou S, Bercovici S. An accurate method for inferring relatedness in large datasets of unphased genotypes via an embedded likelihood-ratio test. RECOMB 2013, LNBI 7821:212-229.
  32. Lua error in package.lua at line 80: module 'strict' not found.
  33. Lua error in package.lua at line 80: module 'strict' not found.
  34. Lua error in package.lua at line 80: module 'strict' not found.
  35. Lua error in package.lua at line 80: module 'strict' not found.
  36. Lua error in package.lua at line 80: module 'strict' not found.