Consanguinity


Contributor(s)

Written 2002-06 Robert Kalmes, Jean-Loup Huret
Genetics, Dept Medical Information, UMR 8125 CNRS, University of Poitiers, CHU Poitiers Hospital, F-86021 Poitiers, France (JLH)

  1. I. Definition
  2. II. Coefficient of consanguinity of an individual
    1. Formulation
    2. General formula
  3. III. Consanguinity of a population
  4. IV. Self-fertilization
  5. V. Generalization
    1. Genotype frequencies at equilibrium
    2. Properties of consanguinity
  6. VI. Human population
  7. VII. Genetic counselling
  8. VIII. Rare alleles - Common alleles
    1. Exercise
    2. Practical consequences
  9. IX. Consanguinity - Heterozygotism - Isogenetic line
  10. X. Multiallele system
    1. Exercise: Consanguinity for a locus and three alleles

I. Definition

  • A subject is in a situation of consanguinity if, for a given locus, (s)he has two identical alleles, per copy of one and the same ancestor gene.
  • The coefficient of consanguinity (Cc or F) is the probability that the two allele genes that an individual has at a locus are identical by descendance.
  • --> this assumes that a common ancestor (A) is shared by the parents, F and M, of the individual, I studied.

II. Coefficient of consanguinity of an individual

2.1 Formulation

  • First, look for the common ancestor (or common ancestors) on the genealogical tree.
  • Then, calculate the probabilities; for example:
    Figure 1
    Figure 1
  • A possesses alleles a1 and a2. It transmits to the great-grandparents GGP:
    • either identical alleles (a1 and a1 or: a2 and a2) --> proba: 1/2
    • or different alleles (a1 and a2 or: a2 et a1),
    but... if A itself exhibits consanguinity (with a coefficient of consanguinity FA), then a1 and a2 have a probability F A of being identical, and A transmits a1 and a2 with a proba 1/2, i.e. F A x 1/2
    Overall, A transmits the identity with a proba: 1/2 + 1/2 FA, or: 1/2 (1 + FA) Note: FA can be equal to zero.
  • Each generation i has a proba 1/2 of transmitting this allele to i+1; therefore a proba (1/2)n after n generations; or (1/2)p to go from GGP1 to I and (1/2)m to go from GGP2 to I, if f and m are the number of links linking the father and mother respectively to the common ancestor (here p = m = 3)
  • Therefore:    FI = (1/2)p+m+1(1+FA) ...
  • And, if there are several common ancestors (not consanguinous with each other), a sum S, addition of the various consanguinities, gives us the:

2.2 General formula

FI = S(1/2)p+m+1(1+FAi)

III. Consanguinity of a population

The mean coefficient of consanguinity is equal to the mean of the various individual coefficients weighted by the frequencies of the various types of crosses between related individuals.
To evaluate this, an inventory is compiled for the individuals of the different types of crossings between related individuals, and they are classified on the basis of the value of Fx.

EQUATION a= Fifi where fi is the frequency of the subjects with consanguinity Fi.

  • Example: a population in which 6% are consanguin, and among these: 2.5% have a value of F = 1/8; 2% a value of F = 1/16, and 1.5% a value of F = 1/32; what is the consanguinity of this population?
  • Answer: a= (2.5 X 1/8) + (2 X 1/16) + (1.5 X 1/32) = 0.484%

IV. Self-fertilization

This means that each genotype is fertilized exclusively by itself (a situation that is possible in maize (corn), but not in Drosophila, or in Man).
In a population of plants, underHW in Go, which is then put in a situation of self-fertilization:


 

AA

 

Aa

 

aa

Go

0.25

 

0.50

 

0.25
           

self-fertilization

AA

AA

Aa

aa

aa

 

X 1

X1/4

X1/2

X1/4

X1
           

G1

0.25

0.125

0.25

0.125

0.25
           

 

AA

 

Aa

aa

etc...

What is the frequency Hn of the heterozygotes in generation n?

Hn = 1/2 Hn-1 --> Hn = (1/2)nHo; tends towards zero.
Dn = Dn-1 + 1/4 Hn-1
Rn = Rn-1 + 1/4 Hn-1

--> at the equilibrium of self-fertilization: Deq = Do + 1/2 Ho; Heq = 0; Req = Ro + 1/2 Ho

Being under HW in Go, Do = p2; Ho = 2pq; Ro = q2
--> Deq = p2 + 1/2 2pq = p2 + pq = p (p +q) = p; similarly for Req -->

Genotype frequencies at equilibrium:

  • Deq = p
  • Heq = 0
  • Req = q

V. Generalization


Genotype

General case

panmixia

self-fertilization

 

0 <= F <= 1

F=0

F=1

 

allozygotism + autozygotism

 

 

AA

p2(1-F) + pF

p2

p

Aa

2pq(1-F)

2pq

0

aa

q2(1-F) + qF

q2

q

Thus, any population (and this includes a consanguin population) will behave as though:

  • one fraction (1-F) was developing in panmixia
  • one fraction F was developing in self-fertilization
    F being the mean coefficient of consanguinity of the population. F=0 in panmixia, F=1 in self-fertilization

The genotype frequencies at equilibrium will be:
F(AA)eq = p2(1 - F) + pF = p2 - p2F + pF = p2 + Fp (1 - p) = p2 + Fpq; similarly for F(aa); thus:

5.1 Genotype frequencies at equilibrium

EQUATION:
F(AA)eq = p2 + Fpq
F(Aa)eq = 2pq(1 - F)
F(aa)eq = q2 + Fpq

The risk that a consanguin subject will be homozygotic for the allele a is: F(aa) = q2 + Fpq

F(A) = D + H/2 = p2 + Fpq + 2pq(1 - F)/2 = p2 + Fpq + pq - Fpq = p2 + pq = p(p + q) = p --> invariable; therefore:

5.2 Properties of consanguinity

Consanguinity:

  • modifies the genotype frequencies. We can see an increase in the frequency of homozygotes and a reduction in that of heterozygotes.
  • does not modify the allele frequencies

VI. Human population

It is usual within the human population for there to be several common ancestors (e.g. below: AM and AF male and female ancestors, parents of GP 1 and 2). In practice, the equation is simplified to:

EQUATION:
CcI = S(1/2)p+m+1

  • Example: Parents first cousins: p=2; m=2; S is the sum of 2 terms, since there are 2 possibilities of having identical alleles: via AM and via AF (i.e. 2 common ancestors); so,
  • Answer: Fi = (1/2)2+2+1 +(1/2)2+2+1 = 1/16
Figure 2
Figure 2

VII. Genetic counselling

For a deleterious mutant recessive autosomal allele (rare by definition) with a frequency q, the risk that a consanguin child will be homozygotic for this allele is: q x Cc whereas it is q2 for the children of non-consanguin parents.

NOTE: the exact equation q2+ pqCc is replaced by the approximation: q x Cc. This is applicable to human genetics (genetic counselling) if/because q is very small.

  • Exercise 1: Parents first cousins; for a mutant gene with recessive autosomal transmission with frequency q = 1/100 (example of phenylketonuria, one of the most common recessive autosomal diseases): what is the risk in the general population? what is the risk that I could be affected?
    • Answer:
      • for the general population: q 2 = 1/10 000
      • for I, it is, according to the equation: q x Cc = 1/100 x 1/16  = 1/1 600
      • Note: the risk of a recessive autosomal disease in the individual I, relative to the general population, is increased by the factor: q x Cc / q 2 = Cc / q here = 6.25 (or, if we use the accurate equation (q2+ pqCc) / q 2 = 7.19)
  • Exercise 2: same exercice, but for a frequency q = 1/10 000 of the mutant gene.br
    • Answer:
      • for the general population: q 2 = 1/100 000 000
      • for I, it is, according to the equation: q x Cc = 1/10 000 x 1/16  = 1/160 000
      • the risk for individual I relative to the general population, is increased by a factor Cc / q here = 625 or, if we use the accurate equation, q2+ pqCc / q 2 = 626 (the rarer the allele, the better the approximation q x Cc).

VIII. Rare alleles - Common alleles

8.1 Exercise

Consider:

  • a gene A of which the recessive allele a has a freqency F(a) = q = 0.5,
  • and a gene B of which the recessive allele b has a freqency F(b) = q = 0.0001, a fairly usual frequency for a morbid allele,

Calculate the frequency/risk of being recessive homozygote(s) for each of these two genes
  1. according to Hardy-Weinberg (HW)
  2. for a consanguin child whose parents are first-cousins
  3. compare.

Answer:

  1. according to HW:
    • F(aa) = q2 = (0.5)2 = 0.25
    • F(bb) = q2 = (0.0001)2 = (10-4)2 = 10-8
  2. for a consanguin child whose parents are first-cousins:
    • S(1/2)p+m+1= (1/2)5 + (1/2)5 = (1/2)4 = 0.0625
    • F(aa) = q2 + Fpq = (0.5)2 + 0.0625 X 0.5 X 0.5 = 0.2656
    • F(bb) = q2 + Fpq = (10-4)2 + 0.0625 X 1 X 10-4 = (1 + 625)10-8 = 626 X 10-8
  3. comparison: the increase in the frequency (risk) of homozygotism due to the consanguinity will be F(consang.)/F(under HW), i.e.:
    • for the common allele: 0.26256 / 0.25 = 1.06, slight increase
    • or the rare morbid allele: 626 X 10-8/10-8 = 626 !!!

8.2 Practical consequences

In other words, the increase is: (q2 + Fpq)/q2 = 1 + Fp/q

  • when p = q: --> = 1 + F @ 1
  • when q is rare, p @ 1 --> = 1 + F/q @ F/q, with Fmax = 0.25 (incest), F is often of the order of 1 to 5 X 10-2 and q = 10-3 or 10-4, therefore a risk increased by a factor of 10to 103, which is generally the case for recessive diseases. Isolates, with a high level of consanguinity, allow unusual diseases to emerge.

IX. Consanguinity - Heterozygotism - Isogenetic line

In the human species, the percentage of heterozygotic loci, calculated from enzymatic polymorphism, has a value H = 0.067. We can take it that there are 30000 structural genes, and in consequence 2010 genes in the heterozygotic state in the human genome (30000 x 0.067 = 2010).
If an individual results from an uncle-niece cross:

  • this individual will be more "homogenous" than his or her parents, because of the increased consanguinity,
  • the percentage of his/her heterozygotic genes falls from 2010 to 1759 genes (2010 x 7/8) since Fi = 1/8 (1/8 of genes are identical as a result of consanguinity).

Consequences: If regular consanguin crosses are made (for example brother/sister crosses in mice), at each generation:
--> Fi tends towards a value of 1,
--> the individuals will become totally homozygotic.

Within each familly, all the individuals will be identical in the genetic sense of the word.
exactly the same genome
exactly the same genes.

This leads to the concept of the isogenetic line.

X. Multiallele system

The genotype frequencies at equilibrium for every AiAi homozygote and every AiAj: heterozygote will be:

  • F(AiAi) = pi2(1 - F) + piF
  • F(AiAj) = 2pipj(1 - F)

10.1 Exercise: Consanguinity for a locus and three alleles

In a population of a diploid species with separate sexes and separate generations, we are dealing with a triallele autosomal locus (three possible allele states: A1, A2 and A3).
A sample of 400 individuals is examined. The numbers of the various genotypes are as follows:

Estimate the allele frequencies.

A1A1

A1A2

A1A3

A2A2

A2A3

A3A3

32

36

60

57

90

125

  1. Estimate the allele frequencies.
  2. Can we assume that we have the proportions of panmixia in the sample?
  3. Knowing that there is no selection, no mutation, no migration, no drift (large population), can consanguinity account for the difference? What are the theoretical proportions of the various genotypes, knowing that the mean coefficient of consanguinity is F?
  4. Estimate the value of F from the proportions of the sample.

Answer:

  1. The allele frequencies are estimated by counting the alleles. Thus, for allele A1 for example:
    • frequency of A1 = ((2 x 32) + 36 + 60) / (2 x 400) = 0.20 = p
    • similarly: frequency of A2= 0.30 = q; frequency of A3 = 0.50 = r
  2. The proportions of panmixia are in fact those indicated by the Hardy-Weinberg law.
    • Theoretical frequencies of the genotypes according to the Hardy-Weinberg law

      A1A1:

      p2 = 0.202

      A1A2:

      2 pq = 2 x 0.20 x 0.30

      A2A2:

      q2 = 0.302

      A1A3:

      2 pr = 2 x 0.20 x 0.50

      A3A3:

      r2 = 0.502

      A2A3:

      2 qr = 2 x 0.30 x 0.50
      with p, q, r: the respective frequencies of the alleles A1, A2 and A3.
    • Theoretical numbers

      A1A1:

      0.202 x 400 =16

      A1A2:

      48

      A2A2:

      36

      A1A3:

      80

      A3A3:

      100

      A2A3:

      100
    • Comparison of the theoretical numbers and the actual numbers found by a chi-squared test of conformity
      c2 = (32 - 16)2/16 + ....... + (125 - 120)2/100 = 50
      dd1 = 6 - 2 - 1 = 3; to a 5% threshold, [[chi]]2 calculated is greater than the value given in the table (7.815) and is therefore highly significant. Consequently the proportions of the genotypes do not comply with those of the Hardy-Weinberg law, the hypothesis of panmixia can be rejected.
  3. Consanguinity has the effect of increasing the frequency of homozygotes and of reducing that of heterozygotes, relative to the proportions given by the Hardy-Weinberg law. This is indeed what we find. Consequently, consanguinity can account for the significant differences between the previous theroretical and actual numbers.

    For the whole population, the theoretical genotype frequencies are:

    A1A1: (1- F) p2 + Fp

    A1A2: 2 pq (1 - F)

    A2A2: (1 - F) q2 + Fq

    A1A3: 2 pr (1 - F)

    A3A3: (1 - F) r2 + Fr

    A2A3: 2 qr (1 - F)
  4. The frequency of A1A2 can be used simply to calculate F:
    frequency of A1A2 = 2 pq (1 - F)
    36/400 = 2 x 0.20 x 0.30 (1 - F) therefore 1 - F = 0.75
    F = 0.25

Citation

Kalmes R, Huret JL

Atlas of Genetics and Cytogenetics in Oncology and Haematology 2002-06-01

Consanguinity

Online version: http://atlasgeneticsoncology.org/teaching/30039/consanguinity