Consanguinity
Contributor(s)
| Written | 2002-06 | Robert Kalmes, Jean-Loup Huret |
| Genetics, Dept Medical Information, UMR 8125 CNRS, University of Poitiers, CHU Poitiers Hospital, F-86021 Poitiers, France (JLH) |
- I. Definition
- II. Coefficient of consanguinity of an individual
- Formulation
- General formula
- III. Consanguinity of a population
- IV. Self-fertilization
- V. Generalization
- Genotype frequencies at equilibrium
- Properties of consanguinity
- VI. Human population
- VII. Genetic counselling
- VIII. Rare alleles - Common alleles
- Exercise
- Practical consequences
- IX. Consanguinity - Heterozygotism - Isogenetic line
- X. Multiallele system
- Exercise: Consanguinity for a locus and three alleles
I. Definition
- A subject is in a situation of consanguinity if, for a given locus, (s)he has two identical alleles, per copy of one and the same ancestor gene.
- The coefficient of consanguinity (Cc or F) is the probability that the two allele genes that an individual has at a locus are identical by descendance. --> this assumes that a common ancestor (A) is shared by the parents, F and M, of the individual, I studied.
II. Coefficient of consanguinity of an individual
2.1 Formulation
- First, look for the common ancestor (or common ancestors) on the genealogical tree.
- Then, calculate the probabilities; for example:

Figure 1 - A possesses alleles a1 and a2. It transmits to the great-grandparents GGP:
- either identical alleles (a1 and a1 or: a2 and a2) --> proba: 1/2
- or different alleles (a1 and a2 or: a2 et a1),
Overall, A transmits the identity with a proba: 1/2 + 1/2 FA, or: 1/2 (1 + FA) Note: FA can be equal to zero. - Each generation i has a proba 1/2 of transmitting this allele to i+1; therefore a proba (1/2)n after n generations; or (1/2)p to go from GGP1 to I and (1/2)m to go from GGP2 to I, if f and m are the number of links linking the father and mother respectively to the common ancestor (here p = m = 3)
- Therefore: FI = (1/2)p+m+1(1+FA) ...
- And, if there are several common ancestors (not consanguinous with each other), a sum S, addition of the various consanguinities, gives us the:
2.2 General formula
FI = S(1/2)p+m+1(1+FAi)
- Note: FA is negligible in man and at the level of the individual, but may not be in Drosophila, particularly in an entire population.
- Genealogy studies are indispensable for determing consanguinity; see: Genealogy and Coefficient of Consanguinity, Exercices
III. Consanguinity of a population
The mean coefficient of consanguinity is equal to the mean of the various individual coefficients weighted by the frequencies of the various types of crosses between related individuals.
To evaluate this, an inventory is compiled for the individuals of the different types of crossings between related individuals, and they are classified on the basis of the value of Fx.
EQUATION a= Fifi where fi is the frequency of the subjects with consanguinity Fi.
- Example: a population in which 6% are consanguin, and among these: 2.5% have a value of F = 1/8; 2% a value of F = 1/16, and 1.5% a value of F = 1/32; what is the consanguinity of this population?
- Answer: a= (2.5 X 1/8) + (2 X 1/16) + (1.5 X 1/32) = 0.484%
IV. Self-fertilization
This means that each genotype is fertilized exclusively by itself (a situation that is possible in maize (corn), but not in Drosophila, or in Man).
In a population of plants, underHW in Go, which is then put in a situation of self-fertilization:
| | AA | | Aa | | aa |
| Go | 0.25 | | 0.50 | | 0.25 |
| self-fertilization | AA | AA | Aa | aa | aa |
| | X 1 | X1/4 | X1/2 | X1/4 | X1 |
| G1 | 0.25 | 0.125 | 0.25 | 0.125 | 0.25 |
| | AA | | Aa | aa | etc... |
What is the frequency Hn of the heterozygotes in generation n?
Hn = 1/2 Hn-1 --> Hn = (1/2)nHo; tends towards zero.
Dn = Dn-1 + 1/4 Hn-1
Rn = Rn-1 + 1/4 Hn-1
--> at the equilibrium of self-fertilization: Deq = Do + 1/2 Ho; Heq = 0; Req = Ro + 1/2 Ho
Being under HW in Go, Do = p2; Ho = 2pq; Ro = q2
--> Deq = p2 + 1/2 2pq = p2 + pq = p (p +q) = p; similarly for Req -->
Genotype frequencies at equilibrium:
- Deq = p
- Heq = 0
- Req = q
V. Generalization
| Genotype | General case | panmixia | self-fertilization |
| | 0 <= F <= 1 | F=0 | F=1 |
| | allozygotism + autozygotism | | |
| AA | p2(1-F) + pF | p2 | p |
| Aa | 2pq(1-F) | 2pq | 0 |
| aa | q2(1-F) + qF | q2 | q |
Thus, any population (and this includes a consanguin population) will behave as though:
- one fraction (1-F) was developing in panmixia
- one fraction F was developing in self-fertilization
F being the mean coefficient of consanguinity of the population. F=0 in panmixia, F=1 in self-fertilization
The genotype frequencies at equilibrium will be:
F(AA)eq = p2(1 - F) + pF = p2 - p2F + pF = p2 + Fp (1 - p) = p2 + Fpq; similarly for F(aa); thus:
5.1 Genotype frequencies at equilibrium
EQUATION:
F(AA)eq = p2 + Fpq
F(Aa)eq = 2pq(1 - F)
F(aa)eq = q2 + Fpq
The risk that a consanguin subject will be homozygotic for the allele a is: F(aa) = q2 + Fpq
- Another demonstration of the relations F(AA) = p2 + Fpq, F(Aa) = 2pq(1 - F), et F(aa) = q2 + Fpq is given in: Genetic Constitution of Consanguin Populations
- Are the allele frequencies altered?
5.2 Properties of consanguinity
Consanguinity:
- modifies the genotype frequencies. We can see an increase in the frequency of homozygotes and a reduction in that of heterozygotes.
- does not modify the allele frequencies
VI. Human population
It is usual within the human population for there to be several common ancestors (e.g. below: AM and AF male and female ancestors, parents of GP 1 and 2). In practice, the equation is simplified to:
EQUATION:
CcI = S(1/2)p+m+1
- Example: Parents first cousins: p=2; m=2; S is the sum of 2 terms, since there are 2 possibilities of having identical alleles: via AM and via AF (i.e. 2 common ancestors); so,
- Answer: Fi = (1/2)2+2+1 +(1/2)2+2+1 = 1/16

VII. Genetic counselling
For a deleterious mutant recessive autosomal allele (rare by definition) with a frequency q, the risk that a consanguin child will be homozygotic for this allele is: q x Cc whereas it is q2 for the children of non-consanguin parents.
NOTE: the exact equation q2+ pqCc is replaced by the approximation: q x Cc. This is applicable to human genetics (genetic counselling) if/because q is very small.
- Exercise 1: Parents first cousins; for a mutant gene with recessive autosomal transmission with frequency q = 1/100 (example of phenylketonuria, one of the most common recessive autosomal diseases): what is the risk in the general population? what is the risk that I could be affected?
- Answer:
- for the general population: q 2 = 1/10 000
- for I, it is, according to the equation: q x Cc = 1/100 x 1/16 = 1/1 600
- Note: the risk of a recessive autosomal disease in the individual I, relative to the general population, is increased by the factor: q x Cc / q 2 = Cc / q here = 6.25 (or, if we use the accurate equation (q2+ pqCc) / q 2 = 7.19)
- Answer:
- Exercise 2: same exercice, but for a frequency q = 1/10 000 of the mutant gene.br
- Answer:
- for the general population: q 2 = 1/100 000 000
- for I, it is, according to the equation: q x Cc = 1/10 000 x 1/16 = 1/160 000
- the risk for individual I relative to the general population, is increased by a factor Cc / q here = 625 or, if we use the accurate equation, q2+ pqCc / q 2 = 626 (the rarer the allele, the better the approximation q x Cc).
- Answer:
VIII. Rare alleles - Common alleles
8.1 Exercise
Consider:
- a gene A of which the recessive allele a has a freqency F(a) = q = 0.5,
- and a gene B of which the recessive allele b has a freqency F(b) = q = 0.0001, a fairly usual frequency for a morbid allele,
Calculate the frequency/risk of being recessive homozygote(s) for each of these two genes
- according to Hardy-Weinberg (HW)
- for a consanguin child whose parents are first-cousins
- compare.
Answer:
- according to HW:
- F(aa) = q2 = (0.5)2 = 0.25
- F(bb) = q2 = (0.0001)2 = (10-4)2 = 10-8
- for a consanguin child whose parents are first-cousins:
- S(1/2)p+m+1= (1/2)5 + (1/2)5 = (1/2)4 = 0.0625
- F(aa) = q2 + Fpq = (0.5)2 + 0.0625 X 0.5 X 0.5 = 0.2656
- F(bb) = q2 + Fpq = (10-4)2 + 0.0625 X 1 X 10-4 = (1 + 625)10-8 = 626 X 10-8
- comparison: the increase in the frequency (risk) of homozygotism due to the consanguinity will be F(consang.)/F(under HW), i.e.:
- for the common allele: 0.26256 / 0.25 = 1.06, slight increase
- or the rare morbid allele: 626 X 10-8/10-8 = 626 !!!
8.2 Practical consequences
In other words, the increase is: (q2 + Fpq)/q2 = 1 + Fp/q
- when p = q: --> = 1 + F @ 1
- when q is rare, p @ 1 --> = 1 + F/q @ F/q, with Fmax = 0.25 (incest), F is often of the order of 1 to 5 X 10-2 and q = 10-3 or 10-4, therefore a risk increased by a factor of 10to 103, which is generally the case for recessive diseases. Isolates, with a high level of consanguinity, allow unusual diseases to emerge.
IX. Consanguinity - Heterozygotism - Isogenetic line
In the human species, the percentage of heterozygotic loci, calculated from enzymatic polymorphism, has a value H = 0.067. We can take it that there are 30000 structural genes, and in consequence 2010 genes in the heterozygotic state in the human genome (30000 x 0.067 = 2010).
If an individual results from an uncle-niece cross:
- this individual will be more "homogenous" than his or her parents, because of the increased consanguinity,
- the percentage of his/her heterozygotic genes falls from 2010 to 1759 genes (2010 x 7/8) since Fi = 1/8 (1/8 of genes are identical as a result of consanguinity).
Consequences: If regular consanguin crosses are made (for example brother/sister crosses in mice), at each generation:
--> Fi tends towards a value of 1,
--> the individuals will become totally homozygotic.
Within each familly, all the individuals will be identical in the genetic sense of the word.
exactly the same genome
exactly the same genes.
This leads to the concept of the isogenetic line.
X. Multiallele system
The genotype frequencies at equilibrium for every AiAi homozygote and every AiAj: heterozygote will be:
- F(AiAi) = pi2(1 - F) + piF
- F(AiAj) = 2pipj(1 - F)
10.1 Exercise: Consanguinity for a locus and three alleles
In a population of a diploid species with separate sexes and separate generations, we are dealing with a triallele autosomal locus (three possible allele states: A1, A2 and A3).
A sample of 400 individuals is examined. The numbers of the various genotypes are as follows:
| A1A1 | A1A2 | A1A3 | A2A2 | A2A3 | A3A3 |
| 32 | 36 | 60 | 57 | 90 | 125 |
- Estimate the allele frequencies.
- Can we assume that we have the proportions of panmixia in the sample?
- Knowing that there is no selection, no mutation, no migration, no drift (large population), can consanguinity account for the difference? What are the theoretical proportions of the various genotypes, knowing that the mean coefficient of consanguinity is F?
- Estimate the value of F from the proportions of the sample.
Answer:
- The allele frequencies are estimated by counting the alleles. Thus, for allele A1 for example:
- frequency of A1 = ((2 x 32) + 36 + 60) / (2 x 400) = 0.20 = p
- similarly: frequency of A2= 0.30 = q; frequency of A3 = 0.50 = r
- The proportions of panmixia are in fact those indicated by the Hardy-Weinberg law.
- Theoretical frequencies of the genotypes according to the Hardy-Weinberg law
with p, q, r: the respective frequencies of the alleles A1, A2 and A3.
A1A1:
p2 = 0.202
A1A2:
2 pq = 2 x 0.20 x 0.30
A2A2:
q2 = 0.302
A1A3:
2 pr = 2 x 0.20 x 0.50
A3A3:
r2 = 0.502
A2A3:
2 qr = 2 x 0.30 x 0.50 - Theoretical numbers
A1A1:
0.202 x 400 =16
A1A2:
48
A2A2:
36
A1A3:
80
A3A3:
100
A2A3:
100 - Comparison of the theoretical numbers and the actual numbers found by a chi-squared test of conformity
c2 = (32 - 16)2/16 + ....... + (125 - 120)2/100 = 50
dd1 = 6 - 2 - 1 = 3; to a 5% threshold, [[chi]]2 calculated is greater than the value given in the table (7.815) and is therefore highly significant. Consequently the proportions of the genotypes do not comply with those of the Hardy-Weinberg law, the hypothesis of panmixia can be rejected.
- Theoretical frequencies of the genotypes according to the Hardy-Weinberg law
- Consanguinity has the effect of increasing the frequency of homozygotes and of reducing that of heterozygotes, relative to the proportions given by the Hardy-Weinberg law. This is indeed what we find. Consequently, consanguinity can account for the significant differences between the previous theroretical and actual numbers.
For the whole population, the theoretical genotype frequencies are:
A1A1: (1- F) p2 + Fp
A1A2: 2 pq (1 - F)
A2A2: (1 - F) q2 + Fq
A1A3: 2 pr (1 - F)
A3A3: (1 - F) r2 + Fr
A2A3: 2 qr (1 - F) - The frequency of A1A2 can be used simply to calculate F:
frequency of A1A2 = 2 pq (1 - F)
36/400 = 2 x 0.20 x 0.30 (1 - F) therefore 1 - F = 0.75
F = 0.25
Citation
Kalmes R, Huret JL
Atlas of Genetics and Cytogenetics in Oncology and Haematology 2002-06-01
Consanguinity
Online version: http://atlasgeneticsoncology.org/teaching/30039/copyright/haematological-explorer/submit-meetings/
