Written 2002-04 Patricia Ridgway, Christçle Maison, Genevièvee Almouzni
Institut Curie, Section de recherche, UMR218, 26, rue dUlm, 75231 Paris Cedex 05, France

  1. I.Introduction
  2. II.The nucleosome
  3. III.Histone proteins
    1. Core histones
    2. Linker histones
  4. IV.General steps in chromatin assembly
  5. V.Variation in basic constituents
  6. VI.Stimulatory Assembly Factors
    1. Histone interacting factors
    2. Remodelling machines and histone-modifying enzymes
  7. VII.Organization of the genome in the nucleus

I. Introduction

In eukaryotic cells the genetic material is organized into a complex structure composed of DNA and proteins and localized in a specialized compartment, the nucleus. This structure was called chromatin (from the Greek "khroma" meaning coloured and "soma" meaning body). Close to two meters of DNA in each cell must be assembled into a small nucleus of some mm in diameter. Despite this enormous degree of compaction, DNA must be rapidly accessible to permit its interaction with protein machineries that regulate the functions of chromatin:

  • Replication
  • Repair
  • Recombination

The dynamic organization of chromatin structure thereby influences, potentially, all functions of the genome.

The fundamental unit of chromatin, termed the nucleosome, is composed of DNA and histone proteins. This structure provides the first level of compaction of DNA into the nucleus. Nucleosomes are regularly spaced along the genome to form a nucleofilament which can adopt higher levels of compaction (Fig 1 and 3), ultimately resulting in the highly condensed metaphase chromosome. Within an interphase nucleus chromatin is organized into functional territories.

Chromatin has been divided into:

  • euchromatin
  • heterochromatin

Heterochromatin was defined as a structure that does not alter in its condensation throughout the cell cycle whereas euchromatin is decondensed during interphase. Heterochromatin is localized principally on the periphery of the nucleus and euchromatin in the interior of the nucleoplasm. We can distinguish:

  • constitutive heterochromatin, containing few genes and formed principally of repetitive sequences located in large regions coincident with centromeres and telomeres, from
  • facultative heterochromatin composed of transcriptionally active regions that can adopt the structural and functional characteristics of heterochromatin, such as the inactive X chromosome of mammals.

In this review we will define the components of chromatin and outline the different levels of its organization from the nucleosome to domains in the nucleus.

We will discuss how variation in the basic constituents of chromatin can impact on its activity and how stimulatory factors play a critical role in imparting diversity to this dynamic structure.

Finally we will summarize how chromatin influences the organization of the genome at the level of the nucleus.

II.The nucleosome

The partial digestion of DNA assembled into chromatin, generated fragments of 180-200 base pairs in length which were resolved by electrophoretic migration. This regularity of chromatin structure was later confirmed by electron microscope analysis that revealed chromatin as regularly spaced particles or "beads on a string". The stoichiometry of DNA and histones in the nucleosome was found to be 1/1 based on their mass.

The nucleosome is the fundamental unit of chromatin. It is composed of:

  • a core particle and
  • a linker region (or internucleosomal region) that joins adjacent core particles (Fig 1).

The core particle is highly conserved between species and is composed of 146 base pairs of DNA wrapped 1.7 turns around a protein octamer of two each of the core histones H3, H4, H2A and H2B.

The length of the linker region, however, varies between species and cell type. It is within this region that the variable linker histones are incorporated. Therefore, the total length of DNA in the nucleosome can vary with species from 160 to 241 base pairs.

Analyses revealed, firstly, the distortion of the DNA wound around the histone octamer and, secondly, that the histone/DNA and histone/histone interactions through their "histone fold domain" formed a configuration remniscent of a hand shake.

Figure 1
Figure 1. Defining elements of nucleosomes and chromatosome

III.Histone proteins

3.1 Core histones

The core histones, H3, H4, H2A and H2B, are small, basic proteins highly conserved in evolution (Figure 2). The most conserved region of these histones is their central domain structurally composed of the "histone fold domain" consisting of three a-helicies separated by two loop regions. In contrast, the N-terminal tails of each core histone is more variable and unstructured. The tails are particularily rich in lysine and arginine residues making them extremely basic. This region is the site of numerous post-translational modifications that are proposed to modify its charge and thereby alter DNA accessibility and protein/protein interactions with the nucleosome.

It is significant to note that other proteins that interact with DNA also contain the "histone fold domain".

Figure 2
Figure 2. The core histones. A.Structure of nucleosomal histones. B.Amino-terminal tails of core histones. The numbers indicate amino acid position. The post-translational modifications are indicated (red ac = acetylation sites ; blue p = phosphorylation sites ; green m = methylation sites ; purple rib = ADP ribosylation).

3.2 Linker histones

Linker histones associate with the linker region of DNA between two nucleosome cores and, unlike the core histones, they are not well conserved between species. In higher eukaryotes, they are composed of three domains: a globular, non-polar central domain essential for interactions with DNA and two non-structured N- and C- terminal tails that are highly basic and proposed to be the site of post translational modifications. The linker histones have a role in spacing nucleosomes and can modulate higher order compaction by providing an interaction region between adjacent nucleosomes.

IV.General steps in chromatin assembly

The assembly of DNA into chromatin involves a range of events, beginning with the formation of the basic unit, the nucleosome, and ultimately giving rise to a complex organization of specific domains within the nucleus. This step-wise assembly is described schematically in Fig 3.

  1. The first step is the deposition onto the DNA of a tetramer of newly synthesized (H3-H4)2 to form a sub-nucleosomal particle, which is followed by the addition of two H2A-H2B dimers.
    This produces a nucleosomal core particle consisting of 146 base pairs of DNA wound around the histone octamer.
    This core particle and the linker DNA together form the nucleosome.
    Newly synthesized histones are specifically modified (e.g.the acetylation of histone H4).
  2. The next step is the maturation step that requires ATP to establish regular spacing of the nucleosome cores to form the nucleofilament. During this step the newly incorporated histones are de-acetylated.
  3. Next the incorporation of linker histones is accompanied by folding of the nucleofilament into the 30nm fibre, the structure of which remains to be elucidated. Two principal models exist : the solenoid model and the zig zag.
  4. Finally, further successive folding events lead to a high level of organization and specific domains in the nucleus.

At each of the steps described above, variation in the composition and activity of chromatin can be obtained by modifying its basic constituents and the activity of stimulatory factors implicated in the processes of its assembly and disassembly.

Figure 3
Figure 3. General steps in chromatin assembly.

Assembly begins with the incorporation of the H3/H4 tetramer (1), followed by the addition of two H2A-H2B dimers (2) to form a core particle. The newly synthesized histones utilized are specifically modified; typically, histone H4 is acetylated at Lys5 and Lys12 (H3-H4*). Maturation requires ATP to establish a regular spacing, and histones are de-acetylated (3). The incorporation of linker histones is accompanied by folding of the nucleofilament. Here the model presents a solenoid structure in which there are six nucleosomes per gyre (4). Further folding events lead ultimately to a defined domain organization within the nucleus (5).

V.Variation in basic constituents

In the first steps of chromatin assembly, the elementary particle can assume variations:

  • at the level of DNA (for example by methylation) or
  • at the histone level by differential post-translational modification and the incorporation of variant forms (for example CENP-A, a variant of H3).

All of these variations are capable of introducing differences in the structure and activity of chromatin. The vast array of post-translational modifications of the histone tails summarized in Fig 2 (such as acetylation, phosphorylation, methylation, ubiquitination, polyADP-ribosylation), and their association with specific biological processes has led to a proposed hypothesis of a language, refered to as the "histone code", that marks genomic regions (It must be emphasized that this code is a working hypothesis)). The code is "read"by other proteins or protein complexes that are capable of understanding and interpreting the profiles of specific modifications. The incorporation of histone variants may be important at specific domains of the genome: in this context, CENP-A, a variant of histone H3 is associated with silent centromeric regions and macro H2A on the inactive X chromosome of female mammals. H2A-X is implicated in the formation of foci containing DNA repair factors in the regions of DNA double-strand breaks. Growing evidence exists that H2A.Z has a role in modifying chromatin structure to regulate transcription.

During the maturation step, incorporation of linker histones, non-histone chromatin associated proteins, called HMG(High Mobility Group),and other specific DNA-binding factors help to space and fold the nucleofilament. Therefore the early steps in assembly can have a great impact on the final characteristics of chromatin in specific nuclear domains.

VI.Stimulatory Assembly Factors

6.1 Histone interacting factors

Acidic factors can form complexes with histones and enhance the process of histone deposition. They act as histone chaperones by facilitating the formation of nucleosome cores without being part of the final reaction product. These histone-interacting factors, also called chromatin-assembly factors, can bind preferentially to a subset of histone proteins.

For instance, Chromatin Assembly Factor-1 (CAF-1) interacts with newly synthesized acetylated histones H3 and H4 to preferentialy assemble chromatin during DNA replication. CAF-1 is also capable of promoting the assembly of chromatin specifically coupled to the repair of DNA. The recent demonstration of the interaction of CAF-1 with the protein PCNA (Proliferating Cell Nuclear Antigen) established a molecular link between the assembly of chromatin and the processes of replication and repair of DNA. The assembly of specialized structures in centromeric regions, by deposition of variant histones such as CENP-A, or telomeres may be a result of the specificity and the diversity of as yet uncharacterised histone chaperones.

6.2 Remodelling machines and histone-modifying enzymes

Stimulatory factors also act during the chromatin maturation stage to organize and maintain a defined chromatin state. Their effects on chromatin can induce changes in conformation at the level of the nucleosome or more globally over large chromatin domains. These factors are of two types; one requiring energy in the form of ATP, generally refered to as chromatin remodelling machines, and the other that act as enzymes to post-translationally modify histones.

  • Chromatin remodelling machines: are multi-protein complexes (SWI/SNF, ISWI, Mi2/NuRD families).The activity of the ATPase permits the complex to modify nucleosomal structure, driven by the liberation of energy during the hydrolysis of ATP. The study of factors that stimulate the regular arrangement of nucleosomes during the assembly of chromatin led to the identification of several multi-protein complexes, capable in vitro of "sliding" nucleosomes along DNA. The common feature of these chromatin remodelling factors is their large size and multiple protein subunits including the ATPase, however, they display differences in abundance and activity.
  • post-translational modifications: the "histone code" hypothesis has been proposed to explain the diversity of chromatin activity in the nucleus. The unstructured N-terminal histone tails extend outside the nucleosome core and are the sites of action for enzymes that catalyze with high specificity their post-translational modification. The most well characterized of these modifications is the acetylation of lysine residues. Acetylation is the result of an equilibrium between two opposing activities: histone acetyl transferase (HAT) and histone deacetylation (HDAC) (e.g. HAT A, with a histone acetyltransferase activity and HDAC1, a histone deacetylase). Numerous proteins that play a role in the regulation of transcription have intrinsic histone acetyltransferase activity. Similarly, histone deacetylases have been described as components of multi-protein complexes associated with repressive chromatin. Also within these complexes are the Mi-2 family of remodeling factors providing a link between remodelling of nucleosomes and histone deacetylation during chromatin-mediated repression.

Methylation of histones plays a functionally important role. A histone-methyltransferase specifically methylates histone H3 on lysine residue 9 and this methylation modifies the interaction of H3 with heterochromatin associated proteins.

The two possible modifications (acetylation and methylation) on the same residue (lysine 9) of the N-terminal tail of H3 is a perfect illustration of the "histone code" hypothesis in action. Indeed, acetylated lysine in H3 and H4 N-terminal tail selectively interact with chromodomain present in numerous proteins having intrinsic histone acetyltransferase activity. However, H3 methylated on lysine residue 9 interact specifically with the chromodomain of an heterochromatin associated protein HP1.

Therefore, in addition to producing alterations in the overall charge of the histone tails, proposed to physically destabilize the nucleosome, modifications appear to impart specificity to protein:protein interactions with the histones. They are associated with different regions of the genome and are correlated with precise nuclear functions.

VII.Organization of the genome in the nucleus

The higher level of compaction of chromatin is not as well characterized. The nucleofilament is compacted to form the 30nm fibre that is organized into folds of 150 to 200 Kbp (250nm during interphase) to obtain a maximum level of compaction in the metaphase chromosome (850nm).

At interphase the organization of the genome relies on the structure of chromosomes that have been characterized into different regions based on a specific banding pattern.

The principle bands are:

  • G and C bands that are late replicating in S phase and correspond to heterochromatin and the
  • R bands that replicate earlier in S-phase and represent euchromatin. The R bands are enriched in acetylated histones and this modification is conserved through mitosis suggesting that histone acetylation may serve as a marker for the memory of domain organization through the cell cycle.

The localization of chromosomes in the interphase nucleus reveals that each chromosome occupies a defined space. In mammals, the organization of the chromosomes in the nucleus varies as a function of cell type. During interphase, regions that correspond to the bands of metaphase chromosomes are located in the nucleus based on the timing of their replication:

  • on the nuclear periphery are the later replicating regions, corresponding to G and C bands and the transcriptionally silent telomeres, while
  • gene rich regions are preferentially localized more internally.

Therefore, although each chromosome occupies a different territory, distinct parts of chromosomes can unite to form functional domains. The localization of coincident and non-coincident regions by FISH suggests that genes tend to be localized at the surface of chromosome territories. In the model based on the localization of some genes, transcripts are released into interchromosomal channels, transferred to sites for processing, then exported to the cytoplasm after maturation.

Several studies have led to the proposal that the nucleus is organized into domains. The localization of DNA in these domains is perhaps, in part, a consequence of the activities of chromatin. Targeting proteins might help to bring specialized proteins to specific domains in the nucleus. In a hypothetical model, the proteins associated with heterochromatin (for example HP1, Polycomb, Sir3p/Sir4p and ATRX), transcription factors (such as Ikaros) and assembly factors (such as CAF-1) may all be involved in the for establishment and maintenance of nuclear domains.

Table.Revised nomenclature for the HMG chromosomal proteins

List of Abbreviations

  • ATP: adenosine triphosphate
  • C-terminal: carboxy-terminal
  • CAF-1: Chromatin Assembly Factor-1
  • CENP-A: CENtromere Protein- A
  • HAT: Histone Acetyl Transferase
  • HDAC: Histone DeACetylase
  • HMG: High Mobility Group
  • HP1: Heterochromatin Protein 1
  • N-terminal: amino-terminal
  • PCNA: Proliferating Cell Nuclear Antigen
  • SWI/SNF: mating type SWItching/Sucrose Non-Fermenting

List of Definitions

  • Chromatin: is the carrier of genetic information. It is a complex structure composed of DNA and proteins and localized in the cellular nucleus.
  • Chromatin remodeling machines: require energy in the form of ATP and induce changes in conformation at the level of the nucleosome or more globally over large chromatin domains.
  • Constitutive heterochromatin: is formed principally of repetitive sequences and contains few genes. It is usually located in large regions coincident to centromeres and telomeres.
  • Euchromatin: represents chromatin that is decondensed during interphase.
  • Facultative heterochromatin: is composed of transcriptionally active regions that can adopt the structural and functional characteristics of heterochromatin.
  • G, C and R bands: correspond to the metaphase chromosome organization in bands.
  • Heterochromatin: represents a condensed form of chromatin that does not alter in its condensation throughout the cell cycle.
  • Histone chaperones: are acidic factors that can form complexes with histones and enhance the process of histone deposition. They act as histone chaperones by facilitating the formation of nucleosome cores without being part of the final reaction product.
  • Histone code: is the hypothesis of a language linked to the vast array of post-translational modifications to the histone tails and their association in specific biological activities. The functional significance of the interplay between these modified histones is the subject of intense investigation. This code is "read" by other proteins or protein complexes that are capable of understanding and interpreting the profiles of specific modifications.
  • HMG (High Mobility Group) proteins: are non-histone chromatin associated proteins. These DNA-binding proteins can help to space and fold the nucleofilament.
  • Nucleosome: is the fundamental unit of chromatin. It is composed of DNA and histone proteins. It provides the first level of compaction of DNA into the nucleus.


Ridgway P, Maison C, Almouzni G

Atlas of Genetics and Cytogenetics in Oncology and Haematology 2002-04-01


Online version: