YEATS4 (YEATS domain containing 4)

2017-09-01   Ivan H. Still , Brenda Lauffart 

Dept. of Biological Sciences, Arkansas Tech University, 1701 N Boulder Ave, Russellville, AR 72801, USA; istill@atu.edu

Identity

HGNC
LOCATION
12q15; Extends between Chromosome 12: 69,359,703-69,390,796, (according to hg38-Dec 2013)(Fig. 1)
IMAGE
Atlas Image
LEGEND
Fig. 1. Location of the YEATS4 gene is shown within a 500kb region of 12p15. A) Protein-coding genes; B) Genes for non-coding and microRNAs; C) Processed pseudogenes.
LOCUSID
ALIAS
4930573H17Rik,B230215M10Rik,GAS41,NUBI-1,YAF9
FUSION GENES

Abstract

This entry reviews the structure, function and clinical significance of YEATS4, a gene originally identified from an amplicon on 12p15 found in a glioblastoma cell line and originally named glioma-amplified sequence (GAS41). The gene is amplified in several other cancers and translocations or rearrangements of chromosome 12 with breakpoints in YEATS4 are noted in such cancers as glioblastoma, lung cancer and soft tissue sarcomas. Several frameshift and one nonsense mutations have also been detected in cancer. As the YEATS4 protein is involved in chromatin modeling, transcriptional regulation and mitotic regulation, such aberrations are likely to deregulate cellular mechanisms involved in the normal control of cellular proliferation and cell death.

DNA/RNA

Atlas Image
Fig. 2. Genomic structure of YEATS4. The exon distributions for the two alternatively spliced products (RefSeq accessions NM_006530 and NM_001300950 are shown, with the intron sizes given in base pairs (bp). Coding regions are shown in thicker lines; non-coding regions (5 untranslated and 3 untranslated regions) are depicted as thinner lines.

Description

YEATS4 is encoded by 7 exons spread over 31,094bp (Fig. 2). The gene encodes two alternatively spliced products (RefSeq accessions NM_006530 and NM_001300950). Transcript 1 (NM_006530) is 1501 nucleotides in length and encodes a protein of 227 amino acids. Transcript 2 (NM_001300950) is 1339 nucleotides long and encodes a protein of 173 amino acids.

Proteins

Atlas Image
Fig. 3. Structure of the YEATS4 protein. Location of exon boundaries (vertical lines) are shown relative to the protein structure. The YEATS and Coiled domains are involved in the functional interactions of the protein. Alternative splicing removes exon 3 and 4, yielding transcript 2, such that isoform 2 (NP_001287879) misses 55 amino acids of the YEATS domain.

Description

The YEATS4 transcripts produce two proteins; isoform 1 is 227 amino acids long (RefSeq accession: NP_006521) and isoform 2 is 173 amino acids long (RefSeq accession: NP_001287879). Isoform 1 has two defined structural/functional features: the YEATS domain located between amino acid residue 44 and 123, and a coiled coiled domain (amino acid 168 - ter). The YEATS domain is conserved in YNK7 (Yaf9), MLLT1 (ENL), MLLT3 (AF-9) and the TFIIF small subunit. Yaf9 in S. cerevisiae is a subunit of the NuA4 histone acetyltransferase complex (Le Masson et al., 2003). Thus, the YEATS domain is thought to be involved in transcriptional regulation. A C-terminal coiled coil domain interacts with several proteins indicated below and is critical for incorporation of YEATS4 into the human TIP60 complex (Park and Roeder 2006). Six chemical modifications have been annotated in the Prosite database on amino acids conserved between human and mouse (Fig. 4), four of which are ubiquitinylation of lysines located in the coiled coil domain (Hornbeck et al., 2014). Fifty five amino acids of the YEATS domain are missing from isoform 2.
Atlas Image
Fig. 4. Location of chemically modified amino acid side chains. Amino acid residues positions are given relative to NP_006521. Key: p- phosphorylation, ac- acetylation, ub- ubiquitination.

Expression

Northern blot analysis detects a ubiquitously expressed transcript of approximately 1.7kb, with high expression in brain, heart and skeletal muscle (Harbroth et al., 2000). In general, RNA-seq analysis consistently indicates highest expression in the testes, tissues of the immune system (bone marrow, lymph node and thymus) and the brain, specifically the cerebellum (Fagerberg et al., 2013; Duff et al., 2015; GTEx Portal 2017).

Localisation

Several articles indicate that YEATS4 is located in punctate structures in the nucleoplasm of the interphase nucleus of the majority of tested cells, including glioblastoma cell lines (Harborth et al., 2000; Munnia et al., 2001; Debernardi et al., 2002; Lauffart et al., 2002). However, YEATS4 is localized to nucleoli in the neuroblastoma cell lines SK-N-SH, Neuro 2a, and NS20 (Munnia et al., 2001), suggesting a possible functional differentiation for YEATS4 in tumors of the peripheral compared to central nervous system.

Function

Based on its homology with Yaf 9 (by what is now called the YEATS domain [Fig. 1]), YEATS4 was predicted to be a subunit of the human homologue of the NuA4 histone acetyltransferase complex, the Tip60 complex. This complex is involved in gene regulation and the repair of double-stranded DNA breaks. This multisubunit complex has histone H4/H2A acetyltransferase activity, is capable of ATP-dependent H2AZ -H2B histone dimer exchange, and has DNA helicase activity (Auger et al., 2008). YEATS4 is also a subunit of the SNP2-related CBP activator protein ( SRCAP) complex, which is a distinct complex also involved in the exchange of Histone H2A for H2Z in chromatin, although the complex functions without acetyltransferase activity (Auger et al., 2008). YEATS4 also directly interacts with SWI/SNF component SMARCB1 (BAF47 or INI1) (Debernardi et al., 2002), MLLT10, which is a histone lysine methyltransferase DOT1L cofactor (Debernardi et al., 2002), the KDM1A demethylase (Piccinni et al., 2011), and the TFIIF component GTF2F2 (RAP30) (Heisel et al., 2010). It has been suggested that the YEATS domain acts as an evolutionarily conserved chromatin modification reader, and possibly directly binds to histones (Schulze et al., 2010).
Further evidence for a function of YEATS4 in transcription has come from direct interactions with transcription factor TFAP2A (AP-2) (Ding et al., 2006), and the C-terminal region of the myc family members MYCN and MYC (Piccinni et al., 2011), identified by affinity capture techniques such as yeast two-hybrid analysis, GST-pull down and coimmunoprecipitation. The β-catenin gene (CTNNB1) is regulated by binding of YEATS4 to its promoter, either via these transcription factors, or in combination with the chromatin modeling complexes discussed above (Ji et al., 2017; Jixiang et al., 2017). Downstream targets of β-catenin such as CCND1 (Cyclin D1), CDK4, CDK6 and c-Myc, and the apoptotic proteins BCL2 and BAX also respond to overexpression and decreased expression of YEATS4 in gastric cancer and pancreatic cancer (Ji et al., 2017; Jixiang et al., 2017).
As noted above, the Tip60 and SCRAP complex play roles in double-stranded DNA repair. However, YEATS4 is involved in the regulation of DNA repair independent of these complexes. YEATS4 overexpression results in destabilization of TP53, with downregulation of YEATS4 leading to stabilization of p53 with concomitant upregulation of p53 target genes such as CDKN1A (p21), and induction of senescence and apoptosis (Llanos et al., 2006; Park et al., 2006; Pikor et al., 2013; Tao et al., 2015). Furthermore, overexpression of YEATS4 increases resistance of normal bronchial epithelial cells to the platinum drug cisplatin and p53- MDM2 interaction inhibitor, nutlin (Pikor et al., 2013). Mechanistically, YEATS4 acts with PP2Cβ to dephosphorylate ser-366 on p53, and can decrease cell death due to UV-induced DNA damage (Park et al., 2011). Thus, overexpression of YEATS4 may contribute to the cancers described below in part by overcoming the tumor suppressor properties of p53.
Additional functions of YEATS4 in nuclear architecture and regulation of mitotic spindle assembly were uncovered shortly after its discovery. Harborth et al., (2000) performed yeast two-hybrid analysis to identify proteins that bind to the C-terminus (amino acids 1048-2115) of NUMA1 (NuMA). Further mapping indicated that YEATS4 bound via its coiled coil domain to the C-terminal region of the coiled-coil rod domain of NuMA (amino acids 1048-1700). This interaction was confirmed independently by co-immunoprecipitation, Dot Overlay Assays, Surface Plasmon Resonance, and by a separate study (Munnia et al., 2001). Later studies showed that interaction between NuMA and YEATS4 was cell-cycle regulated with the interaction being low in G1 to S, increasing in S phase and being maximal at G2/M (Schmitt et al., 2012). NuMA is a nuclear matrix protein that relocalizes to and anchors microtubules to the spindle poles (reviewed in Haren et al., 2009). Manipulation of the expression of YEATS4 and NuMA via expression plasmids and siRNA results in an increased rate of spindle defects leading to multipolar spindles and misaligned chromosomes (Schmitt et al., 2012). Additional interactions of YEATS4 with CEP162 (Munnia et al., 2001), and TACC1 - TACC2 - TACC3 (Lauffart et al., 2002; Lauffart et al., 2003), proteins with known roles in centrosomal dynamics during mitosis (Gergley et al., 2000a,b; Leon et al., 2006), have also been identified. However, even these YEATS4-interacting proteins have been implicated in transcriptional regulation. For instance, during interphase NuMA binds to and acts as a transcriptional coregulator of p53 upon DNA-damage (Endo et al., 2013; Ohata et al., 2013); CEP162 associates with predominantly nuclear localized KAT14 lysine acetyltransferase (Gupta et al., 2015; Wang et al., 2008); and the TACC proteins bind to nuclear localized histone acetyltransferases (Gangisetty et al., 2004), nuclear hormone receptors (Vettaikkorumakankauv et al., 2008; Guyot et al., 2010; Hein et al., 2015; Huttlin et al., 2017) and other transcription factors (Sadek et al., 2000; Simpson et al., 2004); Bargo et al., 2010), and coregulate their transcriptional targets.
YEATS4 has been found in complexes with a further 108 proteins by high-throughput analysis via affinity purification-mass spectrometry (Li et al., 2015; Huttlin et al., 2015; Huttlin et al., 2017). Many of these interactions support the proposed functions of YEATS4 in the TIP60 and SCRAP complexes, transcriptional regulation, and mitotic regulation. For instance, additional transcription factors of relevance to cancer, such as FOS, Fox family members, and the myc partner MAX have been identified as YEATS4-binding factors (Li et al., 2015).

Homology

The YEATS domain is highly conserved within eukaryotes and defines a family of proteins involved in key functions including transcriptional regulation and chromatin structure (see above). Four YEATS family members are found in humans (MLLT1 [ENL], MLLT3 [AF9], YEATS2, and YEATS4). A BLASTp search of the protein database can identify a defined YEATS4 protein in metazoan eukaryotes representing fungi, arthopods, cnirdarians, and chordates, but not plants. In S. cerevisiae, Yaf9 has been proposed to be the closest functional homolog to YEATS4, containing both the YEATS domain and the C-terminal coiled coil region (Le Masson et al., 2003). The Yaf9 protein shows 33% amino acid identity and 57% homology to human YEATS4. The N-terminal YEATS domain of YEATS4 is able to functionally interchange with that of Yaf9 (Piccinni et al., 2011).

Mutations

Somatic

A total of 39 somatic sequence variations involving the coding region of YEATS4 are recorded in COSMIC v81, released 09-May-17, at http://cancer.sanger.ac.uk/cosmic (Bamford et al., 2004; Forbes et al., 2017). Of those, seven are silent, twenty-seven are missense, four are frameshifts, and one is a nonsense mutation. Each sequence variant has been found in a single case, except for G75D which was detected in 3 colon cancer cell lines HCT8, HCT15, DLD1. c.63C>T, c.66T>G, c.333T>C, c.475C>T, c.498T>C, c.680_681insA are noted as single nucleotide polymorphisms in the dSNP database (https://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?locusId=8089).
Position (AA)Mutation (CDS)Mutation (Amino Acid)Mutation TypeCancer type
2c.6C>Gp.F2Ls.m.Rectal adenocarcinoma
8c.22T>Cp.F8Ls.m.Hepatocellular carcinoma
10c.28C>Tp.P10Ss.m.Stomach adenocarcinoma
21c.63C>T#p.I21Is.sMalignant melanoma
22c.66T>G#p.V22Vs.sClear cell renal cell carcinoma
35c.103G>Ap.G35Rs.m.Thyroid carcinoma
43c.127C>Tp.H43Ys.m.Malignant melanoma
43c.124delGp.H43fs*9d.fs.Lung adenocarcinoma
52c.156A>Gp.K52Ks.s.Stomach adenocarcinoma
58c.172G>Cp.D58Hs.m.Lung adenocarcinoma
75c.224G>Ap.G75Ds.m.Colon
78c.234A>Gp.L78Ls.sLung carcinoma
93c.279G>Tp.W93Cs.m.Primitive neuroectodermal tumour - medulloblastoma
95c.284A>Cp.E95As.m.Papillary renal cell carcinoma
97c.289G>Ap.E97Ks.m.Rectal Adenocarcinoma
100c.300C>Gp.I100Ms.m.Pancreatic ductal carcinoma
110c.328A>Gp.R110Gs.m.Malignant melanoma
111c.333T>C#p.P111Ps.sAdenocarcinoma of Large intestine
125c.375C>Ap.T125Ts.sMalignant melanoma
133c.398C>Gp.T133Rs.m.Clear cell renal cell carcinoma
136c.407C>Tp.S136Ls.m.Malignant melanoma
141c.421G>Tp.E141*s.n.Endometrioid carcinoma
147c.438delCp.P147fs*4d.fs.Breast Ductal carcinoma
150c.449T>Cp.M150Ts.m.Pancreas Ductal carcinoma
151c.453G>Ap.M151Is.m.Liver neoplasm
154c.461T>Cp.L154Ss.m.Head neck Squamous cell carcinoma
159c.475C>T#p.R159Cs.m.Malignant melanoma
162c.485C>Tp.T162Is.m.Papillary renal cell carcinoma
163c.489A>Cp.L163Fs.m.Breast Carcinoma
166c.498T>C#p.Y166Ys.s.Head neck Squamous cell carcinoma
170c.509C>Gp.T170Rs.m.Basal (triple-negative) carcinoma
174c.521A>Cp.E174As.m.Breast Phyllodes tumour
180c.539G>Cp.R180Ts.m.Lung Adenocarcinoma
192c.575T>Cp.F192Ss.m.Stomach Intestinal adenocarcinoma
207c.619A>Tp.T207Ss.m.Central nervous system Oligodendroglioma
207c.619delAp.T207fs*2d.fs.Endometrium Serous carcinoma
208c.624A>Gp.I208Ms.m.Hepatocellular carcinoma
224c.671C>Tp.A224Vs.m.Breast Ductal carcinoma
228c.680_681insA#p.*228fs?i.fs.Breast Carcinoma
).
Table 1: Sequence variations tabulated from COSMIC v81 (accessed 7/25/2017). KEY: d.fs. - deletion leading to frameshift; i.fs. - insertion leading to frameshift; s.m. - missense substitutions; s.s. - silent substitution; s.n. - nonsense substitution; # variants recorded as single nucleotide polymorphisms. Cells in green denote variations located in the YEATS domain, and cells in blue denote variations located in the C-terminal coiled coil domain.

Implicated in

Top note
The YEATS4 gene is located in a region of Chromosome 12 that is subject to genomic instability and amplification in several different cancers. 117 of 508 human cancer cell lines (23%) exhibited copy number gains of YEATS4 (Pikor et al., 2015). Fusion genes have been identified by RNA-seq analysis of the samples in the Cancer Genome Atlas, as well as in other tumor samples. As many rearrangements resulting in fusions between YEATS4 and other coding genes are found in the same region of the long arm of chromosome 12 that is also subject to amplification, they may be the result of intrachromosomal inversions or deletions. However, as the mechanism of the rearrangements is unclear, the genomic alterations will be designated as translocations for the purpose of this review.
Entity name
Bladder urothelial carcinoma
Note
RNA-seq analysis of 414 bladder urothelial carcinomas, detected one rearrangement, t(12;12)(q15;q15), fusing exon 4 of YEATS4 out-of-frame to exon 2 of the human lysozyme (LYZ) gene (Yoshihara et al., 2015). Presumably non-sense mediated decay would prevent significant protein product accumulating.
Atlas Image
Fig. 5. Hybrid gene and protein produced by t(12;12)(q15;q15). With a breakpoint located in intron 4 of YEATS4 and intron 1 of LYZ, RNA-seq detected an out-of-frame fusion gene that could only produce a significantly truncated YEATS4 protein.
Entity name
Brain cancer
Note
YEATS4 was identified as an amplified sequence (GAS41) in glioblastoma multiforme (GBM) cell line TX3868, with amplification subsequently noted at higher frequency in lower grade astrocytoma grades I and II. (Fischer et al., 1997). In a larger study by Schmitt et al. (2012) of 258 glioblastoma samples from the Cancer Genome Atlas. 9 glioblastomas had high level of YEATS4 amplification. However, 26.1% of the samples exhibited reduced expression.
Four genomic rearrangements involving YEATS4 have been noted in glioblastoma. Frattini et al. (2013) performed RNA-sequencing of 161 primary GBM and 24 short-term glioma sphere cultures. These authors detected a single rearrangement t(12;12)(q14.1;q15), resulting in a fusion between YEATS4 and XRCC6BP1 (X-ray repair cross-complementation group 6 binding protein 1). XRCC6BP1 (AKA ATP23) is involved in double-stranded DNA break repair (Fischer et al., 2013) and, based on homology via its metalloprotease domain, is suggested to be important in the biosynthesis of mitochondrial ATPase (Zeng et al., 2007). The detected transcript indicates that the fusion protein contains the YEATS domain fused to the majority of the peptidase M76 domain of XRCC6BP1 (missing 20 amino acids at the N-terminus) (Fig. 6).
Atlas Image
Fig. 6. Hybrid gene and protein produced by t(12;12)(q14.1;q15) that fuses YEATS4 exon 6 to exon 3 of XRCC6BP1. The Mitelman database contains records generated from paired-end whole genome-sequencing of 42 paired TCGA glioblastoma tumors and matching normal samples (Zheng et al., 2013, Yoshihara et al., 2015). A translocation t(12;12)(q15;q24) resulting in an in-frame fusion between YEATS4 exon 4 and exon 33 of EP400 (Fig. 7) was reported in the Mitelman database, however this fusion is absent from the TUMOR FUSION GENE DATA PORTAL developed by the same authors (http://54.84.12.177/PanCanFusV2/), and may not have survived further validation. The YEATS4-EP400 hybrid protein would contain contains 111 amino acids of YEATS4, containing part of the YEATS domain fused to the C-terminal 1811 amino acids of EP400, containing the SNF2 family domain and DNA_pol3_delta2 super family domain (cl26247). Yoshihara et al (2015) detected one case of a t(12;12)(q13.3;q15), generating an out-of-frame fusion of YEATS4 exon 6 to methionyl-tRNA synthetase ( MARS) exon 12 (Fig. 8). If the fusion protein is produced from the initiator methionine of YEATS4, the protein would be 187 amino acids long and contain the YEATS domain. However, one might predict that nonsense-mediated decay may reduce the accumulation of the fusion mRNA, and subsequent protein. A larger RNA-seq analysis followed by sequencing of 185 GBM samples from TCGA and Ivy center cohort also detected a fusion between YEATS4 (exon 1) and exon 4 of SLC35E3 located 601kb centromeric (Shah et al., 2013). While the fusion RNA detected would be driven from the YEATS4 promoter, the 106 amino acid fusion protein would only contain part of the triose phosphate transporter domain of SLC35E3 (Fig. 9). While it is possible that the fusion could inhibit the function of any normal YEATS4 or SLC35E3 protein in the tumor cell, it would seem more likely that the reciprocal product, if produced, would be of functional significance. However, the reciprocal SLC35E3-YEATS4 fusion transcript was not recorded.
Fig. 7. Fusion between exon 4 of YEATS4 and exon 33 of EP400 (numbering based on NM_015409 and protein NP_056224). The fusion protein is 1922 amino acids long. Domain Key: Y: YEATS domain; cc: Coiled coil domain; EP400_N: pfam15790, E1A-binding protein p400, N-terminal domain is characterized by low-complexity but its function is unknown; Atroph: Atrophin-1 super family domain, a polyglutamine stretch may interact with histone acetytransferase CBP; HSA: domain found in helicases and associated with SANT domains; SNF2_N: SNF2 family N-terminal domain; SNF: SNF2 family domain; DN: DNA_pol3_delta2 super family domain (cl26247).
Atlas Image
Fig. 8. Out-of-frame fusion between YEATS4 and MARS. The hybrid gene could produce a truncated YEATS4 protein missing the coiled coil domain. Domain Key: Y: YEATS domain; cc: Coiled coil domain; GST: Glutathione S-transferase C-terminal-like, alpha helical domain of Methionyl-tRNA synthetase from higher eukaryotes; PRK12267 super family domain: Methionyl-tRNA synthetase.
Fig. 9. Hybrid gene and protein produced by t(12;12)(q15;q15) that fuses YEATS4 exon 1 to exon 4 of SLC35E3. SLC35E3 encodes a protein of 313 amino acids (NP_061126) and is identified as a member of the triose-phosphate transporter (TPT) family based upon a central 278 amino acid domain.
Entity name
Colorectal carcinoma
Note
In a study of 86 colorectal carcinoma patients, increased expression (> 4th quartile) of YEATS4 correlated with decreased survival HR = 1.910, 95% CI: 1.005-3.632, P = 0.048 (Tao et al., 2015).
One rearrangement involving YEATS4 was detected in a study of 95 rectal adenocarcinomas by RNA-seq analysis and recorded in the TUMOR FUSION GENE DATA PORTAL. The rearrangement is identical to that observed in hepatocellular adenocarcinoma, fusing YEATS4 exon 6 to the final (noncoding exon) of CPSF6 (Fig. 11). Thus, the predicted fusion protein contains a truncated YEATS4 coding sequence contain the YEATS domain, but lacking the C-terminal coiled coil domain.
Entity name
Gastric Cancer
Note
Quantitative real time reverse transcriptase PCR analysis of 30 Gastric cancers by Ji et al. (2017) detected increased expression of the YEATS4 transcript compared to normal adjacent tissue. 13% of samples showed decreased expression. Overexpression of YEATS4 mRNA and protein was associated with cancer stage, with highest expression observed in Stage III gastric cancer. Overall survival and disease-free survival was decreased with high expression of YEATS4 (Ji et al., 2017). The authors detected increased occupancy of the beta-catenin promoter by the YEATS4 protein, leading to histone acetylation and increased transcription of the beta catenin gene.
One case of a t(12;12)(q15;q15) fusing exon 6 of YEATS4 to exon 2 of the human lysozyme gene (LYZ) was identified from 414 cases in the Tumor Fusion Gene Data Portal. The resulting fusion protein contains the YEATS domain and the hydrolytic catalytic cleft of lysozyme. This rearrangement is identical to that detected in liver hepatocarcinoma cancer (see Fig. 10).
Entity name
Hypertension
Note
The SNP rs7297610, located between YEATS4 and FRS2, was associated with antihypertensive response to hydrochlorothiazide. Specifically, the C/C genotypes was associated with significant improvement in blood pressure in treated African Americans. Baseline YEATS4 expression was increased in peripheral blood of C/C genotype individuals, but decreased upon treatment with hydrochlorothiazide (Duarte et al., 2013).
Entity name
Liposarcoma
Note
Amplification and overexpression of YEATS4 occurs in liposarcomas (Italiano et al., 2008; Barretina et al., 2010; Creytens et al., 2015). In a study of 28 well-differentiated liposarcomsas and 10 dedifferentiated liposarcomas, Italiano et al., (2008) subdivided the 12q13-15 amplicon into three independent amplicons. One amplicon, centered on MDM2 in 12q15 often contained YEATS4 with concomitant increased expression of the gene. A larger study of 77 lipomatous soft tissue tumors (including lipomas, well-differentiated liposarcomas (WDLPS), dedifferentiated liposarcomas (DDLPS) and pleomorphic liposarcomas) found that amplification of the YEATS4 gene was more frequent and at a higher level in dedifferentiated liposarcomas than in well-differentiated liposarcomas (Creytens et al., 2015). Knockdown of YEATS4 by shRNA in dedifferentiated liposarcoma independently reduced cell proliferation suggesting that YEATS4 acts as a driver gene within the 12q15 amplicon (Barretina et al., 2010).
Entity name
Liver hepatocellular carcinoma
Note
Two rearrangements involving YEATS4 have been detected in a study of 374 hepatocellular. carcinomas by RNA-seq analysis and recorded in the TUMOR FUSION GENE DATA PORTAL (2017). One case of a t(12;12)(q15;q15) fusing exon 6 of YEATS4 to exon 2 of the human lysozyme gene (LYZ) was identified. The resulting fusion protein contains the YEATS domain and the hydrolytic catalytic cleft of lysozyme (Fig. 10). This rearrangement is identical to that detected in stomach cancer (see below). The second rearrangement involves CPSF6 (cleavage and polyadenylation specific factor 6), located 101kb centromeric to YEATS4 (Fig. 1), and fuses exon 6 to the final, noncoding, exon of CPSF6 (Fig. 11). Thus, the fusion protein would be a truncated YEATS4 coding sequence containing the YEATS domain, but lacking the C-terminal coiled coil domain. The rearrangement is identical to that detected in rectal adenocarcinoma (see below).
Atlas Image
Fig. 10. Fusion between YEATS4 and LYZ, with a breakpoint located in intron 6 of YEATS4 and intron 2 of LYZ. The resulting fusion gene produces a fusion protein containing the YEATS domain and the Lysozyme catalytic cleft (LCC).
Atlas Image
Fig. 11. The CPSF6 protein (NP_0008938) is coded by the first 9 exons of the CPSF6 transcript (NM_007007). CPSF6 has one conserved domain (RRM_CFIm68) defined as a RNA recognition motif of pre-mRNA cleavage factor Im 68 kDa subunit (CFIm68 or CPSF6) and similar protein. The 4846 base pair tenth exon is untranslated, thus the fused hybrid gene produces a truncated YEATS4 protein.
Entity name
Lung cancer
Note
Pikor et al., (2015) performed gene expression and copy number analysis of 261 non-small cell lung cancers (NSCLC) relative to matched normal tissues. The 432 kb region encompassing LYZ to BEST3 (Fig. 1) (and RAB3IP located telomeric to BEST3) was amplified in 20% of the samples tested. Of the 7 genes within the amplicon, only YEATS4 was both gained/amplified and concomitantly overexpressed in lung tumors with YEATS4 amplified in 18% (47/261) and overexpressed in 31% (15/48) of tumor samples originally tested. YEATS4 copy gain/amplification was detected in an additional 43/128 (33.6%) lung cancer cell lines. YEATS4 was overexpressed in 18% (15/83) of the Early Detection Research Network samples and 33% (14/42) from the Cancer Genome Atlas. 3/83 tumors showed higher amplification of YEATS4 compared to the MDM2 protooncogene, that is the main suspected driver of 12q15 amplification, suggesting that YEATS4 can provide an independent advantage over MDM2 in some lung cancers. The study validated overexpression of the YEATS4 protein in 15/59 (25.4%) tumors and 8/18 (44.4%) NSCLC cell lines. Knockdown of YEATS4 in cell lines containing YEATS4 amplification induced senescence and apoptosis in a p53/p21 mediated manner. Overexpression of YEATS4 increased resistance of normal bronchial epithelial cells to the platinum drug cisplatin and p53-MDM2 interaction inhibitor, nutlin (Pikor et al., 2015).
The TUMOR FUSION GENE DATA PORTAL (2017) reports one case of a t(12;12)(q15;q15) from 541 Lung adenocarcinomas, involving YEATS4 and SLC35E3. The rearrangement fuses YEATS4 (exon 4) to exon 4 of SLC35E3 resulting in part of the YEATS domain to 65 amino acids of the triose phosphate transporter domain (Fig. 12).
Atlas Image
Fig. 12. In-frame fusion between YEATS4 and SLC35E3. SLC35E3 encodes a protein of 313 amino acids (NP_061126) and is identified as a member of the triose-phosphate transporter (TPT) family based upon a central 278 amino acid domain. The YEATS4-SLC35E3 fusion protein contains 111 amino acids of YEATS4 fused to 88 amino acids of the SLC35E3 protein containing the C-terminal 65 amino acids of the triose phosphate transporter domain.
Entity name
Müllerian adenosarcomas of the uterus
Note
In an analysis of 15 uterine and one ovarian Müllerian adenosarcomas, five cases showed amplification of a 900kb region stretching from SLC35E3 to BEST3, and thereby encompassing the YEATS4 locus on 12q15 (Lee et al., 2016).
Entity name
Pancreatic cancer
Note
YEATS4 transcript overexpression was noted in 31 pancreatic cancer samples relative to adjacent non-cancerous tissues (Jixiang et al., 2017). An expansion of this study to 93 pancreatic samples in the ONCOMINE database (Rhodes et al., 2004) further indicated clinical significance of YEATS4 expression in pancreatic cancer. Introduction of YEATS4 into a normal pancreatic epithelial cell lines promote growth invasion and motility. These effects were mediated, at least in part, by activation of the Wnt pathway and direct interaction with beta catenin (Jixiang et al., 2017).
Entity name
Soft tissue Sarcomas (unclassified)
Note
Five translocations were identified by Verhaak and colleagues from 263 Sarcomas and reported in the TUMOR FUSION GENE DATA PORTAL (2017). The adult sarcomas are unclassified in the National Cancer Institute Genomic Data Commons Data Portal. Each translocation fuses YEATS4 3 prime to its fusion partner.
One case t(1;12)(q21.3;q15) fuses the exon 4 of KCNN3 (potassium calcium-activated channel subfamily N member 3)(based on transcript NM_002249) in-frame to exon 2 of YEATS4. The fusion protein would be 740 amino acids long and contain the calcium-activated SK potassium channel domain and part of the ion channel domain (pfam03530 and pfam07885) from KCNN3 and the YEATS4 YEATS domain and coiled coil domain (Fig. 13).
In a different tumor, another in-frame fusion occurs between exon 2 of MARCH9 on 12q14.1 and exon 2 of YEATS4 (Fig. 14). MARCH9 is a member of the MARCH family of membrane-bound E3 ubiquitin ligases. The fusion protein product is 380 amino acids long, containing the C4HC3 zinc-finger like RING domain and the YEATS4 YEATS domain and coiled coil domain.
The other three translocations observed fuse YEATS4 exon 7 out of frame 3-prime to its fusion partner, suggesting that these rearrangements produce a truncated protein lacking any YEATS4 components. One case of t(1;12)(q24.2;q15) fuses exon 16 of a coactivator of ligand-dependent nuclear receptors, DCAF6, to exon 7 of YEATS4 (Fig. 15). A truncated protein would contain the WD40 and cl26247 (DNA polymerase III, delta subunit) domains involved in transcriptional regulation, but miss the final 200 amino acids of the coactivator. A single t(12;12)(q21.1;q15) fuses exon 1 of the THAP domain containing, apoptosis associated protein 2 ( THAP2) out-of-frame to exon 7 of YEATS4. The "fusion protein would also be truncated, and only contain the first 24 amino acids of the THAP2 protein, and 5 amino acids encoded out of frame from exon 7 of YEATS4 (Fig. 16). The last case noted was an out of frame fusion between exon 10 of TMTC1 and exon 7 of YEATS4 t(12;12)(p11.22;q15). If the truncated 564 amino acid fusion protein is produced, it would lack the C-terminal tetratricopeptide repeats of TMTC1 (Fig. 17).
Atlas Image
Fig. 13. In-frame fusion between KCNN3 and YEATS4. Exon numbering for KCNN3 is based upon transcript NM_002249 that codes for isoform a (NP_002240). Key: Y: YEATS domain; CC: coiled coil domain; SK_C: calcium-activated SK potassium channel domain; I_T2: Ion Transporter_2 (ion channel) domain; CM: Calmodulin binding domain; Grey box: Prefoldin super family domain structure.
Fig. 14. The MARCH9 gene contains 4 exons and encodes a transcript (NM_138396) of 2985 nucleotides, with a 5 untranslated region of 431 nucleotides and a 3 untranslated region of 1517 nucleotides. The full length MARCH9 protein is 346 amino acids long (NP_612405). The fusion between exon 2 of MARCH9 and exon 2 of YEATS4 could produce a fusion peptide of 380 amino acids.
Fig. 15. The DCAF6 gene contains 22 exons and encodes multiple alternatively splice transcripts. Protein Isoform b (NP_001017977) was selected as the canonical form by UniProtKB, coded by a transcript 2 (NM_001017977) of 3360 nucleotides, with a 5 untranslated region of 353 nucleotides and a 3 untranslated region of 1064 nucleotides. This transcript is missing exon 11, 12, and 17. W: WDR11 repeat region.
Atlas Image
Fig. 16. The THAP2 gene contains 3 exons and encodes a transcript of 4840 nucleotides, with a 5 untranslated region of 613 nucleotides and a 3 untranslated region of 3959 nucleotides. The full length THAP2 protein is 228 amino acids long (NP_113623), containing the DNA binding THAP domain. The fusion between exon 1 of THAP2 and exon 7 of YEATS could produce a fusion peptide of only 29 amino acids in size.
Fig. 17. The TMTC1 mRNA is encoded by 18 exons and is 8.75kb long with a 3 untranslated region of 6.2kb. The full length TMTC1 protein (NP_001180380) is 882 amino acid long and contains two domains: the DUF1736 (D) and the TPR-11 domain. The TMTC1-YEATS4 fusion gene would produce a 2468 nucleotide mRNA encoding 559 amino acids of TMTC1, missing the C-terminal 323 amino acid region containing most of the tetratricopeptide repeats of the TPR-11 domain.

Bibliography

Pubmed IDLast YearTitleAuthors
182120472008Eaf1 is the platform for NuA4 molecular assembly that evolutionarily links chromatin acetylation to ATP-dependent exchange of histone H2A variants.Auger A et al
278995782017COSMIC: somatic cancer genetics at high-resolution.Forbes SA et al
208047272010Transforming acidic coiled-coil protein-3 (Tacc3) acts as a negative regulator of Notch signaling through binding to CDC10/Ankyrin repeats.Bargo S et al
206019552010Subtype-specific genomic alterations define new targets for soft-tissue sarcoma therapy.Barretina J et al
258628362015Characterization of the 12q amplicons in lipomatous soft tissue tumors by multiplex ligation-dependent probe amplification-based copy number analysis.Creytens D et al
117561822002The MLL fusion partner AF10 binds GAS41, a protein that interacts with the human SWI/SNF complex.Debernardi S et al
166989632006GAS41 interacts with transcription factor AP-2beta and stimulates AP-2beta-mediated transactivation.Ding X et al
223501082013Association of chromosome 12 locus with antihypertensive response to hydrochlorothiazide may involve differential YEATS4 expression.Duarte JD et al
259702442015Genome-wide identification of zero nucleotide recursive splicing in Drosophila.Duff MO et al
238285762013Nuclear mitotic apparatus protein, NuMA, modulates p53-mediated transcription in cancer cells.Endo A et al
243098982014Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics.Fagerberg L et al
93022581997Cloning of a novel transcription factor-like gene amplified in human glioma including astrocytoma grade I.Fischer U et al
236705972013Glioma-amplified sequence KUB3 influences double-strand break repair after ionizing radiation.Fischer U et al
239174012013The integrated landscape of driver genomic alterations in glioblastoma.Frattini V et al
147674762004The transforming acidic coiled coil proteins interact with nuclear histone acetyltransferases.Gangisetty O et al
111210382000The TACC domain identifies a family of centrosomal proteins that can interact with microtubules.Gergely F et al
266380752015A Dynamic Protein Interaction Landscape of the Human Centrosome-Cilium Interface.Gupta GD et al
200788632010The transforming acidic coiled coil (TACC1) protein modulates the transcriptional activity of the nuclear receptors TR and RAR.Guyot R et al
109131142000GAS41, a highly conserved protein in eukaryotic nuclei, binds to NuMA.Harborth J et al
194009372009NuMA is required for proper spindle assembly and chromosome alignment in prometaphase.Haren L et al
264966102015A human interactome in three quantitative dimensions organized by stoichiometries and abundances.Hein MY et al
206189992010The YEATS family member GAS41 interacts with the general transcription factor TFIIF.Heisel S et al
255149262015PhosphoSitePlus, 2014: mutations, PTMs and recalibrations.Hornbeck PV et al
28514441988Cytotoxicity of 4-hydroxyanisole and tyrosinase activity in variant cell lines of B16 melanoma.Thody AJ et al
182148542008HMGA2 is the partner of MDM2 in well-differentiated and dedifferentiated liposarcomas whereas CDK4 belongs to a distinct inconsistent amplicon.Italiano A et al
282518872017YEATS Domain Containing 4 Promotes Gastric Cancer Cell Proliferation and Mediates Tumor Progression via Activating the Wnt/β-Catenin Signaling Pathway.Ji S et al
284459532017YEATS4 promotes the tumorigenesis of pancreatic cancer by activating beta-catenin/TCF signaling.Jixiang C et al
12620391976The urinary drop spectrometer--an electrooptical instrument for urological analysis based on the external urine stream.Ritter RC et al
129173322003Yaf9, a novel NuA4 histone acetyltransferase subunit, is required for the cellular response to spindle stress in yeast.Le Masson I et al
272551642016Genomewide copy number analysis of Müllerian adenosarcoma identified chromosomal instability in the aggressive subgroup.Lee JC et al
163020012006QN1/KIAA1009: a new essential protein for chromosome segregation and mitotic spindle assembly.Leon A et al
256096492015Proteomic analyses reveal distinct chromatin-associated and soluble transcription factor complexes.Li X et al
169291792006A high-throughput loss-of-function screening identifies novel p53 regulators.Llanos S et al
115211962001Expression, cellular distribution and protein binding of the glioma amplified sequence (GAS41), a highly conserved putative transcription factor.Munnia A et al
235893282013NuMA is required for the selective induction of p53 target genes.Ohata H et al
167051552006GAS41 is required for repression of the p53 tumor suppressor pathway during normal cellular proliferation.Park JH et al
213172902011The GAS41-PP2Cbeta complex dephosphorylates p53 at serine 366 and regulates its stability.Park JH et al
220681082011Direct interaction of Gas41 and Myc encoded by amplified genes in nervous system tumours.Piccinni E et al
241701262013YEATS4 is a novel oncogene amplified in non-small cell lung cancer that regulates the p53 pathway.Pikor LA et al
150686652004ONCOMINE: a cancer microarray database and integrated data-mining platform.Rhodes DR et al
110252032000Isolation and characterization of AINT: a novel ARNT interacting protein expressed during murine embryonic development.Sadek CM et al
226190672012GAS41 amplification results in overexpression of a new spindle pole protein.Schmitt J et al
206571832010Reading chromatin: insights from yeast into YEATS domain structure and function.Schulze JM et al
242619842013Exploration of the gene fusion landscape of glioblastoma using transcriptome sequencing and copy number data.Shah N et al
152349872004A classic zinc finger from friend of GATA mediates an interaction with the coiled-coil of transforming acidic coiled-coil 3.Simpson RJ et al
260459002015Knockdown of YEATS4 inhibits colorectal cancer cell proliferation and induces apoptosis.Tao K et al
188383862008Human ATAC Is a GCN5/PCAF-containing acetylase complex with a novel NC2-like histone fold module that interacts with the TATA-binding protein.Wang YL et al
255005442015The landscape and therapeutic relevance of cancer-associated transcript fusions.Yoshihara K et al
171352902007The metalloprotease encoded by ATP23 has a dual function in processing and assembly of subunit 6 of mitochondrial ATPase.Zeng X et al
237968972013A survey of intragenic breakpoints in glioblastoma identifies a distinct subset associated with poor survival.Zheng S et al

Other Information

Locus ID:

NCBI: 8089
MIM: 602116
HGNC: 24859
Ensembl: ENSG00000127337

Variants:

dbSNP: 8089
ClinVar: 8089
TCGA: ENSG00000127337
COSMIC: YEATS4

RNA/Proteins

Gene IDTranscript IDUniprot
ENSG00000127337ENST00000247843O95619
ENSG00000127337ENST00000548020F8W0J4
ENSG00000127337ENST00000549685F8VTR4
ENSG00000127337ENST00000552955F8W1B9

Expression (GTEx)

0
10
20
30
40
50
60

Pathways

PathwaySourceExternal ID
Gene ExpressionREACTOMER-HSA-74160
Generic Transcription PathwayREACTOMER-HSA-212436
Chromatin organizationREACTOMER-HSA-4839726
Chromatin modifying enzymesREACTOMER-HSA-3247509
HATs acetylate histonesREACTOMER-HSA-3214847
Transcriptional regulation by the AP-2 (TFAP2) family of transcription factorsREACTOMER-HSA-8864260
Activation of the TFAP2 (AP-2) family of transcription factorsREACTOMER-HSA-8866907

PharmGKB

Entity IDNameTypeEvidenceAssociationPKPDPMIDs
PA444552HypertensionDiseaseClinicalAnnotationassociatedPKPD
PA447288Essential hypertensionDiseaseClinicalAnnotationassociatedPKPD
PA449899hydrochlorothiazideChemicalClinicalAnnotationassociatedPKPD

References

Pubmed IDYearTitleCitations
185914612008Genomic association analysis suggests chromosome 12 locus influencing antihypertensive response to thiazide diuretic.50
182148542008HMGA2 is the partner of MDM2 in well-differentiated and dedifferentiated liposarcomas whereas CDK4 belongs to a distinct inconsistent amplicon.42
167051552006GAS41 is required for repression of the p53 tumor suppressor pathway during normal cellular proliferation.30
119030632002Interaction of the transforming acidic coiled-coil 1 (TACC1) protein with ch-TOG and GAS41/NuBI1 suggests multiple TACC1-containing protein complexes in human cells.18
223501082013Association of chromosome 12 locus with antihypertensive response to hydrochlorothiazide may involve differential YEATS4 expression.16
241701262013YEATS4 is a novel oncogene amplified in non-small cell lung cancer that regulates the p53 pathway.15
294377252018Recognition of histone acetylation by the GAS41 YEATS domain promotes H2A.Z deposition in non-small cell lung cancer.11
294637092018Identification of the YEATS domain of GAS41 as a pH-dependent reader of histone succinylation.11
213172902011The GAS41-PP2Cbeta complex dephosphorylates p53 at serine 366 and regulates its stability.10
206189992010The YEATS family member GAS41 interacts with the general transcription factor TFIIF.6

Citation

Ivan H. Still ; Brenda Lauffart

YEATS4 (YEATS domain containing 4)

Atlas Genet Cytogenet Oncol Haematol. 2017-09-01

Online version: http://atlasgeneticsoncology.org/gene/40686/yeats4