Cloning of the Mouse and Human tendin cDNAs
A blast search in the mouse EST database with the mature CHM1 protein yielded two overlapping ESTs (IMAGE Clone 463876 and IMAGE Clone 2811628) for tendin. The corresponding cDNA clones revealed a sequence consisting of 202-bp 5′-untranslated sequence, 951bp open reading frame (ORF), and 184-bp 3′-untranslated sequence (Fig. 1A). 5′RACE experiments from a whole mouse embryo marathon cDNA library (Clontech) did not result in clones extending the sequenced cDNA (data not shown). Alternative internal splicing was investigated by reverse transcriptase-polymerase chain reaction (RT-PCR) experiments amplifying the complete coding region of the cDNA from several mouse tissues such as heart, liver, brain, cartilage, skin, muscle, and kidney. All experiments resulted in a single band with no evidence for alternative splicing (data not shown). The homologous human cDNA with a 951-bp open reading frame (ORF) was identified by sequencing the IMAGE Clone 1657819 (Fig. 1B). Mouse and human cDNAs have a homology of 89%. Because this new gene is highly expressed in tendons and ligaments (see below), we named it tendin.
The predicted Tendin protein, deduced from the cDNA sequence, consists of 317 amino acids. Protein prediction programs (Tmpred, TMAP, Phd-TM, DAS) proposed for Tendin a structure similar to CHM1, with a type II transmembrane topology, a transmembrane region from amino acid (aa) 31 to aa 49 and no signal peptide (Fig. 2A). An alignment of Tendin and CHM-I reveals high overall homology to CHM1 (similarity 54%, identity 31%), especially in the C-terminal cysteine-rich region (77%/66%) (Fig. 2B). The C-terminal extracellular part contains 10 cysteines, which are also present in CHM-I. Eight of these make the cysteine pattern of the cleaved mature CHM1. For the mature CHM1, disulfide bonds have been experimentally shown between the fourth and seventh and fifth and sixth cysteine. The eighth cysteine has a disulfide bond with either cystein1 or 2. The remaining two cysteines, first or second and third establish a fourth disulfide bond (Neame et al., 1990). In Tendin, the sixth cysteine at position 292 of the C-terminal domain is shifted by 4 aa N-terminally compared with CHM1. Two N-glycosylation sites and one O-glycosylation site have been described in the mature bovine CHM1 (Neame et al., 1990). One N-glycosylation site present at position 243 is conserved in human and mouse CHM1. The other sites are present neither in human nor mouse protein sequences. All three possible glycosylation sites are absent in Tendin. Instead, the PROSITE program (Bairoch et al., 1997) predicts two N-glycosylation sites for position 94 and 180. Although the endoprotease cleavage site RERR present in CHM1 is not conserved in Tendin, a potential protease cleavage motif RXXR could be found at position 233-236 that might allow cleavage by endoproteases (Barr, 1991). Sequences homologous to CHM1 or Tendin have only been found in vertebrates and are absent in Drosophila and C. elegans genomes.
Genomic Structure of the Human tendin
The PAC dJ479J7 that contains the human tendin sequences was mapped by FISH to Xq22 (Maeda et al., 1998). For the human CHM1 and tendin genes, genomic sequences are available from the Human Genome Project. Based on these sequences, the exon intron boundaries and the exon and intron sizes were analyzed by using the NIX platform at the Human Genome Mapping Project Resource Centre (URL: http://www.hgmp.mrc.ac.uk/Registered/Menu/). The CHM1 gene (unfinished sequence RP11-16F6, acc. AC012239; RP11-93H24, acc. AL139089; RP11-431O22, acc. AL139085) covers more than 43 kb, whereas tendin spans approximately 15 kb [Fig. 2A]). Both genes consist of seven exons, which all follow the AG-GT splicing consensus. The cytoplasmic N-termini of both genes are encoded by exon 1. Exon 2 encodes the transmembrane region. The extracellular part of the protein is encoded by exons 3-7. Exon 7 contains the complete C-terminal domain with eight conserved cysteine residues. With the exception of intron 5, which is in phase1 in the CHM1 as well as in the tendin gene, all introns are in phase 0. The exon sizes are well conserved, whereas the intron sizes differ between the two genes (Table 1).
Table 1. Genomic Organization of CHM1 and Tendina| | Splice acceptor | Splice donor | Exon size (bp) | Intron size (bp) | Intron phase |
|---|
|
| CHM1 exon |
| 1 | CCCCGgtgagtaccgcc | 72 | 505 | 0 |
| Pro |
| 2 | cccggcccgcagGCGTA | GTCACgtaagtccagag | 141 | 5698 | 0 |
| Ala | His |
| 3 | tccactttcgagATTTA | AGAATgtaagtatattt | 141 | 9113 | 0 |
| Ile | Asn |
| 4 | tttgtttcatagGGCAT | AACTGgtgggtaccaac | 114 | 11140 | 0 |
| Gly | leu |
| 5 | ttttactttcagGAAGG | AAAAGgtaacattttaa | 154 | >14 kb | 1 |
| Glu | LysG |
| 6 | aaatttttgcagAAATC | ATCATgtgcgttccggt | 167 | >3.8 kb | 0 |
| luIle | His |
| 7 | tttatttcacagCAGCA |
| Gln |
| Tendin exon |
| 1 | TAAATgtaagttgattc | 163 | 146 | 0 |
| Asn |
| 2 | ttctttggttagGCAGA | AAAAAgtaagtaaatac | 132 | 8532 | 0 |
| Ala | Lys |
| 3 | ctgttctcccagGCCTA | AAAACgtaagttggatg | 140 | 225 | 0 |
| Ala | Asn |
| 4 | tctgttttatagGGATA | ATGAGgtatgtaagaag | 102 | 3140 | 0 |
| Gly | Glu |
| 5 | ttcttctttcagAATGA | ATCAGgtatgacattct | 154 | 1357 | 1 |
| Asn | SerV |
| 6 | ttattatcttagTTTCT | ACTATgtgagttatgtt | 167 | 326 | 0 |
| alSer | Tyr |
| 7 | ttcttctttcagACTGA | 210 |
| Thr |
Expression Analysis
Analysis of the available sequences in the EST database revealed 11 ESTs from nine cDNA clones for the human form of tendin. Among the nine cDNA clones, four are from heart cDNA libraries; four from pooled libraries derived from heart, melanocyte, and lung tissue; and one clone is from a metastasizing colon tumor library.
Despite the significantly lower number of EST sequences in the mouse database, there are 33 sequences for tendin present, which were derived from several tissues, including 1 from bone, 4 from mammary cancer, 1 from skin, 1 from medulla oblongata, 1 from corpora quadrigemina, and 25 from whole embryos from stages E12.5 to neonate.
Northern blot analysis of newborn mouse tissues showed high expression of tendin in skeletal muscle, diaphragm, and eye. Weak signals were visible in almost all other tissues investigated, including brain, liver, lung, kidney, heart, skin, thymus, and perichondrium- and periosteum-free rib cartilage (Fig. 3). After probing the same blot with a CHM1 cDNA probe, signals were visible in cartilage, thymus, eye, and skeletal muscle. The latter signal was most probably due to cartilage contamination of the skeletal muscle RNA preparation (data not shown). To determine the temporal expression profile of tendin during development, we analyzed total RNAs isolated from embryos by Northern hybridization at various time points. tendin expression was already detected at E9.5, the earliest embryonic stage that has been analyzed to date (data not shown).
To localize the expression at the cellular level, in situ hybridization of mouse embryos and adult tissues was performed with sense and antisense tendin riboprobes. At E17.5, strong expression was detected in the tendons and ligaments of the skeletomuscular system, including the knee joint (Fig. 4A), the upper limb (Fig. 4B), and the intercostal ligaments (Fig. 4C). In the knee, tendons inserting into the patella, tendons of muscles, and all ligaments, including the cruciate ligaments, gave very high signals. A weaker signal was also detected in the developing medial meniscus (Fig. 4A). In the forelimb, high expression was seen in all tendons investigated, including the tendons of the flexor muscles (Fig. 4B). A high level of tendin expression was observed in the tendinous part of the diaphragm (Fig. 4D). Low-level expression of tendin was detected in resting and proliferative chondrocytes of long bones (Fig. 4E) and vertebral bodies (Fig. 4E), and in the cartilaginous part of the intervertebral disks (Fig. 4F). In hypertrophic chondrocytes, no tendin signals were observed (Fig. 4E).
Outside the skeletomuscular system signals were further detected in cortex and medulla of the thymus (Fig. 5A). In the liver, only single cells throughout the parenchyme express tendin (Fig. 5B). Signals in the lung were weak and mainly detected in the bronchial regions (Fig. 5C). In heart tissues, cells in the aortal and pulmonary outflow tract, as well as in the aortic and pulmonary valves showed weak positive signals (Fig. 5D). Epithelial cells at the basis of crypts and in villi in duodenum showed also evidence of tendin expression (Fig. 5E). In the eye, signal was visible in the tendons of the ocular muscles and within single cells in the retina (data not shown). tendin expression was also detected in cells of the olfactory epithelium (data not shown) and in the spleen (Fig. 5F).
In brain, tendin expression was detected in neuronal cells of all brain regions at E17.5. Highest levels of expression were found in the developing cerebellum, the neocortex, the olfactory bulb, and the spinal cord (data not shown). tendin expression was studied in more detail on adult mouse brain where it was found in all regions analyzed. Expression was observed in all hippocampal fields and in a dentate gyrus (Fig. 6A). Neuronal cells in the corpus striatum and thalamus also expressed tendin (Fig. 6B). In the cerebellum, Purkinje cells and neurons in the cerebellar nuclei expressed high levels of tendin mRNA (Fig. 6C). In the cervical, thoracic, and sacral regions of the spinal cord, neuronal cells in the anterior and posterior horns were positive for tendin (Fig. 6D). In the cortex, neuronal cells of all layers were positive for tendin mRNA (Fig. 6E). Expression in glial cells was not detected by in situ hybridization. However, very low level of expression cannot be excluded.