Sp140 is a nuclear leukocyte-specific protein involved in primary biliary cirrhosis and a risk factor in chronic lymphocytic leukemia. The presence of several chromatin related modules such as plant homeodomain (PHD), bromodomain and SAND domain suggests a role in chromatin-mediated regulation of gene expression; however, its real function is still elusive. Herein we present the solution structure of Sp140-PHD finger and investigate its role as epigenetic reader in vitro. Sp140-PHD presents an atypical PHD finger fold which does not bind to histone H3 tails but is recognized by peptidylprolyl isomerase Pin1. Pin1 specifically binds to a phosphopeptide corresponding to the L3 loop of Sp140-PHD and catalyzes cis–trans isomerization of a pThr-Pro bond. Moreover co-immunoprecipitation experiments demonstrate FLAG-Sp140 interaction with endogenous Pin1 in vivo. Overall these data include Sp140 in the list of the increasing number of Pin1 binders and expand the regulatory potential of PHD fingers as versatile structural platforms for diversified interactions.
Human Sp140 is an interferon inducible, 85.9 kDa nuclear leukocyte-specific protein expressed in mature B cells, plasma cell lines and in some T cells . Originally identified as autoantigen in the serum of a patient affected by primary biliary cirrhosis , it is also implicated in innate immune response to HIV-1 by its interaction with the virus Vif protein . Importantly, a genome-wide association study of 299 983 tagging single-nucleotide polymorphisms for chronic lymphocytic leukemia (CLL) showed that Sp140 is a CLL risk locus . In this case a significant dose relationship between genotype and Sp140 expression in lymphocytes was demonstrable, with risk alleles associated with reduced levels of mRNA . In accordance with these data, 16 single-nucleotide polymorphisms were recently suggested to be involved in the etiology of CLL and linked to a decreased Sp140 expression by means of expression quantitative trait loci analysis . Sp140 was also identified in a large set of new genes supposed to drive the development of CLL and displaying somatic mutations in CLL with relevant clinical correlates . Sp140 localizes to LYSp100-associated nuclear dots (LANDs) in B-lymphocytic cell lines  and as a member of the Sp100 family of proteins it is also found in promyelocytic leukemia nuclear bodies (PML-NBs) of differentiated HL60 and NB4 cells and in adenovirus-Sp140-infected T24 and HeLa cells [1, 8]. Sp140 localization to PML-NBs, which are subnuclear structures involved in the regulation of gene transcription, cellular growth, apoptosis and maintenance of chromatin architecture , along with the presence of several chromatin related modules in its primary structure, suggest a role in chromatin-mediated regulation of gene expression. Indeed coactivator activity was inferred for Sp140 by virtue of its Gal4 DNA-binding domain fusion activity in transfected COS cells [10, 11]; it has therefore been hypothesized that Sp140 might regulate the expression of genes involved in CLL development . In line with its putative role in transcriptional regulation, Sp140 has strong sequence homology with autoimmune regulator (AIRE), a transcriptional activator governing the ectopic expression of peripheral tissue-specific antigens in the thymus . Similarly to AIRE, Sp140 harbors a nuclear localization signal, a dimerization domain (HSR or CARD domain), a SAND domain, and a plant homeodomain (PHD) finger (Fig. 1A). At variance with AIRE, which contains a second PHD finger, Sp140 harbors a bromodomain (BRD). Both BRD and PHD fingers are evolutionarily conserved ‘reader/effector’ modules that bind to specific histone post-translational modifications (PTMs) promoting chromatin changes and/or protein recruitment . The Zn2+ binding PHD finger (~ 60 amino acids) is the latest addition to the list of epigenetic readers. It is found in ~ 200 human proteins, many of which act as nucleosome interaction determinants playing a fundamental role in histone recognition and epigenetic mechanisms [14-16]. It can recognize the methylation status of histone lysines, such as histone H3 lysine 4 (K4me0 versus K4me3/2) or H3 lysine 36 (H3K36), to smaller degree the methylation state of H3R2 (R2me0 versus R2me2), and the acetylation state of H3K14 [15, 16]. Depending on the PTMs and the molecular context, decoding of histone H3 by PHD fingers can lead to gene activation or repression. Importantly, Sp140-PHD finger shares 52% sequence identity with AIRE-PHD1 and contains the typical N-terminal acidic hallmark usually suggestive of recognition of the unmodified histone H3 tail (H3K4me0; Fig. 1B). Notably, non-histone dependent functions have also been attributed to the PHD finger, which behaves as a versatile structural scaffold characterized by a wide functional diversity, ranging from protein–protein interaction hub to E3 ligase activity in sumoylation or ubiquitinylation reactions [14-17]. Despite several indications suggesting an implication of Sp140 in human malignancies, until now its function in both physiological and pathological conditions has remained extremely elusive and unexplored. As a first step towards the understanding of Sp140 function we have solved the solution structure of its PHD finger (Sp140-PHD) and investigated its possible role as epigenetic reader in vitro. Sp140-PHD presents an atypical PHD finger fold characterized by the presence of four short α helices and by cis–trans isomerization of a peptidyl–prolyl bond located in the variable L3 loop (according to the definition in ). Importantly, biochemical experiments and NMR titrations show that Sp140-PHD is not able to decode the histone H3 tail neither in its modified nor in its unmodified form. Conversely, NMR titrations provide evidence that the peptidylprolyl isomerase (PPIase) Pin1 binds directly to Sp140-PHD and is able to recognize a phosphorylated peptide corresponding to the Sp140-PHD finger L3 loop and to catalyze the rapid isomerization of its cis–trans peptidyl–prolyl bond. Importantly, co-immunoprecipitation experiments in cells transfected with FLAG-Sp140 demonstrate its interaction with endogenous Pin1 in a cellular context. Although Sp140 function needs further studies, data provided by this study include Sp140 in the list of the increasing number of Pin1 targets and suggest a Pin1-regulated modulation of the biological role of Sp140-PHD finger.
Sp140-PHD solution structure shows prolyl isomerization in the L3 loop
We solved the solution structure of Sp140-PHD finger by multidimensional heteronuclear NMR spectroscopy. The recombinant protein (Met687-Ser738) behaves as a monomer in solution, as assessed by its rotational correlation time (tc ~ 4.7 ns) determined from 15N relaxation data. This is in agreement with the expected value for a folded 6 kDa protein. Importantly, the 1H-15N HSQC (heteronuclear single quantum coherence) spectrum presented peak duplication for 24 amide signals, compatible with the presence of two conformations in slow exchange. We hypothesized that the two sets of peaks might arise from propagation of structural changes due to cis–trans isomerization around the Thr726-Pro727 imide bond, a sequence which is known to favor this conformational rearrangement [19, 20]. Indeed, mutation of Pro727 into Ala removed peak duplication in the 1H-15N HSQC spectrum, confirming our hypothesis (Fig. 1C). Peptidyl–prolyl bond conformations were assigned on the basis of the proline diagnostic chemical shift difference Δ= δ13Cβ − δ13Cγ, which showed Δ values of 4.43 and 9.67 ppm for the trans and cis configurations, respectively . Further evidence was obtained from 13C-edited nuclear Overhauser effect spectroscopy (NOESY), which showed two sets of NOE cross-peaks between HαThr726-HδPro727 and HαThr726-HαPro727, typical for the trans and cis conformations, respectively (Fig. S1H,I) . Finally, volume integration of the duplicated amide cross-peaks in the 1H-15N HSQC spectrum indicated that at room temperature the two conformers were present with 66% in trans and 33% in cis. The exchange rate between the two conformations was too slow to be detected on the NMR timescale, as assessed by the absence of exchange peaks between cis and trans resonances. A single NMR data set contained the necessary information to simultaneously determine the structures of the two conformers  (Fig. 1D). In both families of structures the residues Leu690-Ile718, Cys730-Met735 adopt a well-defined tertiary structure with an rmsd of ~ 0.45 Å for backbone atoms and have all residues in the allowed regions of the Ramachandran plot (Table 1). Superposition of the backbone atoms of the structured regions of cis and trans structures indicates that the two conformers are virtually identical in these regions and that cis–trans isomerization increases structural heterogeneity in the L3 loop. Accordingly, the largest chemical shift differences between the two conformers were observed within this loop and in residues Phe703-Cys705 which are near in space to the L3 loop (Fig. S1A–G). In line with the paucity of the NOEs detected in this region, residues in L3 showed a reduction of the heteronuclear NOE intensities (Fig. S2).
Overall, the Sp140-PHD structure presents some peculiarities compared with the canonical PHD finger fold. First, one Zn2+ binding site, usually formed by a CysCysHisCys motif, is replaced by a CysCysHisHis motif (Fig. 1E,F). This coordination pattern was confirmed by several NOEs involving the metal coordinating residues (HβCys693 and Hδ2His713, HαCys696 and Hε1His717). Both Nδ1 of His713 and His717 are protonated and Zn2+ coordination occurs through the Nε2 of the two imidazole rings, as judged from their chemical shifts in the 2D 1H–15N long-range HMQC spectrum (data not shown). Importantly, the involvement of both His713 and His717 in metal coordination excludes the conserved Cys716 from the Zn2+ binding site, thus allowing its side chain to be in close proximity to the Cys704 thiol group. Notably, NOEs between Hβ atoms of Cys704 and Cys716 along with downshifts of their Cβ resonances (50.9 and 42.8 ppm for Cys704 and Cys716, respectively) indicate the presence of a disulfide bond between these two cysteines (Fig. 1E,F). Another structural peculiarity of Sp140-PHD consists in the presence of two α helices involving residues Asp706-Val711 (α2) and His713-His717 (α3), respectively (Fig. 1D). Notably, in other PHD finger structures (e.g. AIRE-PHD1), residues corresponding to Val711-His713 usually form the second strand of a short antiparallel β-sheet, which is absent in Sp140-PHD (Fig. S4). A search for structural homologues using the dali server  failed to identify any structural neighbor, indicating that Sp140-PHD belongs to a structurally different class of PHD fingers. Indeed, despite the high sequence identity, superposition of Sp140-PHD onto the AIRE-PHD1 structure shows a high rmsd (7.03 Å) on 50 equivalent residues (Fig. S4). Finally, a small hydrophobic cluster, composed of Phe703, Val721, Ile731, stabilizes the structure. Notably, the conserved Trp728, which is usually part of the hydrophobic core of the PHD finger fold, is partially accessible or totally exposed in the cis and trans conformer, respectively (Fig. 1E,F).
Sp140-PHD does not bind to histone H3 tail peptides
To investigate the possible role of Sp140-PHD in chromatin-regulating complexes, we examined its putative binding to histone tails. Prompted by the presence of a conserved N-terminal acidic hallmark, suggestive of a binding preference for the unmodified histone H3 tail (Fig. 1B), we analyzed binding of Sp140-PHD to a non-methylated peptide corresponding to the first 15 amino acids of histone H3 (H3K4me0) by using 2D 1H-15N NMR. Upon addition of a fivefold excess (1 mm) of H3K4me0 into 15N-labeled Sp140-PHD we did not observe any interaction, as assessed by the absence of peak displacement in the 1H-15N HSQC spectrum (Fig. S5A). Further NMR titrations of 15N Sp140-PHD with other H3 peptides bearing different epigenetic marks, such as H3K4me3, H3R2me2a (asymmetric di-methylation of R2) and H3K9ac, or with unmodified peptides corresponding to H3 (17–29) or H4 (1–10) did not show any binding (Fig. S6). Similar negative results were obtained with Sp140-PHDPro45Ala mutant (data not shown). To test whether other histone post-translational modifications and/or combinations thereof might be crucial for a possible interaction with Sp140-PHD, we performed binding assays using the MODified™ Histone Peptide Array (Active Motif, Carlsbad, CA, USA). The array contains 384 peptides (19 amino acids long) in various combinations of known and hypothetical modification states of the H3, H4, H2A and H2B histone tails. Despite the extensive coverage of histone modifications we did not observe any specific binding to GST-Sp140-PHD (Fig. S5B). We hypothesize that one of the reasons determining the lack of interaction might be related to some Sp140-PHD structural peculiarities. On the one hand we observed that the preformed anchoring pocket usually exploited by PHD fingers to anchor the positively charged N-terminus of histone H3 (e.g. AIRE-PHD1; Fig. S7A)  is absent or partially covered by Trp728 in Sp140-PHD trans and cis conformers, respectively (Fig. S7B,C). On the other hand, in both Sp140-PHD conformers the conserved aspartate in position 9 (Fig. 1B), which is usually a fundamental residue for the recognition of H3K4me0, is unfavorably oriented pointing in the opposite direction with respect to the canonical histone binding surface (Fig. S7).
Human Pin1 binds to the phosphorylated peptide corresponding to Sp140-PHD L3 loop and catalyzes the isomerization of the pThr-Pro bond in vitro
We next wondered whether the peptidyl–prolyl cis–trans isomerization observed in Sp140-PHD might have a functional relevance, as this conformational exchange process is emerging as a versatile regulatory strategy to modulate cell signaling, protein transcription, transport degradation and/or localization [26-29]. In this context, human PPIase Pin1 plays a fundamental role catalyzing the cis–trans isomerization of phosphorylated Ser/Thr-Pro peptide bonds in an increasing number of targets [30, 31]. As Pin1 is emerging as a mediator of immune cell function [32, 33], we asked whether the PHD of the leukocyte-specific protein Sp140 might be a substrate for human Pin1. With this aim we first tested in vitro Pin1 enzyme activity on two peptides, EAERpTPWN and EAERTPWN, corresponding to Sp140-PHD L3 loop (Glu722-Asn729) with or without threonine phosphorylation, respectively. Because of the slow exchange rate of the peptidyl–prolyl cis–trans isomerization, several residues in both the free peptides displayed two distinct sets of 1H signals in 2D ROESY experiments (Fig. 2A,B). The cis and trans populations of the peptides were 15% and 85%, respectively, as estimated from 1D 1H and 2D 1H-13C HSQC spectra at room temperature. Exchange cross-peaks were absent in the ROESY spectra of the free peptides, indicating that the exchange regime between the two conformations was too slow to be detected on the NMR timescale (Fig. 2A left, B left). Notably, addition of catalytic amounts of Pin1 to EAERpTPWN accelerated the isomerization rate of the phosphothreonine–prolyl bond, as shown by the appearance of exchange cross-peaks in the ROESY spectrum (Fig. 2A, right). As expected, in the presence of Pin1 no exchange peaks were observed for the non-phosphorylated control peptide (Fig. 2B, right).
We next determined the binding site of EAERpTPWN on Pin1, performing NMR based chemical shift mapping assays. To this end 2D 1H-15N HSQC spectra of full-length 15N-labeled Pin1 were recorded to monitor possible changes in Pin1 1H-15N chemical shifts upon successive additions of unlabeled peptides. A comparison of the spectra in the absence and presence of a fivefold excess (1 mm) of Sp140 peptides showed that only EAERpTPWN was able to bind Pin1, as revealed by the numerous peak displacements observed upon addition of the phosphopeptide (Fig. 2C, Fig. S8A). The unphosphorylated peptide did not show any binding evidence, confirming the phospho dependence of the Pin1–peptide interaction (Fig. S8B). The complex between Pin1 and EAERpTPWN was in the fast exchange regime on the NMR chemical shift timescale (Fig. 2C) with a dissociation constant of 138 ± 4 μm (Fig. 2D). Pin1 residues exhibiting significant amide chemical shift changes (Fig. 2E) where mapped on the Pin1 crystallographic structure (pdb code http://www.rcsb.org/pdb/search/structidSearch.do?structureId=1PIN). The binding surface mainly involved the β-sheet of the WW domain (Lys13-Ser16, Gly20, Val22-Asn26, Ala31, Gln33-Arg35; Fig. 2F). Chemical shift changes were observed also on the flexible linker (Ser41, Ser43, Lys46) and on the region of the PPIase domain facing (Lys97-Glu100) or nearby (Phe139-Arg142) the WW domain, probably induced by long-range conformational rearrangements upon complex formation. Overall, these data indicate that Pin1 binds to EAERpTPWN and catalyzes the cis–trans isomerization of its phosphothreonine–proline bond in vitro.
Pin1 recognizes the Sp140-PHD scaffold independently of phosphorylation
We next asked whether Pin1 was able to recognize the entire Sp140-PHD finger scaffold and we performed NMR binding assays titrating 15N-labeled Pin1 with unlabeled Sp140-PHD. Notably, upon addition of sub-stoichiometric amounts of Sp140-PHD a number of Pin1 resonances shifted in the 1H-15N HSQC spectrum. At equimolar ratio several Pin1 peaks disappeared broadening out from the spectrum, indicating binding in the intermediate exchange regime. Upon addition of a 1.5 excess (0.3 mm) of Sp140-PHD almost all Pin1 peaks disappeared, with the exception of residues from the flexible N-terminus (Fig. 3A–C). Interestingly, analogous line broadening effects have been observed in response to binding to Pin1 of the full-length substrate stem-loop binding protein . Similarly to what was observed in the titration with the phosphopeptide, residues shifting upon Sp140-PHD addition involved the WW domain (Ser16, Gly20-Asn26, Thr29, Ser32, Gln33, Glu35) and the flexible linker (Ser41, Ser43, Gly45). Most importantly, spectral perturbation propagated throughout the protein involving additional residues in the PPIase domain around the catalytic pocket (Thr152, Asp153, Ser154, Gly155) and around the basic cluster (Lys63, His64, Arg69), suggesting that accommodation of the full-length substrate in the interdomain space induces further interactions and/or conformational rearrangements with respect to the phosphopeptide (Fig. 3D,E). Titrations with Sp140-PHDThr726Asp, a mutant mimicking the phosphorylation of Thr726, led essentially to similar results (Fig. S9), suggesting that Pin1 is able to recognize the PHD finger scaffold independently of phosphorylation. In this context we cannot exclude that the aspartate mimics the phosphorylation only partially, as it is smaller, unbranched and with a lower charge density with respect to a phosphorylated threonine. The reverse titration of 15N Sp140-PHD with unlabeled Pin1 confirmed the interaction with both the trans (Fig. 4) and cis (Fig. S10) conformers, as assessed by peak shifting and broadening upon addition of Pin1 (Fig. 4A–C). Sp140-PHD residues shifting in the presence of sub-stoichiometric amounts of Pin1 (0.1 mm; Sp140-PHD : Pin1 1 : 0.5) included not only the L3 loop (Ala41-Cys48) but also amino acids located on α2, α3 and α4 helices (Val711, Phe712, Asp715, Cys716, Ile717, Met735; Figs 4D–F, S10), suggesting either a direct contact or long-range conformational effects upon binding. As expected, despite Pin1 direct interaction with Sp104-PHD, it was not able to catalyze cis–trans isomerization, as assessed by the unaltered peak volume ratio (33% cis and 66% trans) observed in the 1H-15N HSQC spectra in the presence of a catalytic amount of Pin1. Taken together these data indicate that in vitro Pin1 is able to recognize the Sp140-PHD finger scaffold but does not catalyze cis–trans isomerization.
Sp140 interacts in vivo with Pin1
To verify whether the interaction between Sp140 and Pin1 occurs also in vivo we performed a co-immunoprecipitation assay in HEK293T cells, transiently transfected with FLAG-tagged Sp140 or with FLAG-tagged enhanced blue fluorescent protein (EBFP) as control. As reported in Fig. 5, anti-Pin1 western blot analysis of the FLAG immunoprecipitation shows that Pin1 is co-precipitated only with Sp140 and not in the control, thus demonstrating that the two proteins can interact also in vivo.
In recent years the PHD finger domain, one of the most recurrent domains in nuclear proteins, has been extensively investigated from both the structural and functional point of view [14, 15, 35, 36]. This small Zn2+ binding motif has emerged as a robust conserved scaffold with diversified activities: it can work not only as an epigenetic reader sensing the modification status of histone H3, but can also function as a general protein–protein interaction motif, thereby expanding its role in diverse cellular processes including transcriptional regulation and/or signal transduction . Its high functional versatility relies on the low secondary structure content and on subtle but significant changes in amino acid compositions contributing to the domain functional and structural plasticity. In this context the structure of the PHD finger of the leukocyte-specific nuclear Sp140 protein represents a paradigmatic example for the structural and functional versatility attributed to this domain. In fact, structural comparison with AIRE-PHD1 reveals for Sp140-PHD an unexpected switch from an α/β to an all α-helical fold, conceivably imputable to few differences in the primary structure (Fig. 1B). For example, the presence in Sp140-PHD of a glutamate residue in position 12 (an alanine in AIRE-PHD1) favours the formation of a salt-bridge with Arg697, thus stabilizing a helical turn in this region. A second helix (α2), encompassing residues Asp706-Val711, is similarly stabilized by electrostatic interactions. In AIRE-PHD1 formation of this helix is probably hindered by the presence of a proline in position 27. In Sp140-PHD helix α2 is immediately followed by a third helix (α3) in which His713 and His717 form a helical zinc anchor site  that coordinates together with Cys693 and Cys696 the second Zn2+ ion, thus replacing the canonical CysCysHisCys coordination scheme. The helix-turn-helix arrangement involving α2 and α3 impairs the formation of the short β-strands usually encompassing residues in positions 20–22 and 29–31. A further Sp140-PHD structural peculiarity consists in the unprecedented presence of cis–trans peptidyl–prolyl isomerization (Thr726-Pro7275 imide bond) in the variable L3 loop, conferring high structural heterogeneity to this region. Notably, in PHD fingers specialized in histone H3 tail recognition, this loop forms a narrow cavity to accommodate the positively charged N-terminus of histone H3 [14, 15, 35, 36]. This preformed pocket is absent in the trans conformer and partially covered by Trp726 in the cis conformer (Fig. S7). Considering the importance of the H3A1 pocket as an anchoring element for histone H3 recognition, we hypothesize that the absence of an appropriate binding surface in this region might be one of the structural determinants hampering Sp140-PHD binding to histone H3 tail peptides in vitro. In this context, it should also be noted that the conserved aspartate in position 9 (Fig. 1B), which is considered the hallmark for unmethylated H3K4 recognition, points in the opposite direction with respect to the canonical histone binding surface (Fig. S7B,C). It is therefore conceivable that the combination of all these structural features strongly compromise the ability of Sp140-PHD to recognize the N-terminal part of the histone H3 tail, suggesting that the so-called acidic hallmark is not sufficient to predict recognition of unmodified H3K4. At this stage we cannot exclude that in vivo the Thr726-Pro727 bond isomerization, together with Thr726 phosphorylation, might play a role in chromatin–Sp140 interactions, as regulatory mechanisms via cis–trans peptidyl–prolyl isomerization are not unusual in the context of epigenetic readers. A remarkable example is offered by the MLL1 PHD-BRD cassette, where a cis–trans proline within the domain linker binds to the proline isomerase Cyp33, causing dramatic conformational changes in the domain orientation and preventing H3K4me3 interaction, ultimately resulting in HOX target gene repression . In the context of full-length Sp140 we also hypothesize that its putative involvement in chromatin interactions might be promoted and/or reinforced by other chromatin related domains, such as the SAND domain, a DNA binding module , and/or the BRD, another epigenetic reader specialized in the decoding of acetylated histones . Notably, the L3 loop is characterized by high sequence and structural variability within the PHD family, contributing to the functional versatility of PHD fingers . In fact, previous studies aimed at engineering the PHD finger scaffold with tailored functions have shown that grafting of CtBP2 binding motif into the L3 loop of Mi2β-PHD resulted in a functional domain switch . Importantly, the L3 loop constitutes the structural determinant for the binding of several PHD fingers to non-histone proteins. This is the case for Pygo-PHD, where an α helix within the L3 loop constitutes the interaction surface with the homology domain 1 of BCL9 . In line with this concept, we reasoned that the Sp140-PHD L3 loop, which is conserved among different mammalian species (Fig. S3), might play a relevant role in Sp140-PHD function. Prompted by the cis–trans isomerization of the Thr726-Pro727 bond, we asked whether Sp140-PHD could be a substrate for Pin1, a unique human PPIase which is able to catalyze the cis–trans isomerization of the phosphorylated Ser/Thr-Pro bond [43, 44]. Pin1 is a component of the nuclear speckles macromolecular complex, including cell cycle proteins as well as elements of the splicing machinery . Depending on specific target sites and local structural constraints, human Pin1 catalyzes cis to trans or trans to cis isomerization, thereby modifying the conformation, the stability and the activity of phosphorylated target proteins. In this respect the enzyme acts as an effective molecular timer playing a significant role in biological and pathological processes such as immune and cellular stress responses, microbial infections, cancer, cell cycle progression, growth-signal and gene regulation [30, 31, 45]. Herein we showed by co-immunoprecipitation experiments in HEK293T cells transfected with FLAG-Sp140 that Sp140 interacts in vivo with endogenous Pin1. Moreover, in vitro NMR binding experiments show that Pin1 specifically binds to a phosphopeptide corresponding to the L3 loop of Sp140-PHD, thus catalyzing cis–trans isomerization of the pThr-Pro bond. NMR-based chemical shift mapping indicates that the phosphopeptide mainly targets the WW domain. The chemical shift difference pattern is highly reminiscent of previously described interactions with other Pin1 phosphopeptide targets, which also bind to Pin1 with micromolar affinity and target mainly the WW domain [46-48]. Importantly, binding experiments performed with the entire Sp140-PHD domain showed that in vitro Pin1 recognizes the whole PHD scaffold, independently from threonine phosphorylation in L3, but does not catalyze cis–trans isomerization of the peptidyl–prolyl bond. In line with the NMR results obtained on phosphorylated and non-phosphorylated L3 loop peptides, the absence of isomerization catalysis of recombinant Sp140-PHD by Pin1 is probably due to a missing phosphorylation on Thr726. It is noteworthy that the residues whose backbone chemical shifts were affected by addition of Sp140-PHD are not only located in the WW domain but propagate throughout the domain involving the flexible interdomain linker and the PPIase domain, within and in spatial proximity to the basic cluster and the catalytic site. These results are in agreement with a recent study suggesting a non-catalytic participation of the PPIase domain in target binding . However, the molecular mechanisms governing Pin1 interdomain rearrangement, substrate recognition and peptidyl–prolyl bond isomerization are still largely debated, as no 3D structure is available describing Pin1 interaction with a full-length substrate. Taken together, the data presented in this work provide an example of how malleable the PHD fold can be, thus expanding its regulatory potential as a versatile structural platform for diversified interactions. In this context Pin1 phosphorylation dependent cis–trans isomerization of the Thr726-Pro727 bond could act as a molecular switch to modulate Sp140 cellular fate and its interaction with chromatin. Herein this mechanism might then orchestrate the crosstalk between several Sp140 PTMs such as phosphorylation, acetylation, ubiquitylation and sumoylation, thus determining Sp140 turnover and/or cellular localization.
Sp140 PHD finger expression and purification
Human Sp140-PHD finger (residues Met687-Ser738, http://www.ncbi.nlm.nih.gov/protein/NM_007237) was cloned into NcoI/KpnI sites of pETM11 expression vector (EMBL). The vector expresses the domain with N-terminal His6 tag, removable by cleavage with TEV (tobacco etch virus) protease. Site-directed mutations were made by standard overlap extension methods. BL21 (DE3) Escherichia coli cells were induced overnight at 30 °C with 1 mm isopropyl thio-β-d-galactoside (IPTG), in LB medium supplemented with 0.2 mm ZnCl2. Cells were sonicated in buffer containing 20 μg·mL−1 RNase A, 2 μg·mL−1 DNase I, 150 mm NaCl, 20 mm Tris/HCl pH 8, 10 mm imidazole pH 8, 0.2% NP-40, 50 μm ZnCl2, 0.4 mm dithiothreitol and complete EDTA-free (Roche, Mannheim, Germany). The His6-tagged protein was purified on an Ni-nitrilotriacetic acid (NTA) column (GE Healthcare, Uppsala, Sweden) and eluted with 150 mm NaCl, 20 mm Tris/HCl pH 8, 50 μm ZnCl2, 2 mm β-mercapto-EtOH and 300 mm imidazole pH 8. The His6 tag was cleaved off during overnight dialysis at 4 °C, by addition of His6-tagged TEV protease (home-made). The TEV protease was then removed by purification on an Ni-NTA column; Sp140-PHD finger was further purified by size exclusion chromatography (HiLoad 16/60 Superdex 30 pg column; GE Healthcare). The final buffer contained 20 mm Na2HPO4/NaH2PO4 pH 6.3, 150 mm NaCl, 5 mm dithiothreitol and 50 μm ZnCl2. Protein identity was confirmed by mass spectroscopy. Uniformly 15N- and 13C-15N-labeled Sp140-PHD finger was expressed by growing E. coli BL21 (DE3) cells in minimal bacterial medium containing 15NH4Cl, with or without 13C-d-glucose. For binding assays with histone peptide arrays the Sp140-PHD finger was cloned into NcoI/KpnI sites of pETM30 expression vector (EMBL). Purification of the His6-GST-tagged protein was performed as described above, without cleavage of the GST-fusion protein and by size exclusion chromatography on a HiLoad 16/60 Superdex 75 pg column (GE Healthcare).
Pin1 expression and purification
His8-tagged human Pin1 (plasmid kindly provided by J. P. Noel, Salk Institute for Biological Studies, CA, USA) was expressed in E. coli BL21 (DE3) cells, induced for 4 h at 37 °C with 0.2 mm IPTG. Sonication buffer was 150 mm NaCl, 20 mm Tris/HCl pH 8, 10 mm imidazole pH 8, 0.2% NP-40, 4 mm β-mercapto-EtOH and complete EDTA-free (Roche). His8-tagged Pin1 was purified on an Ni-NTA column (GE Healthcare) and eluted with 150 mm NaCl, 20 mm Tris/HCl pH 8, 4 mm β-mercapto-EtOH, 300 mm imidazole pH 8. Protein was dialysed overnight against 20 mm Tris/HCl pH 7.2, 4 mm β-mercapto-EtOH, 150 mm NaCl and it was further purified by size exclusion chromatography (HiLoad 16/60 Superdex 75 pg column; GE Healthcare). Protein identity was confirmed by mass spectrometry. The final buffer contained 150 mm NaCl, 20 mm Tris/HCl pH 6.6 and 5 mm dithiothreitol. Uniformly 15N-labeled Pin1 was expressed by growing E. coli BL21 (DE3) cells in minimal bacterial medium containing 15NH4Cl. The Pin1 1H-15N HSQC spectrum was assigned based on the human Pin1 1H and 15N backbone chemical shifts deposited in the BMRB data bank (entry 5305) . The deposited assignment is incomplete; the following residues were missing in the entry and were therefore not assigned in our spectra: Ser19, Gln75, Glu76 and Glu145 (numbering scheme according to Pin1 crystallographic structure, http://www.rcsb.org/pdb/search/structidSearch.do?structureId=1PIN.pdb). The peaks corresponding to Gly39, Asn40, Gly44, Gln49, Gly50, Ser114 and Lys132 were not detectable in our Pin1 1H-15N HSQC spectra, probably because of solvent exchange phenomena.
NMR spectroscopy and resonance assignments
NMR experiments were performed at 295 K on Bruker Avance 600 and 900 MHz spectrometers equipped with inverse triple resonance cryoprobe and pulsed field gradients. Data were processed with nmrpipe  or topspin 2.0 (Bruker, Karlsruhe, Germany) and analyzed using ccpnmr . Sp140-PHD finger sample concentrations were 0.8–1.2 mm in 20 mm NaH2PO4/Na2HPO4 pH 6.3, 150 mm NaCl, 5 mm dithiothreitol, 50 μm ZnCl2 and 10% or 100% (v/v) D2O. 1H, 15N and 13C backbone resonances were assigned through the following 2D and 3D experiments: 1H-15N HSQC, 1H-13C HSQC, HNCA, HNCO, CBCA(CO)NH, CBCANH. 1H and 13C side chain resonances were obtained by 2D and 3D experiments (1H-1H TOCSY, HCCH-TOCSY, CC(CO)NH and HCC(CO)NH). The tautomeric state of the histidine rings was determined by performing a long range 1H-15N HMQC, optimized to detect J-couplings in histidine side chains (J(HN) = 22 Hz) . Proton–proton distance constraints were obtained from 15N and 13C separated 3D NOESY and from 2D 1H-1H NOESY spectra in H2O and D2O (120 ms mixing time). 3J(HN, Ha) coupling constants were measured to derive restraints for Φ dihedral angles. Additional Φ/Ψ restraints were obtained from backbone chemical shifts using talos+ . 1H-15N residual dipolar couplings were measured in isotropic and anisotropic phases created by the addition of 20 mg·mL−1 Pf1 phage (ASLA Biotech Ltd, Riga, Latvia). Heteronuclear 1H-15N NOEs as well as longitudinal and transversal 15N relaxation rates were measured using standard 2D methods . The relaxation delays were applied in an interleaved manner. The T1 and T2 decay curves were sampled at 14 (50–2600 ms) and 12 (14.4–244.8 ms) different time points, respectively. The heteronuclear NOE experiments were run twice in an interleaved fashion with and without (reference spectrum) proton saturation during proton recovery delay. The relaxation experiments were analyzed with the program nmrview.5.03 .
The structures of Sp140-PHD finger in trans and cis conformation were separately calculated using aria 2.3.1 software  in combination with cns using the experimentally derived restraints (Table 1). NOESY spectra were manually assigned and calibrated by aria 2.3.2. A total of eight iterations was performed, 100 structures were computed in the last two iterations and aria default water refinement was performed on the 20 best structures of the final interaction. In aria 2.3.2, the geometry of the Zn2+ coordination is fixed through covalent bonds and angles in the cns parameters; the tetrahedral angles and distances for Zn2+ coordinating residues were maintained also after water refinement. Several NOEs were observed between Cys704 and Cys716 revealing spatial proximity between the two residues. Calculations with or without imposing the disulfide bond between Cys704 and Cys716 were virtually identical. Structural quality was assessed using procheck-nmr  and molecular images were generated by pymol (http://pymol.org/). The family of the 20 lowest energy structures for Sp140-PHD finger in trans and cis conformations has been deposited in the Protein Data Bank with the accession codes 2md7 and 2md8 for the cis and trans conformation, respectively. Chemical shift and restraints lists that were used in the structure calculations have been deposited in BioMagResBank (19472 and 19473 for the trans and cis conformations, respectively).
NMR binding assays
For NMR binding assays the following synthetic peptides were purchased from Caslo Lyngby, Denmark: H3K4me01-15 ARTKQTARKSTGGKA; H3K4me3 (ARTKme3QTARKS, ARme2aTKQTARKS (asymmetric di-metylation of R2); H3K9ac (ARTKQTARKacS); H4 peptide (SGRGKGGKGL); phosphorylated EAERpTPWN and non-phosphorylated EAERTPWN (both with N-acetylation and C-amidation). Peptide purity (> 98%) was confirmed by HPLC and mass spectrometry. Titration of 15N Sp140-PHD with histone peptides was performed in 20 mm phosphate buffer, pH 6.8, 2 mm dithiothreitol, 150 mm NaCl. For NMR binding assays involving Pin1 all NMR titrations were performed in the absence of phosphate salts, in a buffer containing 20 mm Tris/HCl pH 6.6, 150 mm NaCl and 5 mm dithiothreitol, to avoid the presence of inorganic phosphate which inhibits binding between Pin1 and its targets . Protein concentrations were determined by UV spectroscopy using the predicted extinction coefficients of ε280 5599 m−1·cm−1 and 20 970 m−1·cm−1 for Sp140-PHD and Pin1, respectively. Peptide concentrations were estimated from their mass. In order to minimize dilution and NMR signal loss, titrations were carried out by adding to the protein samples small aliquots of concentrated (15 mm) peptide stock solutions. For each titration point (typically 0.25, 0.5, 0.75, 1, 1.5, 2, 3, 4, 5 equivalents of ligand) a 2D water-flip-back 15N-edited HSQC spectrum was acquired with 512 (100) complex points, 55 ms (60 ms) acquisition times, apodized by 60° shifted squared (sine) window functions and zero filled to 1024 (512) points for 1H (15N), respectively. Assignment of the complex Pin1 : EAERpTPWN was made by following individual cross-peaks through the titration series. For each residue the weighted average of the 1H and 15N chemical shift difference (CSD) was calculated as CSD = [(Δ2HN + Δ2N/25)/2]1/2 . The binding constant of EAERpTPWN to Pin1 was estimated by monitoring the variation of CSD of individual peaks (nine peaks: Lys13, Arg14, Gly20, Asn30, Trp34, Glu35, Lys46, Phe139, Trp34ϵ). Assuming a simple binary reaction between protein and peptide, dissociation constants were obtained from least squares fitting of CSD as a function of total ligand concentration according to
with a = (Ka/δb)[Pt], b = 1 + Ka([Lti] + [Pt]) and c = δbKa[Lti], where δi is the absolute change in chemical shift for each titration point, [Lti] is the total ligand concentration at each titration point, [Pt] is the total protein concentration, Ka = 1/Kd is the binding constant and δb is the chemical shift of the resonance in the complex. Kd and δb were used as fitting parameters using the origin program.
To monitor binding of Sp140-PHD to 15N-Pin1, a concentrated solution of unlabeled Sp140-PHD (2.5 mm) was stepwise titrated into a 0.2 mm solution of 15N-Pin1 up to a 1.5 molar excess (0.3 mm Sp140-PHD), and each titration step was monitored by recording a 2D 1H-15N HSQC spectrum. The reverse titration was also performed adding a concentrated solution of unlabelled Pin1 (1.8 mm) to a 0.2 mm solution of 1H-15N Sp140-PHD. Because of the large peak broadening effects in the HSQC spectra already at sub-stoichiometric protein : ligand ratio (1 : 0.25), it was not possible to determine a dissociation constant from CSD values and to proceed with the titration beyond a 1 : 1.5 molar ratio.
Enzyme activity analysis
NMR experiments were performed at T = 301 K on 6.5 mm solutions of EAERpTPWN or EAERTPWN peptides, corresponding to phosphorylated and unphosphorylated Sp140-PHD finger L3 loop, in 20 mm Tris/HCl, 150 mm NaCl and 5 mm dithiothreitol, 90% H2O and 10% D2O (pH 6.6), with or without 0.1 mm Pin1 (molar ratio Pin1 : peptide 1 : 65). 2D 1H-1H TOCSY and ROESY spectra were recorded with spectral widths of 7183.91 and 6009.61 Hz in t1 and t2 dimensions, respectively. ROESY spectra were acquired at a mixing time of 300 ms with 32 scans, while TOCSY spectra were recorded at a mixing time of 60 ms with 32 scans.
Histone overlay assays
MODifiedTM Histone Peptide Arrays were purchased by Active Motif®. They enable screening in a single experiment 59 acetylation, methylation, phosphorylation and citrullination modifications on the entire N-terminal tails of histones H2A, H2B, H3 and H4. A series of synthetic 19mer histone H2A, H2B, H3 and H4 peptides, each of which may contain as many as four modifications, are spotted in duplicate onto a glass slide, generating a total of 384 unique histone modification combinations. Following overnight blocking at 4 °C with 5% milk in TTBS buffer (10 mm Tris/HCl pH 7.4, 150 mm NaCl, 0.05% Tween 20), the array was washed twice with TTBS and once with binding buffer (50 mm Tris/HCl pH 7.5, 300 mm NaCl, 0.1% NP-40, proteinase inhibitors). The array was then incubated, for 2–4 h at room temperature, with 1 μm solution of GST-tagged recombinant protein in binding buffer. After three washes with binding buffer, the array was incubated with primary antibody anti-GST (1 : 1000) in 5% milk/TTBS for 1 h at room temperature. The array was then washed three times with TTBS and incubated for 1 h at room temperature with a secondary antibody HRP-conjugated (1 : 10 000) in 5% milk/TTBS. Three washes with TTBS followed, and ECLTM Western Blotting detection solution (GE Healthcare) was added and incubated on the array surface for 5 min at room temperature. The image was finally captured by the ImageQuant™ ECL image analysis system (GE Healthcare).
HEK293T transfection with FLAG-Sp140 and FLAG-EBFP
The FLAG-Sp140 was cloned in pFLAG-CMV-5a vector (Sigma, St. Louis, MO, USA). HEK293T cells were transiently transfected by using Turbofect reagent (Thermo).
Co-immunoprecipitation and western blot analysis
HEK293T cells transiently transfected with FLAG-Sp140 were cultured in DMEM with 10% fetal bovine serum and 5% glutamine whereas HEK293T cells transiently transfected with EBFP were used as control. Cells were harvested for 24 h and then lysed in JS buffer (75 mm NaCl, 50 mm HEPES pH 7.5, 1% glycerol, 1% Triton X-100, 1.5 mm MgCl2, 5 mm EGTA) for 20 min. Phosphatase and protease inhibitors were added. Lysate was centrifuged at 16 100 g. for 20 min at 4 °C. For co-immunoprecipitation experiments, lysates were precleaned and then incubated with anti-FLAG M2 affinity resin (Sigma) overnight at 4 °C. The day after, the sample was centrifuged at 1000 g for 1 min and the supernatant was discharged. Beads were washed three times with the JS buffer. The bound proteins were eluted by using the FLAG-peptide 1X from Sigma for 40 min at 4 °C by end-over mixing and run on a 4–12% SDS/PAGE gel (Invitrogen, Carlsbad, CA, USA). Western blot analysis was carried out using an antibody rabbit anti-FLAG (Antibodies-online) and an antibody rabbit anti-Pin1 (Abcam, Cambridge, UK).
Multiple sequence alignment
Sp140 protein sequence alignments and the phylogenetic tree were obtained from Ensembl gene tree ENSGT00510000046835 . The alignments and phylogenetic tree were edited with jalview  and treegraph 2 , respectively.
We thank Professor Joseph P. Noel (Salk Institute for Biological Studies, CA, USA) for PIN1 plasmid. GM thanks Dr Luca Mollica for acquiring NMR experiments in the first stage of the project and Telethon Foundation (TCP99035) and Associazione Italiana Ricerca sul cancro (Airc; grant no. 13159) for financial support. MS and PP were supported by the Estonian Research Agency grant IUT2-2, the Center of Excellence of Translational Medicine and Tartu University Development Fund (Center of Translational Genomics). ST conducted this study as partial fulfillment of his PhD in Molecular Medicine, Program in Cellular and Molecular Biology, San Raffaele University, Milan, Italy. CZ conducted this study as partial fulfillment of her PhD in biochemistry, University of Milan. We wish to thank CERM Infrastructure (Florence) for access to the 900 MHz spectrometer for NMR measurements.