De novo design of proteins housing excitonically coupled chlorophyll special pairs

Ennist, Nathan M.; Wang, Shunzhi; Kennedy, Madison A.; Curti, Mariano; Sutherland, George A.; Vasilev, Cvetelin; Redler, Rachel L.; Maffeis, Valentin; Shareef, Saeed; Sica, Anthony V.; Hua, Ash Sueh; Deshmukh, Arundhati P.; Moyer, Adam P.; Hicks, Derrick R.; Swartz, Avi Z.; Cacho, Ralph A.; Novy, Nathan; Bera, Asim K.; Kang, Alex; Sankaran, Banumathi; Johnson, Matthew P.; Phadkule, Amala; Reppert, Mike; Ekiert, Damian; Bhabha, Gira; Stewart, Lance; Caram, Justin R.; Stoddard, Barry L.; Romero, Elisabet; Hunter, C. Neil; Baker, David

doi:10.1038/s41589-024-01626-0

De novo design of proteins housing excitonically coupled chlorophyll special pairs

Article
Open access
Published: 03 June 2024

Volume 20, pages 906–915, (2024)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue Submit your manuscript

De novo design of proteins housing excitonically coupled chlorophyll special pairs

Download PDF

6214 Accesses
16 Altmetric
Explore all metrics

Abstract

Natural photosystems couple light harvesting to charge separation using a ‘special pair’ of chlorophyll molecules that accepts excitation energy from the antenna and initiates an electron-transfer cascade. To investigate the photophysics of special pairs independently of the complexities of native photosynthetic proteins, and as a first step toward creating synthetic photosystems for new energy conversion technologies, we designed C₂-symmetric proteins that hold two chlorophyll molecules in closely juxtaposed arrangements. X-ray crystallography confirmed that one designed protein binds two chlorophylls in the same orientation as native special pairs, whereas a second designed protein positions them in a previously unseen geometry. Spectroscopy revealed that the chlorophylls are excitonically coupled, and fluorescence lifetime imaging demonstrated energy transfer. The cryo-electron microscopy structure of a designed 24-chlorophyll octahedral nanocage with a special pair on each edge closely matched the design model. The results suggest that the de novo design of artificial photosynthetic systems is within reach of current computational methods.

De novo protein design of photochemical reaction centers

Article Open access 23 August 2022

Polychromatic solar energy conversion in pigment-protein chimeras that unite the two kingdoms of (bacterio)chlorophyll-based photosynthesis

Article Open access 24 March 2020

Light polarization dependency existing in the biological photosystem and possible implications for artificial antenna systems

Article 23 October 2019

Main

Photosynthetic proteins manipulate the distances and angles between chlorophyll (Chl) molecules to tune excitonic coupling and control absorption and fluorescence spectra, excited state dynamics, energy transfer and electron tunneling^1,2,3,4,5,6. This control enables light harvesting and charge separation with quantum yields of 97% or higher under favorable conditions^7,8. Natural photosynthesis can guide the development of synthetic biology for renewable fuel production, but only if we can determine the structure−function relationships required for efficient solar-to-fuel energy conversion and build new structures that exploit this knowledge. Chl special pairs have attracted great interest as primary electron donors, but the complexity of natural photosystems makes it difficult to study these Chl molecules directly. Model protein systems, such as the water-soluble Chl protein and the B820 dimer of LH1, allow the investigation of excitonic interactions between Chl molecules or between bacteriochlorophyll (BChl) molecules without spectral congestion from other pigments^9,10,11,12; however, the BChl and Chl dimers are not structural mimics of special pairs. The challenges of studying special pairs have inspired chemists to synthesize numerous small molecule mimics^{13,14,15,16,17}, which have provided valuable insights, but these can be labor-intensive to synthesize, overlook the role of protein matrix effects, which are important in native special pairs⁵, and lack the fine control over Chl−Chl distances and orientations needed to reproduce the precise geometries of native special pairs. De novo-designed Zn-tetrapyrrole monomer-binding proteins^{18,19,20,21,22,23,24,25,26} and de novo Chl dimer proteins^27,28,29,30 have contributed to the understanding of light harvesting and quenching of excitation energy, but no Chl dimer structures have been determined experimentally in designed proteins. Systematic methods for assembling Chl dimers with predefined geometries are lacking, making it difficult to correlate structure and function, and despite decades of active research, there has been no generalizable strategy for assembling Chl dimers that precisely match special pair geometries.

We reasoned that recent advances in computational protein design could enable the creation of stable, water-soluble proteins that assemble Chl dimers with predefined geometries. Binding a small molecule as a dimer is a computational challenge because the binding interface involves not only the protein but also the second small molecule, which has an independent set of rotational and translational degrees of freedom. To control these degrees of freedom, we sought to design homodimers with perfect two-fold cyclic (C₂) symmetry, which bind a C₂-symmetric Chl pair such that the C₂ symmetry axes of the protein and chromophore are coincident, similar to native reaction centers, which can have true C₂ symmetry^31,32 or pseudo-C₂ symmetry (Fig. 1). C₂ symmetry ensures that the two bound Chl molecules will have near-degenerate site energies, improving the resonance between pigment transitions necessary to create delocalized states³³. For Chl dimer protein scaffolds, we chose hyperstable C₂-symmetric repeat protein dimers containing symmetric pockets with tunable sizes and geometries^34,35,36. In this dimeric repeat protein architecture (Fig. 1c), the hydrophobic core is independent of the small molecule-binding site, enabling full customization for binding with little impact on the overall protein structure. Several thousand C₂-symmetric homodimers that sample a wide range of superhelical curvature, rise and radius parameters have been generated^34,35.

**Fig. 1: Computational design of Chl special pair proteins.**

Results

De novo protein design strategy

To investigate the effect of geometry on Chl−Chl coupling, we set out to design a range of C₂-symmetric dimers that hold two closely interacting Chl molecules in varied geometries, including the arrangement found in native special pairs. In native proteins, BChl or Chl molecules typically have a pentacoordinate central Mg(II) or Zn(II) ion with a His Nε atom as the axial ligand. For each chosen special pair geometry, we built a His rotamer interaction field and stored the possible His−Chl interaction geometries in a hash table (Fig. 1c; see Methods for details). For each geometrically compatible C₂ scaffold, we cycled through His−Chl rotamers from the hash table, aligned them to the scaffold C₂-symmetry axis and searched for matches of the His N−Cα−C backbone atoms with the backbone atoms of the residues lining the binding cavity. Scaffolds in which the His N–Cα−C backbone atoms aligned with the corresponding atoms in the protein backbone and could also accommodate the Chl dimer without clashes were redesigned using symmetric Rosetta FastDesign to optimize hydrophobic packing and hydrogen bonding around the Chl molecules³⁷ (Fig. 1c). Designs were filtered on the basis of the Rosetta full-atom energy, the solvent-accessible surface area of the Chl dimer, His rotamers and His Nε−metal ligation geometry. We selected 43 designs based on 13 unique scaffolds for experimental characterization (see Supplementary Table 1 for amino acid sequences). We also characterized an additional five redesigned variants of one of the initial 43 designs after determination of its X-ray crystal structure provided clues to improve its function (vide infra). The protein monomer sizes ranged from 20.6 to 28.4 kDa (179 to 261 amino acids). We refer to these 48 designs as Chl special pair proteins.

Special pair protein stability and pigment binding

Following special pair protein expression in Escherichia coli, SDS−PAGE gels showed that all 48 designs were present in the soluble fractions of the lysates. Proteins were purified using Ni-NTA agarose and size-exclusion chromatography (SEC) (Supplementary Fig. 1). All SEC traces exhibited protein absorption at the elution volume expected for homodimer formation. Of the 20 designs investigated by small-angle X-ray scattering (SAXS) in the apo state, 15 had SAXS profiles that suggested a three-dimensional (3D) shape consistent with the design model (Fig. 2, Supplementary Fig. 2 and Supplementary Table 2)^38,39. The slightly lower predicted radius of gyration (R_g) value compared with the experimental SAXS data is likely due to a dense hydration shell around the highly charged special pair proteins^40,41. The far-ultraviolet (UV) circular dichroism (CD) spectra of three special pair proteins that were expressed in high yield (≥140 mg l⁻¹) showed that these proteins were highly α-helical with and without the synthetic Chl a derivative Zn-pheophorbide a methyl ester (ZnPPaM). Thermal denaturation curves monitored by the CD signal at 222 nm indicated that all three proteins are highly thermostable in the apo and holo states (Fig. 2).

**Fig. 2: Folding, stability and ZnPPaM binding of special pair proteins.**

At longer wavelengths in the UV/visible/near-infrared (UV/vis/NIR) range, CD spectra can serve as a convenient probe of excitonic interactions between Chl molecules. Monomeric Chl molecules, including Chl a and ZnPPaM, exhibit asymmetric negative CD signals in the Q_y region near ~670 nm (Supplementary Fig. 3)⁴². However, when Chl dimers are arranged in chiral protein environments, excitonic interactions can produce delocalized transitions with chiral character, yielding CD signals that are stronger and conservative (that is, composed of a bisignate doublet that integrates to zero). Figure 2 shows that ZnPPaM molecules bound to the special pairs SP1, SP2 and SP3 proteins had bisignate CD features in the Q_y region (the red part of the spectrum), consistent with excitonic coupling between the Chl molecules. As shown in Supplementary Table 3, the Q_y CD features of SP2 and SP3 were substantially stronger relative to their Q_y absorption bands than the Q_y CD signal of monomeric ZnPPaM in organic solvent. ZnPPaM binding titrations of SP2 and SP3 monitored by CD in the Q_y region showed that the CD doublets are attributable to the binding of ZnPPaM dimers. Curve fitting of the CD titrations yielded SP2−ZnPPaM dissociation constants (K_d) of 300 nM for K_d1 and 2.5 μM for K_d2, and SP3−ZnPPaM K_d values of 800 nM for K_d1 and 1.0 μM for K_d2 (Supplementary Fig. 4). Because of the lower CD signal of the SP1−ZnPPaM complex, we instead used absorbance and fluorescence to measure the SP1−ZnPPaM interaction. Absorption titrations yielded curve fits with K_d1 and K_d2 values of 290 nM and 430 nM for SP1, 110 nM and 2.0 μM for SP2 and 350 nM and 940 nM for SP3, respectively (Supplementary Figs. 5 and 6). Fluorescence titrations analyzed using a 1:1 binding model of protein monomer to ZnPPaM yielded K_d estimates of 660 nM for SP1, 480 nM for SP2 and 120 nM for SP3; these K_d values approximate the average of K_d1 and K_d2 for each protein (Supplementary Fig. 7).

High-resolution X-ray crystal structures

Based on the results of the SEC, SAXS and spectroscopic experiments (Fig. 2 and Supplementary Figs. 1–7), we selected promising candidates for X-ray crystallographic structure determination. We solved the crystal structures of SP1 and SP2 and found that both had protein backbone conformations that matched the corresponding design models to within 1.7 Å Cα root mean square deviation (r.m.s.d.) (Fig. 3).

**Fig. 3: X-ray crystal structures of the designed special pair proteins.**

The X-ray crystal structure of SP1 was solved in the ZnPPaM-bound state to 2.0 Å resolution, revealing a special pair geometry closely matching that of purple photosynthetic bacteria (Fig. 3a–f). The rotameric state of the Zn-ligating His121 is identical to that in the design model, and several hydrophobic and T-stacking interactions form as designed. Hydrogen bonds to the ring E ketone group, which has been shown to be important for modulating special pair redox potentials⁴³, form with Gln10 in both ZnPPaM molecules, in agreement with the design model. Alignment of the tetrapyrrole rings of the SP1−ZnPPaM dimer with nine native BChl a special pairs from different species of purple bacteria gave r.m.s.d. values of 0.23–0.28 Å (refs. ^{44,45,46,47,48,49,50,51}). (See details of r.m.s.d. measurements in Methods.) For comparison, the special pairs of two crystal structures of the same Thermochromatium tepidum LH1−RC complex deviate from one another by 0.22 Å r.m.s.d. across the tetrapyrrole rings (Protein Data Bank (PDB) 3WMM and 5Y5S)^45,51. The r.m.s.d. between the ZnPPaM dimer in the SP1 crystal structure and its design model is 0.25 Å.

SP2 was intended to assemble a ZnPPaM dimer with a conformation substantially different from that of native special pairs to investigate the effect of dimer geometry. The SP2 crystal structure was solved in both the apo state and the ZnPPaM-bound state to 2.4 and 2.5 Å resolution, respectively. The apo- and holo-state amino acid backbones both agree with the SP2 design model to within 1.4 Å r.m.s.d. (Fig. 3g). The holo-state crystal structure has two copies of the SP2 dimer in the asymmetric unit; alignment of the two ZnPPaM dimers shows that their binding geometries are equivalent, with an r.m.s.d of 0.22 Å over the tetrapyrrole rings. The ZnPPaM molecules are ligated by His178, as in the SP2 design model. After alignment of the crystal structure and design model protein backbones, the corresponding tetrapyrrole rings are approximately coplanar. Despite the accuracy of the protein backbone design, the crystal structure shows that the ZnPPaM molecules are rotated and translated relative to the design model (3.5 Å r.m.s.d. across tetrapyrrole ring atoms). Compared with the apo-state crystal structure, the SP2 binding cavity widens by ~1.6 Å in the presence of ZnPPaM; this expansion provides the extra volume needed for the ZnPPaM molecules to adopt their unexpected conformation. While the ZnPPaM dimer in SP2 differs from the design model, the crystal structure nevertheless satisfies the objective of creating a non-native dimer geometry.

We did not succeed in solving the holo-state structure of SP3; however, we were able to solve a 3.05 Å-resolution apo-state structure of the closely related design SP3x, which shares 94% sequence identity with SP3. The SP3x homodimeric design model agrees with the X-ray crystal structure to 1.61 Å Cα r.m.s.d. (Supplementary Fig. 8).

Excitonic coupling between Chl molecules

The absorption and fluorescence spectra of native special pairs are shifted compared with those of monomeric BChl or Chl molecules, in part due to excitonic coupling between the BChl or Chl molecules, which enables them to act as exciton traps^5,6,52,53. Absorbance, fluorescence and circular dichroism measurements along with time-dependent density functional theory calculations suggest that the chlorophyll molecules in our special pair proteins are also excitonically coupled. The SP2−ZnPPaM dimer absorbance spectrum exhibited splitting of the Q_y band in solution. Analysis of SP2−ZnPPaM absorbance in binding titrations (Fig. 4a and Supplementary Fig. 5) showed that the Q_y transition of monomeric ZnPPaM in SP2 had an absorbance maximum at 669 nm with an extinction coefficient (ε_669nm) of 49,900 M⁻¹ cm⁻¹, whereas the SP2−ZnPPaM dimer spectrum had its Q_y maximum slightly shifted to 668 nm with a decreased ε_668nm of 38,200 M⁻¹ cm⁻¹. While the monomer had no discernable spectroscopic feature at 690 nm (its ε_690nm was 9,400 M⁻¹ cm⁻¹), the SP2−ZnPPaM dimer spectrum had a distinct shoulder with ε_690nm of 17,700 M⁻¹ cm⁻¹.

**Fig. 4: Spectral shift on ZnPPaM dimer binding in the SP2 protein.**

To investigate the origin of the SP2−ZnPPaM bands at 668 and 690 nm and rule out the possibility that they represent two populations of distinct ZnPPaM oligomers, we collected low-temperature absorption and fluorescence spectra (Fig. 4b,c and Supplementary Fig. 9; see Methods for details). We prepared a dimer sample with 2.0 molar equivalents of ZnPPaM per SP2 protein dimer and a monomer sample with only 0.3 molar equivalents per protein dimer, both in sucrose/trehalose films at 75 K. The monomer sample lacked a red-shifted shoulder (Fig. 4b), but in the SP2−ZnPPaM dimer sample, two emission bands were observed at 673 and 692 nm (Fig. 4c), consistent with excitonic coupling. Time-dependent density functional theory calculations were also consistent with excitonic coupling. Using the SP2−ZnPPaM dimer crystal structure (Fig. 3h), we calculated an excitonic coupling energy of 241 cm⁻¹ (Methods). The experimental SP2−ZnPPaM absorption features at 668 and 690 nm correspond to a Q_y peak splitting of 477 cm⁻¹ (a coupling of 239 cm⁻¹), which is consistent with the calculated value. We also simulated absorption and fluorescence spectra in the PigmentHunter application⁵⁴ (dashed lines in Fig. 4b,c and Methods). The simulated dimer spectra successfully reproduced the weak oscillator strength of the lower-energy state and the large shift between the absorption and fluorescence maxima. The close agreement between the calculated and experimental data supports the conclusion that the SP2−ZnPPaM dimer system exhibits excitonic delocalization.

Calculations on the SP1−ZnPPaM dimer crystal structure (Fig. 3e) yielded a lower excitonic coupling of 87 cm⁻¹. Experimental low-temperature spectra of SP1−ZnPPaM in 40% glycerol showed only a modest broadening of the SP1−ZnPPaM dimer fluorescence emission band compared with the monomer, consistent with weaker excitonic coupling (Supplementary Fig. 10).

Further confirmation of excitonic coupling was obtained by comparing experimental and calculated CD spectra based on the ZnPPaM dimer geometries of the design models and crystal structures. We found that the signs of the CD Cotton effects predicted from the crystal structures of SP1 and SP2 were consistent with the experimental signs in the Q_y region. In the SP3 dimer, the calculated spectrum based on the design model agreed with the experimentally measured signs of the CD Cotton effects, suggesting that the P₇₀₀-like ZnPPaM dimer in SP3 assembles as designed. (See Methods and Supplementary Figs. 11–14 for details of CD spectral calculations and simulations of Q_x and Soret bands.)

Special pair proteins as energy transfer acceptors

Native special pairs have critical roles as energy transfer acceptors for antenna proteins. To test whether our designed special pair proteins participate in energy transfer with natural light-harvesting proteins, we analyzed excitation energy transfer in two-dimensional (2D) surface arrays using fluorescence lifetime imaging microscopy (FLIM) along with nanoimprint lithography⁵⁵ (Fig. 5). We selected the cyanobacterial antenna protein CpcA with attached phycoerythrobilin (PEB) as the energy transfer donor. CpcA−PEB was purified from E. coli and had a strong fluorescence emission maximum at 568 nm, with emission extending past 630 nm (ref. ⁵⁶) and overlap** with the excitation spectrum of Zn-pheophorbide a (ZnPPa) when bound to SP2. Q_y peak splitting was not observed in SP2 assembled with ZnPPa instead of ZnPPaM (Fig. 5b), suggesting that the peripheral substituents of chlorin have a role in excitonic coupling. To monitor energy transfer, ~5-μm-wide linear arrays of CpcA−PEB and perpendicular linear arrays of SP2−ZnPPa were applied to a poly(l-lysine)-functionalized glass surface by contact printing⁵⁵, creating intersection points where CpcA−PEB and SP2−ZnPPa interacted and other locations in which only one of the proteins was present. Wide-field epifluorescence imaging with excitation at 450 nm was used to analyze the surface attachment; filtering emission at 620 nm preferentially displayed regions of CpcA−PEB, whereas filtering at 680 nm preferentially showed SP2−ZnPPa. At the intersections between the lines, donor CpcA−PEB emission (620-nm filter) was decreased and SP2−ZnPPa acceptor (680-nm filter) emission was increased, indicating energy transfer from donor to acceptor (Fig. 5c).

**Fig. 5: SP2 functions as an energy transfer acceptor for native light-harvesting proteins.**

To quantify the strength of the interaction between CpcA−PEB and SP2−ZnPPa, we used time-resolved single-photon counting. The surface was illuminated with a 485-nm picosecond laser filtered at 620 nm (donor emission) and photons were counted for individual pixels (surface resolution of approximately 300 nm) over time, allowing both total fluorescence intensity and lifetime to be analyzed (Fig. 5d). In regions with only CpcA−PEB, the fluorescence intensity was greater than 4,000 arbitrary units (a.u.) and the lifetime was more than 2 ns (τ_av = 2,058 ps). Regions with both CpcA−PEB and SP2−ZnPPa show reduced fluorescence intensity (< 1,000 a.u.) and lifetimes under 0.9 ns (τ_av = 839 ps). We estimated the energy transfer efficiency of CpcA−PEB to SP2−ZnPPa in 2D arrays to be 59%, similar to the efficiencies recorded for natural fluorescent proteins^57,58,59 (see Supplementary Fig. 15 for time-resolved photon counting and energy transfer calculation).

Chl-binding supercomplex

For efficient solar energy conversion, nature organizes photosynthetic machinery into specialized compartments such as thylakoids in plants or chromatophore vesicles in purple photosynthetic bacteria⁶⁰. As a first step toward such structures, we sought to incorporate a Chl-binding special pair protein into a two-component supercomplex with octahedral symmetry⁶¹. The C₂ symmetry axes of 12 copies of the SP2 dimer and the C₃ axes of 8 copies of a C₃-symmetric homotrimer⁶² were aligned with the C₂ and C₃ axes of an octahedron. We sampled rotations and translations along these axes to generate a closely packed octahedral model with the C₂ dimers on the edges and the C₃ trimers on the vertices. Interface residues were redesigned to create binding surfaces between the SP2 dimers and the trimers. Twenty-one designs were experimentally characterized, and one was found to assemble into octahedral structures by negative-stain EM (Supplementary Figs. 16 and 17). In this nanocage, the SP2-like component shares 87% sequence identity with the original SP2 design and its absorbance spectrum had a red-shifted shoulder in the Q_y region, similar to the original SP2−ZnPPaM complex (Supplementary Fig. 16).

The cryo-EM structure of the 600-kDa octahedral nanocage with 24 ZnPPaM molecules bound to it was solved to 6.5 Å resolution, with the helices well resolved. The cryo-EM structure is very similar to the computational design model in both the protein−protein interfaces and the overall architecture (Fig. 6a,b and Supplementary Figs. 18 and 19). The asymmetric unit of the cryo-EM structure agrees with the design model to 3.4 Å backbone r.m.s.d. Variability analysis showed several modes of flexibility, which may have limited the resolution (see Supplementary Information for movies of protein breathing motions). Although the resolution is not sufficient to confidently determine the orientations of the Chl molecules, the cryo-EM density map is consistent with the Chl dimer geometry in the SP2 crystal structure (Fig. 6c,d).

**Fig. 6: Cryo-EM structure of a nanocage that assembles Chl dimer proteins.**

Discussion

We describe de novo-designed proteins that hold Chl dimers in precisely defined, closely juxtaposed geometries. We obtained crystal structures of two holo-state designs: the first, SP1, reproduces the binding geometry of the native purple bacterial reaction center special pair with sub-Ångstrom precision, and the second, SP2, has a distinct geometry with the Chl molecules closer together. Our use of symmetry reduces the complexity of the design procedure while ensuring equivalent site energies for the two bound Chl molecules to strengthen excitonic coupling. Symmetry also enables integration of the de novo special pair proteins into larger supercomplexes. Our octahedral nanocage incorporating 12 ZnPPaM dimers is a first step toward de novo design of photosynthetic compartments analogous to thylakoids or chromatophores.

SP2 exhibits spectroscopic hallmarks of native special pairs, including Cotton effects by CD, shifting of absorption and fluorescence bands and energy transfer activity when paired with the native antenna protein CpcA. SP1 exhibits weaker excitonic coupling than SP2 and the BChl special pair of purple photosynthetic bacteria, despite its close structural similarity to the latter. The stronger coupling of SP2 relative to SP1 may reflect the closer spacing of the ZnPPaM molecules in SP2, whereas the stronger coupling of the purple bacterial special pair is likely due to the stronger Q_y transition dipole moment of bacteriochlorins compared with chlorins⁶³. Directed evolution could alter the binding specificity for different types of Chl molecules and increase absorbance band shifts for more effective exciton trap**. Prediction of the spectroscopic properties of a Chl dimer in a protein is complicated by the fact that Chl−Chl coupling energies are typically similar in magnitude to the available thermal energy, Franck−Condon active vibrational and phonon reorganization energies and local Chl vibrational frequencies³³. Accurate optical predictions require benchmarking of theoretical methods using robust model systems. Native photosynthetic proteins can be difficult to isolate and typically contain many interacting pigment molecules, which creates spectral congestion. The highly thermostable, water-soluble Chl dimer proteins described here avoid the complexity of native pigment−protein complexes and provide a testbed to investigate structure−spectrum relationships.

Studies of photosynthetic light harvesting and charge separation indicate that natural photosynthesis leaves room for efficiency improvements^{1,25,64,65,66}. The successful design of excitonically coupled chromophore pairs and their assembly into organized superstructures suggests that de novo protein design could provide a route to new solar-to-fuel energy conversion technologies. With its red-shifted absorbance spectrum, SP2 is well tuned to accept energy from light-harvesting Chl molecules or other pigments (Fig. 5), and it has the long-lived excited state (fluorescence emission lifetime of >3 ns; Supplementary Fig. 9) required to allow electron transfer to occur. To couple light absorption to charge separation for solar fuel production, the next step is to engineer the transfer of the excited-state electron to a low-potential electron acceptor.

Methods

Computational placement of the Chl special pair into symmetric protein scaffolds

Identifying residue positions capable of accommodating the Chl special pair, and that are scalable to millions of potential scaffolds, was achieved by using a motif-hash-based method^34,68 specifically adapted for the His−Chl dimer motif inspired by the special pair of purple bacteria, P₈₆₅. However, the number of example structures of the His−Chl dimer motif found in PDB is not acceptable for effectively populating a motif hash table. Therefore, additional structural examples of the symmetric His−Chl dimer complex were generated de novo.

The conformer generation was achieved using the NeRF algorithm (available on GitHub at https://github.com/atom-moyer/nerf), which translates internal molecular coordinates to global molecular coordinates. Various conformers were generated by varying the internal coordinates, such as the relative positioning of the Chl groups, the dihedral of ligation by the His residue, and the rotamer of the His side chain. The full complex was duplicated along the C₂ axis to create the symmetric complex. If the relative orientations of the Chl molecules were varied, clashes between the rings and their substitutions were evaluated and filtered. The entire process of de novo motif generation was repeated for ligation with the epsilon and delta nitrogen atoms of the imidazole ring.

Once the de novo conformers were generated, the 6D transformation that defines the relative orientation of the N-Cɑ-C atoms of the ligating His residues was hashed using a method described previously^34,68. The hashed 6D transformation was used as a key in a multivalue hash table (https://github.com/atom-moyer/getpy) and the associated value was a vector that defined the information necessary to rebuild the His−Chl complex, with nitrogen from the His residue used for ligation and the internal coordinates of the His rotamer.

During the evaluation of the design scaffolds, the 6D transformation of each symmetric residue pair across chains was evaluated and hashed using the same method used to hash the de novo conformers described above. This allowed the identification of symmetric residue pairs that had similar 6D transformations to the potentially acceptable ligation geometries. If a matching 6D transformation was found, the His−Chl complex was rebuilt from the associated value in the hash table and the complex was evaluated in the context of the protein. If the Chl molecules did not clash with the backbone atoms of the protein, the placement was accepted and passed into the protein design process.

A Python package and example scripts that generate the de novo hash tables and place the His−Chl complexes into symmetric proteins can be found at https://github.com/atommoyer/stapler.

Protein expression and purification

Synthetic genes with N-terminal His₆ tags followed by tobacco etch virus (TEV) protease cleavage sites were purchased in pET29b expression vectors from Integrated DNA Technologies. Plasmids were transformed into Lemo21(DE3) competent E. coli (New England Biolabs). For each protein, a single E. coli colony was grown in a culture of 5 ml LB with 100 μg ml⁻¹ kanamycin overnight at 37 °C. Overnight cultures were used to inoculate 50- to 500-ml cultures of autoinduction medium⁶⁹. Bacteria were grown in autoinduction medium at 37 °C with shaking for 4 h and then incubated with shaking overnight at 18 °C. Bacteria were harvested and resuspended in 300 mM NaCl, 30 mM imidazole, 25 mM Tris buffer at pH 8, ~0.01 mg ml⁻¹ DNase (Sigma-Aldrich), ~0.1 mg ml⁻¹ lysozyme (Sigma-Aldrich) and Pierce protease inhibitor tablets (Thermo Fisher Scientific). Bacteria were lysed by sonication and centrifuged at ~18,000g for 30 min. Soluble fractions were purified using immobilized metal affinity chromatography gravity columns packed with Ni-NTA agarose resin (Qiagen) at room temperature. Columns were washed with a buffer containing 20 mM imidazole and proteins were eluted with 300 mM imidazole buffer. Samples were digested with His-tagged TEV protease in the presence of 0.5 mM dithiothreitol for 1–2 days at room temperature. Digested proteins were buffer exchanged into 20 mM imidazole buffer, 300 mM NaCl and 25 mM Tris buffer at pH 8 and applied to immobilized metal affinity chromatography columns to remove TEV protease and uncleaved protein. Proteins were further purified by SEC using an ÄKTA FPLC instrument with a Superdex 200 Increase 10/300 GL column (GE Healthcare Life Sciences). Protein and Chl molecular weights were verified by reverse-phase liquid chromatography−mass spectrometry (LC−MS) using an Agilent G6230B time-of-flight instrument and AdvanceBio Desalting-RP column. Mass spectra were deconvoluted in BioConfirm using a total entropy algorithm (Supplementary Figs. 20–22).

Protein−Chl sample preparation

ZnPPaM was purchased from Frontier Scientific. ZnPPaM stock solutions were prepared in dimethyl sulfoxide or methanol to concentrations between 200 μM and 1 mM. ZnPPaM concentrations were determined using mass measurements and the known absorptivity of Zn pheophytin a, which has a similar absorbance spectrum and an extinction coefficient ε_659nm of 77,300 M⁻¹ cm⁻¹ in 80% acetone/20% deionized water⁷⁰. UV/vis absorbance spectra were acquired using a Jasco V-750 spectrophotometer with a bandwidth of 1 nm and scanning speed of 400 nm min⁻¹. Protein−ZnPPaM complexes were prepared by slowly adding freshly prepared ZnPPaM stock solution to protein solution in aqueous buffer at room temperature and incubating the samples for several hours. Unbound ZnPPaM was removed by (1) centrifugation to pellet precipitated ZnPPaM and (2) sterile filtration using a 0.22-μm syringe filter and/or by PD-10 desalting column purification (Sephadex G-25 M resin, Cytiva Life Sciences).

CD spectroscopy

CD spectra were collected using a Jasco J-1500 spectrophotometer. For protein secondary structure assays, spectra were measured on samples of 0.2–0.4 mg ml⁻¹ protein in 1-mm quartz cuvettes from 260 to 190 nm with a 1 nm bandwidth, 1-nm data interval, data integration time (DIT) of 1 s and scanning speed of 50 nm min⁻¹. Thermal melts were monitored at 222 nm from 2 to 98 °C with a 2-nm bandwidth and a DIT of 8 s. UV/vis/NIR CD transitions of protein-bound Chl molecules were examined in the 800- to 300-nm region in 1-cm quartz cuvettes as averages of 10 scans using a 3-nm bandwidth, 1-nm data interval, DIT of 4 s and scanning speed of 50 nm min⁻¹, unless otherwise noted. UV/vis/NIR CD spectra shown in Fig. 2 were obtained after sterile filtering using a 0.22-μm filter and PD-10 desalting column purification (Sephadex G-25 M resin, Cytiva Life Sciences). Each spectrum represents the average of two independent sample preparations. Samples contained 8–15 μM protein (monomer concentration) with equimolar ZnPPaM, 150 mM NaCl and 10 mM Tris buffer at pH 8. ZnPPaM dry powder was dissolved in methanol stock solutions immediately before its addition to protein solutions. Samples were allowed to incubate for 8–16 h at room temperature before the spectra were measured.

SAXS

Data were collected at the Advanced Light Source (ALS) at Lawrence Berkeley National Laboratory using the SIBYLS beamline for high-throughput SAXS⁷¹. Proteins were sent as 30-μl samples in 96-well plates with buffer-matched blank solutions for background subtraction. Datasets were processed in SAXS Frameslice v1.4.13 and compared with the design models using FoXS^38,39.

Fluorescence quantum yield measurements

Fluorescence spectra displayed in Supplementary Fig. 7 were recorded on a Fluorolog Horiba Jobin Yvon spectrofluorimeter equipped with a xenon lamp, a double monochromator and a photomultiplier detector. The experiments were carried out in a right-angle configuration. Each baseline-subtracted fluorescence spectrum was corrected for the spectral sensitivity of the fluorimeter and reabsorption by assuming that the middle of the cuvette is the origin of emission. Relative quantum yields were estimated using Chl a in diethyl ether as a ref. ⁷².

Low-temperature absorbance and fluorescence spectroscopy in a sucrose/trehalose film

Solutions of SP2x−ZnPPaM were mixed with a saturated sugar solution made by dissolving 50:50 sucrose/trehalose (w/w) in distilled water as described previously⁷³. A 100-μl sample of SP2 at 34 mg ml⁻¹ (ZnPPaM dimer) or 156 mg ml⁻¹ (ZnPPaM monomer) was added dropwise to 100 μl of the sugar solution and gently mixed. The sugar/protein mixture was dropped onto a 0.1-mm quartz cuvette (Starna Cells) and kept under vacuum in the dark for 24 h. The sample was then loaded into a Janis ST-100 cryostat using a custom-built copper cuvette holder and cooled with liquid nitrogen. A Lakeshore 330 autotuning temperature controller was used to control the temperature. An Agilent Cary-60 spectrometer was used to collect absorbance spectra at different temperatures. For the temperature-dependent fluorescence emission spectra, we used a home-built setup equipped with a Thorlabs 405-nm laser head (CPS405). The collected emission was fiber-coupled into a Flame Ocean Optics spectrometer. Lifetimes were recorded using a home-built all-reflective epifluorescence system. The samples were excited via a pulsed laser output from a 405-nm pulsed diode laser (LDH-P-C-405, PicoQuant) with a repetition rate of 10 MHz. The emission was subsequently filtered using a 420-nm long-pass dichroic beam splitter (DMLP425R, Thorlabs) and a 420-nm long-pass filter (10CGA-420, Newport). Emission was detected by avalanche photodiodes (PD050-CTD, Micro Photon Devices). Time-correlated single-photon counting traces were histogrammed using a Picoquant HydraHarp 400 and analyzed using the corresponding software.

Molecular dynamics simulations

All molecular dynamics simulations were performed with Amber18 (ref. ⁷⁴) using the ff14SB forcefield⁷⁵ for proteins and TIP3P⁷⁶ for water. To obtain the forcefield parameters of the chromophore (ZnPPaM), we used the MCPB.py module of Amber⁷⁷. Atomic charges were calculated using the restrained electrostatic potential (ESP) fitting scheme, while force constants were calculated using the Seminario method. Quantum mechanical geometry optimization and ESP calculations were performed using Gaussian 16 (rev B.01)⁷⁸ at the B3LYP/6–31 G* level. Parameters for the organic part of the chromophore were obtained from the general AMBER forcefield.

As starting structures, either design models or crystal structures were used. Protonation states were determined using the H++ webserver at pH 8 (using default parameters)^79,80,81. Topology and geometry files were generated using LEaP with an isometric truncated-octahedron shape for the periodic box and a minimum distance of 1.5 nm between the protein and the edges of the box. Protein charges were neutralized with Na⁺ and Cl⁻ ions.

Minimization and initial equilibration steps were performed following a recently developed protocol⁸². Briefly, the protocol consists of nine sequential energy minimizations and short molecular dynamics runs, which sum to 4,000 steps of minimization and 40,000 molecular dynamics steps (totaling 30 ps), followed by a final molecular dynamics equilibration (500,000 steps, 1,000 ps). Then, after discarding the first 200 ns, production runs were done in the isothermal−isobaric (NPT) ensemble at 300.0 K with a time step of 2 fs, and bonds involving hydrogen atoms were constrained via the SHAKE algorithm. Constant temperature and pressure were ensured using the Langevin thermostat (collision frequency, 2 ps⁻¹) and Monte Carlo barostat, respectively. Long-range electrostatics were considered via the particle mesh Ewald model, setting the direct space sum cutoff to 1.0 nm.

Calculation of Chl dimer excitonic coupling and spectra

Calculations involving excited states were performed on the chromophore geometries of the design models and crystal structures. In the latter case, hydrogen atoms were added using UCSF Chimera 1.11 (ref. ⁸³). Electronic couplings were calculated using the EET (electronic energy transfer) module from Gaussian, at the CAM-B3LYP/6-31 G* level. Environmental effects were considered through the polarizable continuum model^84,85, choosing n-octanol as representative of the protein dielectric behavior. The EET analysis considered six singlet excited states per chromophore.

To obtain CD spectra, we used the results of the EET calculations and the Excitonic Analysis Tool program^86,87,88. Rotatory strengths were calculated by considering both electric and magnetic dipoles in the velocity formulation. Spectral line shapes were simulated as Gaussians, with a full-width at half-maximum of 350 cm⁻¹ for the Q_y and Q_x transitions and 1,150 cm⁻¹ for transitions in the Soret region. The spectra were shifted by −0.25 eV to reproduce the experimental position of the Q_y band.

Simulated absorption and fluorescence spectra in Fig. 4 were calculated in the online PigmentHunter application, which uses the standard Frenkel exciton approach of constructing and diagonalizing an excitonic Hamiltonian to yield system transition dipoles and frequencies. Explicit expressions for absorption and fluorescence spectra are reported in ref. ⁸⁹, with the exception that PigmentHunter uses a single preset (temperature-dependent) spectral line shape for each excitonic transition. Line shape parameters for Fig. 4 were calculated at 75 K using the experimentally determined vibrational and phonon densities for Chl a in the CP29 complex⁹⁰. Simulations in Fig. 4 are ensemble averaged over 10⁶ realizations of pigment site energies chosen randomly from a Gaussian distribution with full-width at half-maximum of 500 cm⁻¹; calculated spectra are convolved with a 10 cm⁻¹ Gaussian to eliminate high-frequency noise due to finite statistical sampling. To match the experimental data, mean pigment site energies were set to 14,925 cm⁻¹ for monomer (PL complex) and 14,750 cm⁻¹ for dimer (PL₂). Pigment transition dipoles were calculated from chains A and B of the experimental crystal structure using the transition ESP method and parameters for Chl a⁹¹. For consistency with the calculated CD spectra described above, the intersite coupling was rescaled by a factor of 1.212 to a value of 241 cm⁻¹. (PigmentHunter’s automatically calculated intersite transition ESP coupling, which includes an empirical scaling factor to account for protein dielectric effects, is slightly smaller, at 199 cm⁻¹.)

FLIM

FLIM was conducted on a home-built laser scanning time-resolved fluorescence microscope, as described previously⁵⁵. The microscope was equipped with a 485-nm picosecond diode laser (PicoQuant, PDL 828) and a 450-nm LED (Thorlabs, M470L2) (wide-field illumination) as excitation sources. The excitation light was focused by a ×100 objective (PlaneFluorite, NA = 1.4, oil immersion, Olympus). The emitted light was filtered using a 495-nm dichroic beam splitter (Semrock) and 565/25-nm, 630/20-nm and 680/45-nm bandpass filters (Semrock) to remove the background excitation light. The microscope was fitted with a spectrometer (150 lines/mm grating, Acton SP2558, Princeton Instruments) and an electron-multiplying charge-coupled device camera (ProEM 512, Princeton Instruments) for emission spectrum acquisition and wide-field imaging. A hybrid detector (HPM-100-50, Becker & Hickl) was used for single-photon counting. The modulation of the excitation laser was synchronized with a time-correlated single-photon counting module (SPC-150, Becker & Hickl) for the lifetime decay measurement. The repetition rate of the laser was set at 1 MHz. The excitation laser power was adjusted to produce a fluence of approximately 2 × 10¹⁴ photons per pulse per cm. The instrument response function of the setup was approximately 130 ps. Fluorescence lifetime images were recorded by scanning the excitation laser over the sample using a piezo scanner. FLIM data were analyzed using OriginPro (OriginLab Corporation) and FLIMfit (www.flimfit.org).

X-ray crystallography of SP1 and SP2

Crystals of SP1 and SP2 were grown using protein purified as described above. Protein samples dispensed in 1-μl drops at purification concentrations were mixed with an equal volume of a crystallization solution and set in hanging drops (refer to Supplementary Table 4 for conditions). Vapor-phase equilibration of the resulting drops against a 1-ml reservoir of the same crystallization solution resulted in the growth of crystals. The crystals were flash-cooled in liquid nitrogen. Diffraction data were collected on a Pilatus area detector at the ALS synchrotron facility at beamline 5.0.2 for SP1−ZnPPaM and SP2−ZnPPaM protein assemblies. Diffraction data for SP2 were collected using a Rigaku HyPix-6000HE hybrid photon counting detector at the Fred Hutchinson Cancer Center. The resulting datasets (Supplementary Table 4) extend to 2.0 Å, 2.4 Å and 2.5 Å resolution for SP1−ZnPPaM, apo-state SP2 and SP2−ZnPPaM, respectively. The asymmetric units of the SP1−ZnPPaM and apo-state SP2 structures each contained one complete dimer (two copies of a protein subunit) and the SP2−ZnPPaM structure had two dimers in the asymmetric unit.

Data were processed using HKL2000 (ref. ⁹²) or Aimless⁹³. Subunits were placed by using the molecular replacement algorithm in PHENIX⁹⁴. Local rebuilding of all constructs was performed using Coot⁹⁵, followed by refinement in PHENIX⁹⁴. For the ZnPPaM-bound structures, the protein was built and refined completely with water molecules (excluding water molecules from the binding site) and other chemicals before manually fitting ZnPPaM into the remaining density. ZnPPaM energies were calculated using eLBOW⁹⁶. The final values for R_work/R_free are notated in Supplementary Table 4.

X-ray crystallography of SP3x

All crystallization experiments for the SP3x protein were conducted using the sitting drop vapor diffusion method. Crystallization trials were set up in 200-nl drops using the 96-well plate format at 20 °C. Crystallization plates were set up using a mosquito crystal from SPT Labtech and then imaged using UVEX microscopes and UVEX PS-600 from JAN Scientific. Diffraction-quality SP3x crystals formed in 2.4 M sodium malonate dibasic monohydrate pH 7.0.

Diffraction data were collected at the ALS at beamline 5.0.1. X-ray intensities and data reduction were evaluated and integrated using XDS⁹⁷ and merged/scaled using Pointless/Aimless in the CCP4 suite⁹⁸. Structure determination and refinement starting phases were obtained by molecular replacement using Phaser⁹⁹ and the designed model for the structures. Following molecular replacement, the models were improved using phenix.autobuild⁹⁴; efforts were made to reduce model bias by setting rebuild-in-place to false and using simulated annealing and prime-and-switch phasing. The structures were refined in Phenix⁹⁴. Model building was performed using Coot¹⁰⁰. The final model was evaluated using MolProbity¹⁰¹. Data collection and refinement statistics are recorded in Supplementary Table 4. Data deposition, atomic coordinates and structural factors reported for the SP3x protein in this paper have been deposited in the PDB, http://www.rcsb.org/, with accession code 8EVM.

Protein structure alignment

Protein crystal structures were compared to Rosetta design models by aligning corresponding backbone Cα atoms and calculating r.m.s.d. using TM-align¹⁰². BChl or Chl special pair geometries were compared using the align function in the PyMOL Molecular Graphics System (v2.5.2; Schrödinger). To facilitate comparison of the geometries of special pairs composed of different BChl or Chl types, any unimportant conformational differences, such as the rotameric states of peripheral substituents, were omitted and differences in the Mg(II) versus Zn(II) positions were neglected, so that only the atoms of the tetrapyrrole rings were considered in pairwise special pair structural alignments. These atoms included the 4 pyrrole nitrogen atoms, 16 pyrrole carbon atoms and 4 methine bridge carbon atoms from each BChl or Chl monomer, giving 48 atoms per dimer that were used for structural comparisons. Corresponding atoms were aligned in PyMOL and the r.m.s.d. over all 48 atom pairs was calculated. Native BChl a special pairs used for comparison with the SP1 protein came from five different species of purple photosynthetic bacteria, including Rhodobacter sphaeroides, Rhodopseudomonas palustris, Thermochromatium tepidum, Gemmatimonas phototrophica and Thiorhodovibrio strain 970. The PDB IDs of the nine X-ray crystal and cryo-EM structures containing the native special pairs used for comparison with SP1 were as follows: 7PIL, 7VNY, 6Z27, 6Z02, 6Z5S, 3WMM, 5Y5S, 7O0U and 7C9R (refs. ^{44,45,46,47,48,49,50,51}).

Nanocage design

The Chl-binding dimer SP2 was docked against a library of trimeric cyclic oligomer scaffolds (C₃) from previous de novo designs^34,62,103 to form octahedral cages (O₃₂) using RPXDock software¹⁰⁴. The RPXDock package uses a hierarchical sampling strategy to search for interfaces with high shape complementarity based on residue pair transform scoring. The ten docking configurations with the best score for each scaffold were subsequently sequence designed by symmetric RosettaDesign calculations, using a previously reported protocol⁶¹ to carry out two-component protein−protein interface design. Briefly, we aimed to design low-energy, well-packed hydrophobic protein−protein interfaces where protein building blocks are treated as rigid backbones and only side chain rotamers of interface residues are packed with layer design restrictions. beta_nov16 or a clash-fixed score function was used during the design. Finally, all cage designs were filtered on the basis of shape complementarity (>0.6), interface surface area (solvent-accessible surface area, 1,000 < SASA < 1,600), predicted binding energy (ddG <−20 kcal mol⁻¹), buried unsatisfied hydrogen bonds (uhb <3) and clash check (< 3). All Rosetta scripts used are available upon request.

Transmission negative-stain EM and image processing

SEC-purified cage fractions were diluted to about 0.5 µM (monomeric component concentration) for negative-stain EM characterization. Briefly, on a glow-discharged formvar/carbon supported 400-mesh copper grid (Ted Pella), 6 μl of protein sample was drop-casted for 2 min. The grid was blotted and stained with 3 μl of 2% uranyl formate, blotted again and then stained with 3 μl of uranyl formate for 20 s before final blotting. Micrographs of stained samples were acquired using a 120-kV Talos L120C transmission electron microscope. All negative-stain EM datasets were collected using EPU software and processed using cryoSPARC¹⁰⁵ with contrast transfer function (CTF) correction. All the particle picks were 2D classified for 20 iterations into 50 classes. Particles from selected classes were used to build the ab initio initial model. The initial model was homogeneously refined using C₁ and the corresponding O symmetry.

Cryo-EM grid preparation and data collection

Grids (QUANTIFOIL R 2/2 on Cu 300 mesh grids + 2 nm C) were vitrified using a Vitrobot Mark IV with a chamber maintained at 22 °C and 100% humidity. Grids were plunge-frozen into liquid ethane directly following application for 5 s of 3.5 μl of the ZnPPaM-loaded nanocage to the glow-discharged surface of the grid. Grids were screened at the New York University cryo-EM core facility using a Talos Arctica microscope operated at 200 kV equipped with a Gatan K3 camera. Data were then acquired using a Titan Krios microscope operated at 300 kV equipped with a Gatan K3 camera with a BioQuantum imaging filter (‘Krios 2’; New York Structural Biology Center). Data were acquired from duplicate grids using Leginon¹⁰⁶ and preprocessed (2⨯ binned and motion-corrected with MotionCor2 (ref. ¹⁰⁷)) within Appion¹⁰⁸. Full data collection parameters are shown in Supplementary Table 5.

Cryo-EM data processing and model building

Aligned and dose-weighted micrographs were imported to cryoSPARC v.3 (ref. ¹⁰⁵) and processed using the workflow shown in Supplementary Fig. 18. During data collection, we noted a high proportion of damaged (compressed or fragmented) nanocage particles in areas of ice with a reported thickness of less than 40 nm. Curation of micrographs to exclude those with the thinnest ice and with CTF fit resolution lower than ~6 Å facilitated selection of intact nanocage particles. 2D classification was performed on manually selected particles to generate templates representing diverse views of the nanocage, but subsequent template-based picking tended to exclude rare particle views. Recovery of these rare views was improved by using a single template for picking representing the view most often missed in prior template-based picking efforts (Supplementary Table 6). Compared with picking using multiple templates, using a single, rare-view template improved the recovery of diverse particle views, which were then used as a training set for Topaz¹⁰⁹. Picking with Topaz yielded diverse, well-centered nanocage particles. Data from each of the two imaged grids were picked separately using Topaz and the curated particles were then combined and further curated in 2D. This larger set of curated particles was used to retrain Topaz (204,039 versus 19,355 in the initial Topaz training set) on the full set of micrographs from both grids. Particles picked using this Topaz model were then curated by 2D classification, micrograph curation by ice thickness and CTF fit values and removal of duplicates.

A 200-micrograph subset from a single grid was used to generate an ab initio 3D reconstruction. Following iterative rounds of homogeneous and heterogeneous refinement, this map served as the initial 3D model for processing the full particle set from both grids. 3D refinement and classification yielded a map of the full nanocage with an average reported resolution of ~6.5 Å (as calculated in cryoSPARC using a gold-standard FSC cutoff of 0.143). O symmetry was imposed during the final round of refinement. Continuous conformational heterogeneity likely limited the resolution of the full nanocage map because discrete states were not readily separable by further 3D classification. Multiple modes of flexibility were visualized using cryoSPARC’s 3D variability analysis¹¹⁰, supporting the notion that the nanocage particles used in refinement were subject to compression/deformation (see Supplementary Information for movies of protein breathing motions). We then used partial signal subtraction and focused refinement to improve the resolution in the ligand-binding region of the cage (region enclosed in yellow mask, Supplementary Fig. 18 inset). Before partial signal subtraction, particles were expanded with T symmetry (the highest-order symmetry containing a complete Chl-binding dimer). The symmetry-expanded, partially subtracted particle set was then refined in C₁ using local refinement in cryoSPARC.

The cryo-EM map of the full nanocage was used for real-space refinement of a model in Phenix¹¹¹. Due to the intermediate map resolution, all residues were modeled as alanine and restraints (secondary structure, Ramachandran and noncrystallographic symmetry constraints) were imposed during refinement. The designed nanocage model was used as a starting point for refinement and individual chains were docked into cryo-EM maps using Chimera⁸³ before hydrogen removal and truncation to polyalanine using phenix.pdbtools. Stubbed, docked models were then subjected to restrained real-space refinement in Phenix. We observed a notable difference between the design model and the cryo-EM density in the angle between each trimeric interface helix and its attached DHR ‘arm’. To generate a starting model for restrained refinement of the full nanocage, we first performed rigid-body refinement, with each trimer subunit modeled as two rigid bodies (corresponding to the interface helix and ‘arm’ regions; residues 259–337 and 1–258, respectively). Cryo-EM model statistics are listed in Supplementary Table 7.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

X-ray crystallographic coordinates and data files of designed special pair dimer proteins were deposited at the PDB with accession codes 7UNJ (SP1 with ZnPPaM bound), 7UNH (SP2, apo state), 7UNI (SP2 with ZnPPaM bound) and 8EVM (SP3x, apo-state). All previously published high-resolution structures referenced in this manuscript, including the Blastochloris viridis LH1−RC complex (PDB 6ET5), are available at the PDB. An electron microscopy map of the full ZnPPaM-binding nanocage (map 1) was deposited in the EMDB with accession code EMD-40208 and a backbone model was deposited in the PDB with accession code 8GLT. An electron microscopy map of the ZnPPaM-binding region of the nanocage (map 2) was deposited in the EMDB with the accession code EMD-40209. Computational data related to molecular dynamics simulations and CD calculations have been deposited in the ioChem-BD database¹¹² and are accessible at https://doi.org/10.19061/iochem-bd-6-268. Source data are provided with this paper.

Code availability

The Rosetta macromolecular modeling suite (https://www.rosettacommons.org) is freely available to academic and noncommercial users. Commercial licenses for the suite are available from the University of Washington Technology Transfer Office. A Python package and example scripts for Chl docking can be found at: https://github.com/atommoyer/stapler. RPXDock scripts and computational methods are available on GitHub at https://github.com/willsheffler/rpxdock.

References

Romero, E., Novoderezhkin, V. I. & van Grondelle, R. Quantum design of photosynthesis for bio-inspired solar-energy conversion. Nature 543, 355–365 (2017).
CAS PubMed Google Scholar
Croce, R. & van Amerongen, H. Natural strategies for photosynthetic light harvesting. Nat. Chem. Biol. 10, 492–501 (2014).
CAS PubMed Google Scholar
Mirkovic, T. et al. Light absorption and energy transfer in the antenna complexes of photosynthetic organisms. Chem. Rev. 117, 249–293 (2017).
CAS PubMed Google Scholar
Şener, M. et al. Förster energy transfer theory as reflected in the structures of photosynthetic light-harvesting systems. Chemphyschem 12, 518–531 (2011).
PubMed PubMed Central Google Scholar
Gorka, M. et al. Shedding light on primary donors in photosynthetic reaction centers. Front. Microbiol. 12, 735666 (2021).
PubMed PubMed Central Google Scholar
Swainsbury, D. J. K., Qian, P., Hitchcock, A. & Hunter, C. N. The structure and assembly of reaction centre-light-harvesting 1 complexes in photosynthetic bacteria. Biosci. Rep. 43, BSR20220089 (2023).
CAS PubMed PubMed Central Google Scholar
Sener, M. K. et al. Robustness and optimality of light harvesting in cyanobacterial photosystem I. J. Phys. Chem. B 106, 7948–7960 (2002).
CAS Google Scholar
Wraight, C. A. & Clayton, R. K. The absolute quantum efficiency of bacteriochlorophyll photooxidation in reaction centres of Rhodopseudomonas spheroides.Biochim. Biophys. Acta 333, 246–260 (1974).
CAS PubMed Google Scholar
Ferretti, M. et al. The nature of coherences in the B820 bacteriochlorophyll dimer revealed by two-dimensional electronic spectroscopy. Phys. Chem. Chem. Phys. 16, 9930–9939 (2014).
CAS PubMed Google Scholar
Bednarczyk, D. et al. Fine tuning of chlorophyll spectra by protein-induced ring deformation. Angew Chem. Int. Ed. Engl. 55, 6901–6905 (2016).
CAS PubMed PubMed Central Google Scholar
Pieper, J. et al. Excitonic energy level structure and pigment−protein interactions in the recombinant water-soluble chlorophyll protein. II. Spectral hole-burning experiments. J. Phys. Chem. B 115, 4053–4065 (2011).
CAS PubMed Google Scholar
Srivastava, A., Ahad, S., Wat, J. H. & Reppert, M. Accurate prediction of mutation-induced frequency shifts in chlorophyll proteins with a simple electrostatic model. J. Chem. Phys. 155, 151102 (2021).
CAS PubMed Google Scholar
Sharma, V. K. et al. Dimeric corrole analogs of chlorophyll special pairs. J. Am. Chem. Soc. 143, 9450–9460 (2021).
CAS PubMed PubMed Central Google Scholar
Kobuke, Y. & Miyaji, H. Supramolecular organization of imidazolyl-porphyrin to a slipped cofacial dimer. J. Am. Chem. Soc. 116, 4111–4112 (1994).
CAS Google Scholar
Wasielewski, M. R., Studier, M. H. & Katz, J. J. Covalently linked chlorophyll α dimer: a biomimetic model of special pair chlorophyll. Proc. Natl Acad. Sci. USA 73, 4282–4286 (1976).
CAS PubMed PubMed Central Google Scholar
Boxer, S. G. & Closs, G. L. A covalently bound dimeric derivative of pyrochlorophyllide α. A possible model for reaction center chlorophyll. J. Am. Chem. Soc. 98, 5406–5408 (1976).
CAS Google Scholar
McCleese, C. et al. Excitonic interactions in bacteriochlorin homo-dyads enable charge transfer: a new approach to the artificial photosynthetic special pair. J. Phys. Chem. B 122, 4131–4140 (2018).
CAS PubMed PubMed Central Google Scholar
Kodali, G. et al. Design and engineering of water-soluble light-harvesting protein maquettes. Chem. Sci. 8, 316–324 (2017).
CAS PubMed Google Scholar
Farid, T. A. et al. Elementary tetrahelical protein design for diverse oxidoreductase functions. Nat. Chem. Biol. 9, 826–833 (2013).
CAS PubMed PubMed Central Google Scholar
Ennist, N. M. et al. Maquette strategy for creation of light- and redox-active proteins. in Photosynthesis and Bioenergetics (eds. Barber, J. & Ruban, A. V.) 1–33 (World Scientific Publishers, 2017).
Ennist, N. M. et al. De novo protein design of photochemical reaction centers. Nat. Commun. 13, 4937 (2022).
Moser, C. C. et al. De novo construction of redox active proteins. Methods Enzymol. 580, 365–388 (2016).
Fry, H. C., Lehmann, A., Saven, J. G., DeGrado, W. F. & Therien, M. J. Computational design and elaboration of a de novo heterotetrameric α-helical protein that selectively binds an emissive abiological (porphinato)zinc chromophore. J. Am. Chem. Soc. 132, 3997–4005 (2010).
CAS PubMed PubMed Central Google Scholar
Pirro, F. et al. Allosteric cooperation in a de novo-designed two-domain protein. Proc. Natl Acad. Sci. USA 117, 33246–33253 (2020).
CAS PubMed PubMed Central Google Scholar
Ennist, N. M., Stayrook, S. E., Dutton, P. L. & Moser, C. C. Rational design of photosynthetic reaction center protein maquettes. Front. Mol. Biosci. 9, 997295 (2022).
CAS PubMed PubMed Central Google Scholar
Polizzi, N. F. et al. De novo design of a hyperstable non-natural protein–ligand complex with sub-Å accuracy.Nat. Chem. 9, 1157–1164 (2017).
CAS PubMed PubMed Central Google Scholar
Cohen-Ofri, I. et al. Zinc-bacteriochlorophyllide dimers in de novo designed four-helix bundle proteins. A model system for natural light energy harvesting and dissipation. J. Am. Chem. Soc. 133, 9526–9535 (2011).
CAS PubMed Google Scholar
Wahadoszamen, M., Margalit, I., Ara, A. M., van Grondelle, R. & Noy, D. The role of charge-transfer states in energy transfer and dissipation within natural and artificial bacteriochlorophyll proteins. Nat. Commun. 5, 5287 (2014).
CAS PubMed Google Scholar
Rabanal, F., DeGrado, W. F. & Dutton, P. L. Toward the synthesis of a photosynthetic reaction center maquette: a cofacial porphyrin pair assembled between two subunits of a synthetic four-helix bundle multiheme protein. J. Am. Chem. Soc. 118, 473–474 (1996).
CAS Google Scholar
Curti, M. et al. Engineering excitonically-coupled dimers in an artificial protein for light harvesting via computational modelling. Protein Sci. 32, e4579 (2023).
CAS PubMed PubMed Central Google Scholar
Gisriel, C. et al. Structure of a symmetric photosynthetic reaction center-photosystem. Science 357, 1021–1025 (2017).
CAS PubMed Google Scholar
Chen, J.-H. et al. Architecture of the photosynthetic complex from a green sulfur bacterium. Science 370, eabb6350 (2020).
CAS PubMed Google Scholar
Reppert, M. Bioexcitons by design: how do we get there? J. Phys. Chem. B 127, 1872–1879 (2023).
CAS PubMed Google Scholar
Fallas, J. A. et al. Computational design of self-assembling cyclic protein homo-oligomers. Nat. Chem. 9, 353–360 (2017).
CAS PubMed Google Scholar
Hicks, D. R. et al. De novo design of protein homodimers containing tunable symmetric protein pockets. Proc. Natl Acad. Sci. USA 119, e2113400119 (2022).
CAS PubMed PubMed Central Google Scholar
Brunette, T. J. et al. Modular repeat protein sculpting using rigid helical junctions. Proc. Natl Acad. Sci. USA 117, 8870–8875 (2020).
CAS PubMed PubMed Central Google Scholar
Maguire, J. B. et al. Perturbing the energy landscape for improved packing during computational protein design. Proteins 89, 436–449 (2021).
CAS PubMed Google Scholar
Schneidman-Duhovny, D., Hammel, M., Tainer, J. A. & Sali, A. FoXS, FoXSDock and MultiFoXS: single-state and multi-state structural modeling of proteins and their complexes based on SAXS profiles. Nucleic Acids Res. 44, W424–W429 (2016).
CAS PubMed PubMed Central Google Scholar
Schneidman-Duhovny, D., Hammel, M., Tainer, J. A. & Sali, A. Accurate SAXS profile computation and its assessment by contrast variation experiments. Biophys. J. 105, 962–974 (2013).
CAS PubMed PubMed Central Google Scholar
Svergun, D. I. et al. Protein hydration in solution: experimental observation by X-ray and neutron scattering. Proc. Natl Acad. Sci. USA 95, 2267–2272 (1998).
CAS PubMed PubMed Central Google Scholar
Kim, H. S. et al. SAXS/SANS on supercharged proteins reveals residue-specific modifications of the hydration shell. Biophys. J. 110, 2185–2194 (2016).
CAS PubMed PubMed Central Google Scholar
Lindorfer, D., Müh, F. & Renger, T. Origin of non-conservative circular dichroism of the CP29 antenna complex of photosystem II. Phys. Chem. Chem. Phys. 19, 7524–7536 (2017).
CAS PubMed Google Scholar
Lin, X. et al. Specific alteration of the oxidation potential of the electron donor in reaction centers from Rhodobacter sphaeroides. Proc. Natl Acad. Sci. USA 91, 10265–10269 (1994).
CAS PubMed PubMed Central Google Scholar
Cao, P. et al. Structural basis for the assembly and quinone transport mechanisms of the dimeric photosynthetic RC–LH1 supercomplex. Nat. Commun. 13, 1977 (2022).
CAS PubMed PubMed Central Google Scholar
Niwa, S. et al. Structure of the LH1–RC complex from Thermochromatium tepidum at 3.0 Å. Nature 508, 228–232 (2014).
CAS PubMed Google Scholar
Qian, P. et al. Cryo-EM structure of the monomeric Rhodobacter sphaeroides RC–LH1 core complex at 2.5 Å. Biochem. J. 478, 3775–3790 (2021).
CAS PubMed Google Scholar
Qian, P. et al. 2.4-Å structure of the double-ring Gemmatimonas phototrophica photosystem. Sci. Adv. 8, eabk3139 (2022).
CAS PubMed PubMed Central Google Scholar
Selikhanov, G., Fufina, T., Vasilieva, L., Betzel, C. & Gabdulkhakov, A. Novel approaches for the lipid sponge phase crystallization of the Rhodobacter sphaeroides photosynthetic reaction center. IUCrJ 7, 1084–1091 (2020).
CAS PubMed PubMed Central Google Scholar
Swainsbury, D. J. K. et al. Structures of Rhodopseudomonas palustris RC-LH1 complexes with open or closed quinone channels. Sci. Adv. 7, eabe2631 (2021).
CAS PubMed PubMed Central Google Scholar
Tani, K. et al. Cryo-EM structure of a Ca²⁺-bound photosynthetic LH1-RC complex containing multiple αβ-polypeptides. Nat. Commun. 11, 4955 (2020).
CAS PubMed PubMed Central Google Scholar
Yu, L.-J., Suga, M., Wang-Otomo, Z.-Y. & Shen, J.-R. Structure of photosynthetic LH1–RC supercomplex at 1.9 Å resolution. Nature 556, 209–213 (2018).
CAS PubMed Google Scholar
Taylor, N. & Kassal, I. Why are photosynthetic reaction centres dimeric? Chem. Sci. 10, 9576–9585 (2019).
CAS PubMed PubMed Central Google Scholar
van Amerongen, H., Valkunas, L. & van Grondelle, R. Photosynthetic Excitons (World Scientific, 2000).
Ahad, S., Lin, C. & Reppert, M. Photosynthetic Protein Spectroscopy Lab. nanoHUB https://doi.org/10.21981/0NQJ-NZ11 (2023).
Huang, X., Vasilev, C. & Hunter, C. N. Excitation energy transfer between monomolecular layers of light harvesting LH2 and LH1-reaction centre complexes printed on a glass substrate. Lab Chip 20, 2529–2538 (2020).
CAS PubMed Google Scholar
Barnett, S. F. H. et al. Repurposing a photosynthetic antenna protein as a super-resolution microscopy label. Sci. Rep. 7, 16807 (2017).
PubMed PubMed Central Google Scholar
Duncan, R. R., Bergmann, A., Cousin, M. A., Apps, D. K. & Shipston, M. J. Multi-dimensional time-correlated single photon counting (TCSPC) fluorescence lifetime imaging microscopy (FLIM) to detect FRET in cells. J. Microsc. 215, 1–12 (2004).
CAS PubMed PubMed Central Google Scholar
Becker, W. et al. Fluorescence lifetime imaging by time-correlated single-photon counting. Microsc. Res. Tech. 63, 58–66 (2004).
CAS PubMed Google Scholar
Tramier, M. et al. Picosecond-hetero-FRET microscopy to probe protein-protein interactions in live cells. Biophys. J. 83, 3570–3577 (2002).
CAS PubMed PubMed Central Google Scholar
Singharoy, A. et al. Atoms to phenotypes: molecular design principles of cellular energy metabolism. Cell 179, 1098–1111 (2019).
CAS PubMed PubMed Central Google Scholar
King, N. P. et al. Accurate design of co-assembling multi-component protein nanomaterials. Nature 510, 103–108 (2014).
CAS PubMed PubMed Central Google Scholar
Boyken, S. E. et al. De novo design of protein homo-oligomers with modular hydrogen-bond network-mediated specificity. Science 352, 680–687 (2016).
CAS PubMed PubMed Central Google Scholar
Knox, R. S. & Spring, B. Q. Dipole strengths in the chlorophylls. Photochem. Photobiol. 77, 497–501 (2003).
CAS PubMed Google Scholar
Blankenship, R. E. et al. Comparing photosynthetic and photovoltaic efficiencies and recognizing the potential for improvement. Science 332, 805–809 (2011).
CAS PubMed Google Scholar
Barber, J. & Tran, P. D. From natural to artificial photosynthesis. J. R. Soc. Interface 10, 20120984 (2013).
PubMed PubMed Central Google Scholar
Hitchcock, A. et al. Redesigning the photosynthetic light reactions to enhance photosynthesis—the PhotoRedesign consortium. Plant J. 109, 23–34 (2022).
CAS PubMed Google Scholar
Qian, P., Siebert, C. A., Wang, P., Canniffe, D. P. & Hunter, C. N. Cryo-EM structure of the Blastochloris viridis LH1–RC complex at 2.9 Å. Nature 556, 203–208 (2018).
CAS PubMed Google Scholar
Yao, S. et al. De novo design and directed folding of disulfide-bridged peptide heterodimers. Nat. Commun. 13, 1539 (2022).
CAS PubMed PubMed Central Google Scholar
Studier, F. W. Protein production by auto-induction in high density shaking cultures. Protein Expr. Purif. 41, 207–234 (2005).
CAS PubMed Google Scholar
Jones, I. D., White, R. C., Gibbs, E. & Butler, L. S. Estimation of zinc pheophytins, chlorophylls, and pheophytins in mixtures in diethyl ether or 80% acetone by spectrophotometry and fluorometry. J. Agric. Food Chem. 25, 146–149 (1976).
CAS PubMed Google Scholar
Dyer, K. N. et al. High-throughput SAXS for the characterization of biomolecules in solution: a practical approach. Methods Mol. Biol. 1091, 245–258 (2014).
CAS PubMed PubMed Central Google Scholar
Weber, G. & Teale, F. W. J. Determination of the absolute quantum yield of fluorescent solutions. Trans. Faraday Soc. 53, 646–655 (1957).
CAS Google Scholar
Caram, J. R. et al. Room-temperature micron-scale exciton migration in a stabilized emissive molecular aggregate. Nano Lett. 16, 6808–6815 (2016).
CAS PubMed Google Scholar
Case, D. A. et al. AMBER 2018 http://ambermd.org/ (University of California, San Francisco, 2018).
Maier, J. A. et al. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 11, 3696–3713 (2015).
CAS PubMed PubMed Central Google Scholar
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
CAS Google Scholar
Li, P. & Merz, K. M. Jr. MCPB.py: a python based metal center parameter builder. J. Chem. Inf. Model. 56, 599–604 (2016).
CAS PubMed Google Scholar
Frisch, M. J. et al. Gaussian 16 https://gaussian.com/gaussian16/ (Gaussian Inc., 2016).
Myers, J., Grothaus, G., Narayanan, S. & Onufriev, A. A simple clustering algorithm can be accurate enough for use in calculations of pKs in macromolecules. Proteins 63, 928–938 (2006).
CAS PubMed Google Scholar
Anandakrishnan, R., Aguilar, B. & Onufriev, A. V. H++ 3.0: automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulations. Nucleic Acids Res. 40, W537–W541 (2012).
CAS PubMed PubMed Central Google Scholar
Gordon, J. C. et al. H++: a server for estimating pK_as and adding missing hydrogens to macromolecules. Nucleic Acids Res. 33, W368–W371 (2005).
CAS PubMed PubMed Central Google Scholar
Roe, D. R. & Brooks, B. R. A protocol for preparing explicitly solvated systems for stable molecular dynamics simulations. J. Chem. Phys. 153, 054123 (2020).
CAS PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
CAS PubMed Google Scholar
Tomasi, J., Mennucci, B. & Cammi, R. Quantum mechanical continuum solvation models. Chem. Rev. 105, 2999–3093 (2005).
CAS PubMed Google Scholar
Iozzi, M. F., Mennucci, B., Tomasi, J. & Cammi, R. Excitation energy transfer (EET) between molecules in condensed matter: a novel application of the polarizable continuum model (PCM). J. Chem. Phys. 120, 7029–7040 (2004).
CAS PubMed Google Scholar
Jurinovich, S., Guido, C. A., Bruhn, T., Pescitelli, G. & Mennucci, B. The role of magnetic–electric coupling in exciton-coupled ECD spectra: the case of bis-phenanthrenes. Chem. Commun. 51, 10498–10501 (2015).
CAS Google Scholar
Jurinovich, S., Cupellini, L., Guido, C. A. & Mennucci, B. EXAT: EXcitonic analysis tool. J. Comput. Chem. 39, 279–286 (2018).
CAS PubMed Google Scholar
Jurinovich, S., Pescitelli, G., Di Bari, L. & Mennucci, B. A TDDFT/MMPol/PCM model for the simulation of exciton-coupled circular dichroism spectra. Phys. Chem. Chem. Phys. 16, 16407–16418 (2014).
CAS PubMed Google Scholar
Renger, T. & Marcus, R. A. On the relation of protein dynamics and exciton relaxation in pigment–protein complexes: an estimation of the spectral density and a theory for the calculation of optical spectra. J. Chem. Phys. 116, 9997–10019 (2002).
CAS Google Scholar
Rätsep, M., Pieper, J., Irrgang, K.-D. & Freiberg, A. Excitation wavelength-dependent electron−phonon and electron−vibrational coupling in the CP29 antenna complex of green plants. J. Phys. Chem. B 112, 110–118 (2008).
PubMed Google Scholar
Madjet, M. E., Abdurahman, A. & Renger, T. Intermolecular coulomb couplings from ab initio electrostatic potentials: application to optical transitions of strongly coupled pigments in photosynthetic antennae and reaction centers. J. Phys. Chem. B 110, 17268–17281 (2006).
CAS PubMed Google Scholar
Otwinowski, Z. & Minor, W. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276, 307–326 (1997).
CAS PubMed Google Scholar
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr. D Biol. Crystallogr. 69, 1204–1214 (2013).
CAS PubMed PubMed Central Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
CAS PubMed PubMed Central Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 (2010).
CAS PubMed PubMed Central Google Scholar
Moriarty, N. W., Grosse-Kunstleve, R. W. & Adams, P. D. electronic Ligand Builder and Optimization Workbench (eLBOW): a tool for ligand coordinate and restraint generation. Acta Crystallogr. D Biol. Crystallogr. 65, 1074–1080 (2009).
CAS PubMed PubMed Central Google Scholar
Kabsch, W. XDS. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 (2010).
CAS PubMed PubMed Central Google Scholar
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 (2011).
CAS PubMed PubMed Central Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
CAS PubMed PubMed Central Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
PubMed Google Scholar
Williams, C. J. et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
CAS PubMed Google Scholar
Zhang, Y. & Skolnick, J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 33, 2302–2309 (2005).
CAS PubMed PubMed Central Google Scholar
Hsia, Y. et al. Design of multi-scale protein complexes by hierarchical building block fusion. Nat. Commun. 12, 2294 (2021).
CAS PubMed PubMed Central Google Scholar
Sheffler, W. et al. Fast and versatile sequence-independent protein docking for nanomaterials design using RPXDock. PLOS Comput. Biol. 19, e1010680 (2023).
CAS PubMed PubMed Central Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
CAS PubMed Google Scholar
Suloway, C. et al. Automated molecular microscopy: the new Leginon system. J. Struct. Biol. 151, 41–60 (2005).
CAS PubMed Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
CAS PubMed PubMed Central Google Scholar
Lander, G. C. et al. Appion: an integrated, database-driven pipeline to facilitate EM image processing. J. Struct. Biol. 166, 95–102 (2009).
CAS PubMed PubMed Central Google Scholar
Bepler, T. et al. Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs. Nat. Methods 16, 1153–1160 (2019).
CAS PubMed PubMed Central Google Scholar
Punjani, A. & Fleet, D. J. 3D variability analysis: resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. J. Struct. Biol. 213, 107702 (2021).
CAS PubMed Google Scholar
Echols, N. et al. Graphical tools for macromolecular crystallography in PHENIX. J. Appl. Crystallogr. 45, 581–586 (2012).
CAS PubMed PubMed Central Google Scholar
Álvarez-Moreno, M. et al. Managing the computational chemistry big data problem: the ioChem-BD platform. J. Chem. Inf. Model. 55, 95–103 (2015).
PubMed Google Scholar

Download references

Acknowledgements

This work was supported by The Audacious Project at the Institute for Protein Design (N.N., A.K.B., A.K., L.S., D.B.), the National Institute on Aging grant 5U19AG065156-02 (D.R.H., D.B.), the Open Philanthropy Project Improving Protein Design Fund (R.L.R., D.E., G.B., B.L.S., D.B), a donation from Amgen to the Institute for Protein Design (S.W.), Department of Energy (DOE) ARPA-E Grant #2459-1671 (N.M.E., A.S., D.B.), the Washington Research Foundation (R.A.C.), the US National Science Foundation DGE1762114 (M.A.K.), the National Institute of General Medical Sciences (R01 GM105691, R01 GM139752 and R35 GM148166; B.L.S.), the Howard Hughes Medical Institute (D.B.) and the US DOE Office of Science (DE-SC0018940 MOD03). Additional support was provided by the Fred Hutchinson Cancer Center and the National Institute of Health (NIH) shared instrumentation grant S10OD028581-01. We thank X. Li, M. Lamb and R. K. Ballard for lending their expertise in mass spectrometry. We acknowledge support from Purdue University (M.R., A.P.), the US DOE Office of Science, Basic Energy Sciences, under award no. DE-SC0022884 (M.R., A.P.) and National Science Foundation Career Award no. 1945572 (A.P.D., A.V.S., A.S.H., J.R.C.). G.A.S. and C.N.H. are supported by the European Research Council Synergy Award 854126. C.V. and M.P.J. acknowledge funding (grant BB/V006630/1) from the Biotechnology and Biological Sciences Research Council UK. SAXS data were collected at the ALS, a national user facility operated by the Lawrence Berkeley National Laboratory on behalf of the DOE Office of Basic Energy Sciences, through the Integrated Diffraction Analysis Technologies program, supported by the DOE Office of Biological and Environmental Research. Additional support comes from NIH project ALS-ENABLE (P30 GM124169) and High-End Instrumentation Grant S10OD018483. The ALS-ENABLE beamlines are supported in part by the NIH, National Institute of General Medical Sciences, grant P30 GM124169-01. The ALS is a US DOE Office of Science user facility under contract no. DE-AC02-05CH11231. The authors acknowledge funding from the European Union’s Horizon 2020 program Marie Sklodowska-Curie grant agreements no. 801474 (M.C.) and no. 754510 (V.M.); from the State Research Agency/Spanish Ministry of Science and Innovation (AEI/MICINN/10.13039/501100011033) through the Severo Ochoa Excellence Accreditation CEX2019-000925-S (M.C. and E.R.); from the European Research Council under starting grant agreement no. 805524 (BioInspired_SolarH2, S.S., M.C. and E.R.); the computer resources at Pirineus and CTE-Power9 and the technical support provided by the Consorci de Serveis Universitaris de Catalunya and the Barcelona Supercomputing Center (RES grant QH-2021-2-0017). We thank W. Rice, A. Paquette and B. Wang of NYU Langone Health’s Cryo–Electron Microscopy Laboratory (RRID: SCR_019202) for assistance with cryo-EM grid screening and the NYU High-Performance Computing team for computing resource management and troubleshooting. We are grateful to all Bhabha/Ekiert laboratory members, especially N. Coudray, for their support and discussions related to cryo-EM processing and interpretation. We thank E. Eng and E. Chua of the Simons Electron Microscopy Center (SEMC), located at the New York Structural Biology Center (NYSBC), for Krios imaging scheduling and data collection. Some of this work was performed at the SEMC and National Resource for Automated Molecular Microscopy located at the NYSBC, supported by grants from the Simons Foundation (SF349247), NYSTAR and the NIH National Institute of General Medical Studies (GM103310), with additional support from the Agouron Institute (F00316), NIH (OD019994) and NIH (RR029300).

Author information

Authors and Affiliations

Institute for Protein Design, University of Washington, Seattle, WA, USA
Nathan M. Ennist, Shunzhi Wang, Adam P. Moyer, Derrick R. Hicks, Avi Z. Swartz, Ralph A. Cacho, Nathan Novy, Asim K. Bera, Alex Kang, Lance Stewart & David Baker
Department of Biochemistry, University of Washington, Seattle, WA, USA
Nathan M. Ennist, Shunzhi Wang, Madison A. Kennedy, Adam P. Moyer, Derrick R. Hicks, Avi Z. Swartz, Ralph A. Cacho, Nathan Novy, Asim K. Bera, Alex Kang, Lance Stewart & David Baker
Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Madison A. Kennedy & Barry L. Stoddard
Institute of Chemical Research of Catalonia (ICIQ-CERCA), Barcelona Institute of Science and Technology (BIST), Tarragona, Spain
Mariano Curti, Valentin Maffeis, Saeed Shareef & Elisabet Romero
School of Biosciences, University of Sheffield, Sheffield, UK
George A. Sutherland, Cvetelin Vasilev, Matthew P. Johnson & C. Neil Hunter
Department of Cell Biology and Skirball Institute of Biomolecular Medicine, New York University School of Medicine, New York, NY, USA
Rachel L. Redler, Damian Ekiert & Gira Bhabha
Departament de Química Física i Inorgànica, Universitat Rovira i Virgili, Tarragona, Spain
Saeed Shareef
Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
Anthony V. Sica, Ash Sueh Hua, Arundhati P. Deshmukh & Justin R. Caram
Molecular Biophysics and Integrated Bioimaging, Berkeley Center for Structural Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Banumathi Sankaran
Department of Chemistry, Purdue University, West Lafayette, IN, USA
Amala Phadkule & Mike Reppert
Department of Microbiology, New York University School of Medicine, New York, NY, USA
Damian Ekiert
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
David Baker

Authors

Nathan M. Ennist
View author publications
You can also search for this author in PubMed Google Scholar
Shunzhi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Madison A. Kennedy
View author publications
You can also search for this author in PubMed Google Scholar
Mariano Curti
View author publications
You can also search for this author in PubMed Google Scholar
George A. Sutherland
View author publications
You can also search for this author in PubMed Google Scholar
Cvetelin Vasilev
View author publications
You can also search for this author in PubMed Google Scholar
Rachel L. Redler
View author publications
You can also search for this author in PubMed Google Scholar
Valentin Maffeis
View author publications
You can also search for this author in PubMed Google Scholar
Saeed Shareef
View author publications
You can also search for this author in PubMed Google Scholar
Anthony V. Sica
View author publications
You can also search for this author in PubMed Google Scholar
Ash Sueh Hua
View author publications
You can also search for this author in PubMed Google Scholar
Arundhati P. Deshmukh
View author publications
You can also search for this author in PubMed Google Scholar
Adam P. Moyer
View author publications
You can also search for this author in PubMed Google Scholar
Derrick R. Hicks
View author publications
You can also search for this author in PubMed Google Scholar
Avi Z. Swartz
View author publications
You can also search for this author in PubMed Google Scholar
Ralph A. Cacho
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Novy
View author publications
You can also search for this author in PubMed Google Scholar
Asim K. Bera
View author publications
You can also search for this author in PubMed Google Scholar
Alex Kang
View author publications
You can also search for this author in PubMed Google Scholar
Banumathi Sankaran
View author publications
You can also search for this author in PubMed Google Scholar
Matthew P. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Amala Phadkule
View author publications
You can also search for this author in PubMed Google Scholar
Mike Reppert
View author publications
You can also search for this author in PubMed Google Scholar
Damian Ekiert
View author publications
You can also search for this author in PubMed Google Scholar
Gira Bhabha
View author publications
You can also search for this author in PubMed Google Scholar
Lance Stewart
View author publications
You can also search for this author in PubMed Google Scholar
Justin R. Caram
View author publications
You can also search for this author in PubMed Google Scholar
Barry L. Stoddard
View author publications
You can also search for this author in PubMed Google Scholar
Elisabet Romero
View author publications
You can also search for this author in PubMed Google Scholar
C. Neil Hunter
View author publications
You can also search for this author in PubMed Google Scholar
David Baker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.M.E. and D.B. conceived the project, planned the research and designed experiments. N.M.E., S.W., A.P.M., D.R.H., R.A.C. and N.N. designed proteins. N.M.E., S.W., D.R.H., A.Z.S., R.A.C. and N.N. experimentally screened designed proteins. M.A.K., A.K.B., A.K., B.S. and B.L.S. performed X-ray crystallographic experiments. M.C. conducted molecular dynamics simulations and calculations of CD spectra. G.A.S., C.V., M.P.J. and C.N.H. used contact printing and FLIM to investigate energy transfer. R.L.R., D.E. and G.B. carried out cryo-EM data collection and processing. N.M.E., M.C., V.M., S.S. and E.R. participated in spectroscopic ZnPPaM binding assays and data analysis. A.V.S., A.S.H., A.P.D., A.P., M.R. and J.R.C. measured low-temperature spectra and analyzed data. N.M.E., M.C., M.R. and E.R. analyzed and discussed data related to excitonic coupling. L.S. and D.B. contributed to conceptualization and administration of computational protein design. N.M.E. wrote the initial manuscript, and all authors discussed the results and contributed to the final manuscript.

Corresponding authors

Correspondence to Nathan M. Ennist or David Baker.

Ethics declarations

Competing interests

D.B. and D.R.H. are the authors of the patent application “De novo designed protein homodimers containing tunable symmetric pockets”, application number 17/938,752, filing date 7 October, 2022, submitted by the University of Washington for the design of homodimeric proteins that were used as initial scaffolds for the design of Chl-binding proteins in this study.

Peer review

Peer review information

Nature Chemical Biology thanks Vikas Nanda and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs, 1–22 and Tables 1–7.

Reporting Summary

Supplementary Video 1

Movie showing cryo-EM data of nanocage breathing motions #1.

Supplementary Video 2

Movie showing cryo-EM data of nanocage breathing motions #2.

Source data

Source Data Fig. 2

Raw SAXS data for SP1, SP2 and SP3 apo proteins.

Source Data Fig. 2

CD and UV/vis spectra and thermal melts monitored by CD of SP1, SP2 and SP3 apo- and holo-state proteins.

Source Data Fig. 4

Low-temperature absorbance and fluorescence spectra of SP2−ZnPPaM.

Source Data Fig. 5

Absorbance and fluorescence spectra of CpcA−PEB and SP2−ZnPPa.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ennist, N.M., Wang, S., Kennedy, M.A. et al. De novo design of proteins housing excitonically coupled chlorophyll special pairs. Nat Chem Biol 20, 906–915 (2024). https://doi.org/10.1038/s41589-024-01626-0

Download citation

Received: 25 March 2023
Accepted: 15 April 2024
Published: 03 June 2024
Issue Date: July 2024
DOI: https://doi.org/10.1038/s41589-024-01626-0
Springer Nature America, Inc.

De novo design of proteins housing excitonically coupled chlorophyll special pairs

Abstract

Similar content being viewed by others

Main

Results

De novo protein design strategy

Special pair protein stability and pigment binding

High-resolution X-ray crystal structures

Excitonic coupling between Chl molecules

Special pair proteins as energy transfer acceptors

Chl-binding supercomplex

Discussion

Methods

Computational placement of the Chl special pair into symmetric protein scaffolds

Protein expression and purification

Protein−Chl sample preparation

CD spectroscopy

SAXS

Fluorescence quantum yield measurements

Low-temperature absorbance and fluorescence spectroscopy in a sucrose/trehalose film

Molecular dynamics simulations

Calculation of Chl dimer excitonic coupling and spectra

FLIM

X-ray crystallography of SP1 and SP2

X-ray crystallography of SP3x

Protein structure alignment

Nanocage design

Transmission negative-stain EM and image processing

Cryo-EM grid preparation and data collection

Cryo-EM data processing and model building

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation