Matthew Hj Cordes
Publications
PMID: 11101895;Abstract:
Arc repressor bearing the N11L substitution (Arc-N11L) is an evolutionary intermediate between the wild type protein, in which the region surrounding position 11 forms a β-sheet, and a double mutant 'switch Arc', in which this region is helical. Here, Arc-N11L is shown to be able to adopt either the wild type or mutant conformations. Exchange between these structures occurs on the millisecond time scale in a dynamic equilibrium in which the relative populations of each fold depend on temperature, solvent conditions and ligand binding. The N11L mutation serves as an evolutionary bridge from the β-sheet to the helical fold because in the mutant, Leu is an integral part of the hydrophobic core of the new structure but can also occupy a surface position in the wild type structure. Conversely, the polar Asn 11 side chain serves as a negative design element in wild type Arc because it cannot be incorporated into the core of the mutant fold.
Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a "stepping-stone" method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and lambda. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and lambda. The domains show 40% sequence identity but differ by switching of alpha-helix to beta-sheet in a C-terminal region spanning approximately 25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization.
PMID: 18924109;PMCID: PMC3014317;Abstract:
ORFan genes can constitute a large fraction of a bacterial genome, but due to their lack of homologs, their functions have remained largely unexplored. To determine if particular features of ORFan-encoded proteins promote their presence in a genome, we analyzed properties of ORFans that originated over a broad evolutionary timescale. We also compared ORFan genes to another class of acquired genes, heterogeneous occurrence in prokaryotes (HOPs), which have homologs in other bacteria. A total of 54 ORFan and HOP genes selected from different phylogenetic depths in the Escherichia coli lineage were cloned, expressed, purified, and subjected to circular dichroism (CD) spectroscopy. A majority of genes could be expressed, but only 18 yielded sufficient soluble protein for spectral analysis. Of these, half were significantly α-helical, three were predominantly b-sheet, and six were of intermediate/indeterminate structure. Although a higher proportion of HOPs yielded soluble proteins with resolvable secondary structures, ORFans resembled HOPs with regard to most of the other features tested. Overall, we found that those ORFan and HOP genes that have persisted in the E. coli lineage were more likely to encode soluble and folded proteins, more likely to display environmental modulation of their gene expression, and by extrapolation, are more likely to be functional. © 2008 Wiley-VCH Verlag GmbH & Co. KGaA.
Cro proteins from different lambdoid bacteriophages are extremely variable in their target consensus DNA sequences and constitute an excellent model for evolution of transcription factor specificity. We experimentally tested a bioinformatically derived evolutionary code relating switches between pairs of amino acids at three recognition helix sites in Cro proteins to switches between pairs of nucleotide bases in the cognate consensus DNA half-sites. We generated all eight possible code variants of bacteriophage λ Cro and used electrophoretic mobility shift assays to compare binding of each variant to its own putative cognate site and to the wild-type cognate site; we also tested the wild-type protein against all eight DNA sites. Each code variant showed stronger binding to its putative cognate site than to the wild-type site, except some variants containing proline at position 27; each also bound its cognate site better than wild-type Cro bound the same site. Most code variants, however, displayed poorer affinity and specificity than wild-type λ Cro. Fluorescence anisotropy assays on λ Cro and the triple code variant (PSQ) against the two cognate sites confirmed the switch in specificity and showed larger apparent effects on binding affinity and specificity. Bacterial one-hybrid assays of λ Cro and PSQ against libraries of sequences with a single randomized half-site showed the expected switches in specificity at two of three coded positions and no clear switches in specificity at noncoded positions. With a few caveats, these results confirm that the proposed Cro evolutionary code can be used to reengineer Cro specificity.
PMID: 8696970;Abstract:
Protein design efforts are beginning to yield molecules with many of the properties of natural proteins. Such experiments are informed by and contribute to our understanding of the sequence determinants of protein folding and stability. The most important design elements seem to be the proper placement of hydrophobic residues along the polypeptide chain and the ability of these residues to form a well packed core. Buried polar interactions, turn and capping motifs and secondary structural propensities also contribute, although probably to a lesser extent.