Catalogue Search | MBRL

Consensus sequence design as a general strategy to create hyperstable, biologically active proteins

by Sternke, Matt , Tripp, Katherine W. , Barrick, Doug in Biological activity , Biological evolution , Biological Sciences

2019

Consensus sequence design offers a promising strategy for designing proteins of high stability while retaining biological activity since it draws upon an evolutionary history in which residues important for both stability and function are likely to be conserved. Although there have been several reports of successful consensus design of individual targets, it is unclear from these anecdotal studies how often this approach succeeds and how often it fails. Here, we attempt to assess generality by designing consensus sequences for a set of six protein families with a range of chain lengths, structures, and activities. We characterize the resulting consensus proteins for stability, structure, and biological activities in an unbiased way. We find that all six consensus proteins adopt cooperatively folded structures in solution. Strikingly, four of six of these consensus proteins show increased thermodynamic stability over naturally occurring homologs. Each consensus protein tested for function maintained at least partial biological activity. Although peptide binding affinity by a consensus-designed SH3 is rather low, Km values for consensus enzymes are similar to values from extant homologs. Although consensus enzymes are slower than extant homologs at low temperature, they are faster than some thermophilic enzymes at high temperature. An analysis of sequence properties shows consensus proteins to be enriched in charged residues, and rarified in uncharged polar residues. Sequence differences between consensus and extant homologs are predominantly located at weakly conserved surface residues, highlighting the importance of these residues in the success of the consensus strategy.
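As a rough illustration of what "designing a consensus sequence" can mean, the sketch below takes the most common non-gap residue in each column of a pre-aligned set of sequences. The plurality rule, the function name `consensus_sequence`, and the toy alignment are assumptions made for illustration; the abstract does not describe the authors' actual procedure, which may filter sequences or gap-rich columns differently.

```python
from collections import Counter


def consensus_sequence(aligned, gap="-"):
    """Return the most common non-gap residue in each alignment column.

    Ties are broken in favour of the residue seen first in the column.
    Columns that are gaps in every sequence are dropped.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    consensus = []
    for column in zip(*aligned):
        counts = Counter(res for res in column if res != gap)
        if counts:
            consensus.append(counts.most_common(1)[0][0])
    return "".join(consensus)


# Toy alignment (made up, not taken from the paper).
toy_alignment = ["MKV-LE", "MRVAL-", "MKIALE", "MKV-LD"]
print(consensus_sequence(toy_alignment))
```

Note that this sketch keeps a column even when most sequences have a gap there; a real design would likely need an explicit rule for such positions.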

Journal Article


Detection of Low-Frequency Mutations and Identification of Heat-Induced Artifactual Mutations Using Duplex Sequencing

by Lee, Seung Hyuk , Ahn, Eun Hyun in Adult , Cell Line , Consensus Sequence

2019

We present a genome-wide comparative and comprehensive analysis of three different sequencing methods (conventional next generation sequencing (NGS), tag-based single strand sequencing (e.g., SSCS), and Duplex Sequencing) for investigating mitochondrial mutations in human breast epithelial cells. Duplex Sequencing yields both a single strand consensus sequence (SSCS) and a duplex consensus sequence (DCS) analysis. Our study validates that although high-frequency mutations are detectable by all three sequencing methods with similar accuracy and reproducibility, rare (low-frequency) mutations are not accurately detectable by NGS and SSCS. Even with conservative bioinformatical modification to overcome the high error rate of NGS, the NGS frequency of rare mutations is 7.0 × 10⁻⁴. The frequency is reduced to 1.3 × 10⁻⁴ with SSCS and is further reduced to 1.0 × 10⁻⁵ using DCS. Rare mutation context spectra obtained from NGS significantly vary across independent experiments, and it is not possible to identify a dominant mutation context. In contrast, rare mutation context spectra are consistently similar in all independent DCS experiments. We have systematically identified heat-induced artifactual variants and corrected the artifacts using Duplex Sequencing. Specific sequence contexts were analyzed to examine the effects of neighboring bases on the accumulation of heat-induced artifactual variants. All of these artifacts are stochastically occurring rare mutations. C > A/G > T, a signature of oxidative damage, is the most increased (170-fold) heat-induced artifactual mutation type. Our results strongly support the claim that Duplex Sequencing accurately detects low-frequency mutations and identifies and corrects artifactual mutations introduced by heating during DNA preparation.
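The difference between SSCS and DCS can be sketched in a few lines. In this simplified model, reads sharing a tag are collapsed column by column into a single strand consensus, and a duplex consensus keeps a base only where the consensuses of the two complementary strands agree. The function names, the minimum family size of 3, the 0.7 agreement threshold, and the toy reads are all illustrative assumptions, not the parameters used in the study; reads are assumed to be equal-length and already oriented the same way.

```python
from collections import Counter


def sscs(reads, min_family=3, min_agreement=0.7):
    """Collapse reads sharing one tag into a single strand consensus.

    Returns None for families smaller than min_family. A position
    becomes "N" when no base reaches the agreement threshold.
    """
    if len(reads) < min_family:
        return None
    consensus = []
    for column in zip(*reads):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(column) >= min_agreement else "N")
    return "".join(consensus)


def dcs(sscs_a, sscs_b):
    """Keep a base only where both strand consensuses agree, else "N"."""
    return "".join(a if a == b else "N" for a, b in zip(sscs_a, sscs_b))


# Toy families: strand B carries the same change in every read,
# as a strand-specific damage artifact might.
strand_a = sscs(["ACGT", "ACGT", "ACGT", "ACTT"])
strand_b = sscs(["ACGA", "ACGA", "ACGA"])
print(strand_a, strand_b, dcs(strand_a, strand_b))
```

In the toy data the change on strand B survives single strand consensus calling but is masked in the duplex consensus, which is the intuition behind the lower rare-mutation frequency reported for DCS.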

Journal Article


Genome-Wide Classification and Evolutionary Analysis of the bHLH Family of Transcription Factors in Arabidopsis, Poplar, Rice, Moss, and Algae

by Roig-Villanova, Irma , Carretero-Paulet, Lorenzo , Robertson, David L. in Amino Acid Sequence , Amino acids , Animals

2010

Basic helix-loop-helix proteins (bHLHs) are found throughout the three eukaryotic kingdoms and constitute one of the largest families of transcription factors. A growing number of bHLH proteins have been functionally characterized in plants. However, some of these have not been previously classified. We present here an updated and comprehensive classification of the bHLHs encoded by the whole sequenced genomes of Arabidopsis (Arabidopsis thaliana), Populus trichocarpa, Oryza sativa, Physcomitrella patens, and five algae species. We define a plant bHLH consensus motif, which allowed the identification of novel highly diverged atypical bHLHs. Using yeast two-hybrid assays, we confirm that (1) a highly diverged bHLH has retained protein interaction activity and (2) the two most conserved positions in the consensus play an essential role in dimerization. Phylogenetic analysis permitted classification of the 638 bHLH genes identified into 32 subfamilies. Evolutionary and functional relationships within subfamilies are supported by intron patterns, predicted DNA-binding motifs, and the architecture of conserved protein motifs. Our analyses reveal the origin and evolutionary diversification of plant bHLHs through differential expansions, domain shuffling, and extensive sequence divergence. At the functional level, this would translate into different subfamilies evolving specific DNA-binding and protein interaction activities as well as differential transcriptional regulatory roles. Our results suggest a role for bHLH proteins in generating plant phenotypic diversity and provide a solid framework for further investigations into the role carried out in the transcriptional regulation of key growth and developmental processes.

Journal Article


Widespread transcription at neuronal activity-regulated enhancers

by Harmin, David A. , Laptewicz, Mike , Bito, Haruhiko in 631/208 , 631/337/572 , 631/378

2010

We used genome-wide sequencing methods to study stimulus-dependent enhancer function in mouse cortical neurons. We identified ∼12,000 neuronal activity-regulated enhancers that are bound by the general transcriptional co-activator CBP in an activity-dependent manner. A function of CBP at enhancers may be to recruit RNA polymerase II (RNAPII), as we also observed activity-regulated RNAPII binding to thousands of enhancers. Notably, RNAPII at enhancers transcribes bi-directionally a novel class of enhancer RNAs (eRNAs) within enhancer domains defined by the presence of histone H3 monomethylated at lysine 4. The level of eRNA expression at neuronal enhancers positively correlates with the level of messenger RNA synthesis at nearby genes, suggesting that eRNA synthesis occurs specifically at enhancers that are actively engaged in promoting mRNA synthesis. These findings reveal that a widespread mechanism of enhancer activation involves RNAPII binding and eRNA synthesis.

Activity-regulated enhancers

Regulatory proteins bind non-coding DNA either close to a gene's mRNA transcription start site at a promoter, or further away on the genome at an enhancer. Enhancers act by helping to recruit the RNA polymerase to the promoter. Now a genome-wide sequencing study of more than 10,000 enhancers that respond to electrical activity in neurons shows that the regulatory process also brings the polymerase to the enhancers themselves, where it transcribes non-coding RNAs. This 'enhancer RNA' (eRNA) synthesis occurs only at enhancers actively engaged in promoting mRNA synthesis from a promoter. The results suggest that at least in the brain, enhancers play a more active 'promoter-like' role in regulating gene expression than previously appreciated.

Regulatory proteins bind non-coding DNA either at promoters (near to a gene's transcription start site) or at enhancers (far away). Binding at enhancers helps to bring the transcription enzyme RNA polymerase to promoters. Here, studies of some 12,000 enhancers that respond to electrical activity in neurons show that binding to enhancers also brings the polymerase to the enhancers themselves, where it transcribes a novel class of non-coding RNAs. Enhancers may thus be more similar to promoters than hitherto appreciated.

Journal Article


Definition of the bacterial N-glycosylation site consensus sequence

by Hug, Isabelle , Kelly, John F , Numao, Shin in Amino Acid Sequence - genetics , Amino Acid Substitution - genetics , Amino Acids - chemistry

2006

The Campylobacter jejuni pgl locus encodes an N-linked protein glycosylation machinery that can be functionally transferred into Escherichia coli. In this system, we analyzed the elements in the C. jejuni N-glycoprotein AcrA required for accepting an N-glycan. We found that the eukaryotic primary consensus sequence for N-glycosylation is N-terminally extended to D/E-Y-N-X-S/T (Y, X≠P) for recognition by the bacterial oligosaccharyltransferase (OST) PglB. However, not all consensus sequences were N-glycosylated when they were either artificially introduced or when they were present in non-C. jejuni proteins. We were able to produce recombinant glycoproteins with engineered N-glycosylation sites and confirmed the requirement for a negatively charged side chain at position −2 in C. jejuni N-glycoproteins. N-glycosylation of AcrA by the eukaryotic OST in Saccharomyces cerevisiae occurred independent of the acidic residue at the −2 position. Thus, bacterial N-glycosylation site selection is more specific than the eukaryotic equivalent with respect to the polypeptide acceptor sequence.
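The extended consensus D/E-Y-N-X-S/T (Y, X≠P) maps naturally onto a regular expression, which makes the contrast with the shorter eukaryotic N-X-S/T motif easy to see. The sketch below is only a sequence scan; the names `BACTERIAL_SEQUON`, `EUKARYOTIC_SEQUON`, and `find_sites` and the test peptides are hypothetical, and, as the abstract notes, a sequence match does not guarantee that a site is actually glycosylated.

```python
import re

# Lookaheads are used so that overlapping candidate sites are all reported.
# Bacterial (PglB) consensus: D/E-Y-N-X-S/T with Y and X not proline.
BACTERIAL_SEQUON = re.compile(r"(?=([DE][^P]N[^P][ST]))")
# Eukaryotic primary consensus: N-X-S/T with X not proline.
EUKARYOTIC_SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sites(pattern, protein):
    """Return (0-based start index, matched motif) for every candidate site."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(protein)]


# Made-up peptides differing only at the -2 position relative to Asn.
with_acidic = "AADQNATKK"
without_acidic = "AAGQNATKK"

print(find_sites(BACTERIAL_SEQUON, with_acidic))
print(find_sites(BACTERIAL_SEQUON, without_acidic))
print(find_sites(EUKARYOTIC_SEQUON, without_acidic))
```

The second peptide still matches the eukaryotic pattern but not the bacterial one, mirroring the stricter acceptor requirement described above.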

Journal Article


Small design from big alignment: engineering proteins with multiple sequence alignment as the starting point

by Wang, Tianwen , Chen, Liang , An, Yafei in Alignment , Conserved sequence , Engineering

2020

Multiple sequence alignment (MSA) is a fundamental way to gain information that cannot be obtained from the analysis of any individual sequence included in the alignment. It provides ways to investigate the relationship between sequence and function from an evolutionary perspective. Thus, the MSA of proteins can be employed as a reference for protein engineering. In this paper, we review recent advances to highlight how protein engineering has benefited from the MSA of proteins. These methods include (1) engineering the thermostability or solubility of proteins by introducing site mutations that move the sequence closer to the consensus sequence of the alignment; (2) structure-based engineering of proteins with comparative modeling; (3) creating paleoenzymes featuring high thermostability and promiscuity by constructing the ancestral sequences derived from multiple sequence alignment; and (4) incorporating site mutations targeting the evolutionarily coupled sites identified from multiple sequence alignment.
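Method (1) above, moving a protein toward the alignment consensus by point mutations, can be sketched as a comparison of an aligned target with a consensus string. The function name `consensus_mutations`, the mutation notation (original residue, position in the ungapped target, consensus residue), and the toy sequences are assumptions for illustration; the review does not prescribe this procedure, and in practice candidates would be filtered by conservation, structure, or experiment.

```python
def consensus_mutations(target, consensus, gap="-"):
    """List substitutions that would move an aligned target toward the consensus.

    Positions are numbered along the ungapped target sequence (1-based).
    Columns where either sequence has a gap are skipped, so insertions
    and deletions are not proposed.
    """
    if len(target) != len(consensus):
        raise ValueError("target and consensus must be aligned to equal length")
    mutations = []
    position = 0
    for target_res, consensus_res in zip(target, consensus):
        if target_res == gap:
            continue
        position += 1
        if consensus_res != gap and target_res != consensus_res:
            mutations.append(f"{target_res}{position}{consensus_res}")
    return mutations


# Toy aligned pair (made up).
print(consensus_mutations("MRV-LD", "MKVALE"))
```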

Journal Article


The Velvet Family of Fungal Regulators Contains a DNA-Binding Domain Structurally Similar to NF-κB

by Bayram, Özgür , Ficner, Ralf , Park, Hee-Soo in Aspergillus nidulans - genetics , Aspergillus nidulans - physiology , Biology

2013

Morphological development of fungi and their combined production of secondary metabolites are both acting in defence and protection. These processes are mainly coordinated by velvet regulators, which contain a yet functionally and structurally uncharacterized velvet domain. Here we demonstrate that the velvet domain of VosA is a novel DNA-binding motif that specifically recognizes an 11-nucleotide consensus sequence consisting of two motifs in the promoters of key developmental regulatory genes. The crystal structure analysis of the VosA velvet domain revealed an unforeseen structural similarity with the Rel homology domain (RHD) of the mammalian transcription factor NF-κB. Based on this structural similarity several conserved amino acid residues present in all velvet domains have been identified and shown to be essential for the DNA binding ability of VosA. The velvet domain is also involved in dimer formation as seen in the solved crystal structures of the VosA homodimer and the VosA-VelB heterodimer. These findings suggest that defence mechanisms of both fungi and animals might be governed by structurally related DNA-binding transcription factors.

Journal Article


Chimeric HP-PRRSV2 containing an ORF2-6 consensus sequence induces antibodies with broadly neutralizing activity and confers cross protection against virulent NADC30-like isolate

by Xiao, Yanzhao , Yan, Xilin , Li, Jixiang in Antibiotics , Antibodies , Antigens

2021

Due to the substantial genetic diversity of porcine reproductive and respiratory syndrome virus (PRRSV), commercial PRRS vaccines fail to provide sufficient cross protection. Previous studies have confirmed the existence of PRRSV broadly neutralizing antibodies (bnAbs). However, bnAbs are rarely induced by either natural infection or vaccination. In this study, we designed and synthesized a consensus sequence of PRRSV2 ORF2-6 genes (ORF2-6-CON) encoding all envelope proteins based on 30 representative Chinese PRRSV isolates. The ORF2-6-CON sequence shared > 90% nucleotide identities to all four lineages of PRRSV2 isolates in China. A chimeric virus (rJS-ORF2-6-CON) containing the ORF2-6-CON was generated using the avirulent HP-PRRSV2 JSTZ1712-12 infectious clone as a backbone. The rJS-ORF2-6-CON has similar replication efficiency as the backbone virus in vitro. Furthermore, pig inoculation and challenge studies showed that rJS-ORF2-6-CON is not pathogenic to piglets and confers better cross protection against the virulent NADC30-like isolate than a commercial HP-PRRS modified live virus (MLV) vaccine. Noticeably, the rJS-ORF2-6-CON strain could induce bnAbs while the MLV strain only induced homologous nAbs. In addition, the lineages of VDJ repertoires potentially associated with distinct nAbs were also characterized. Overall, our results demonstrate that rJS-ORF2-6-CON is a promising candidate for the development of a PRRS genetic engineered vaccine conferring cross protection.

Journal Article


Shrinking of repeating unit length in leucine-rich repeats from double-stranded DNA viruses

by Kretsinger, Robert H , Matsushima, Norio , Miyashita, Hiroki in Deoxyribonucleic acid , DNA viruses , Gene transfer

2021

Leucine-rich repeats (LRRs) are present in over 563,000 proteins from viruses to eukaryotes. LRRs repeat in tandem and have been classified into fifteen classes in which the repeat unit lengths range from 20 to 29 residues. Most LRR proteins are involved in protein-protein or ligand interactions. The amount of genome sequence data from viruses is increasing rapidly, and although viral LRR proteins have been identified, a comprehensive sequence analysis has not yet been done, and their structures, functions, and evolution are still unknown. In the present study, we characterized viral LRRs by sequence analysis and identified over 600 LRR proteins from 89 virus species. Most of these proteins were from double-stranded DNA (dsDNA) viruses, including nucleocytoplasmic large dsDNA viruses (NCLDVs). We found that the repeating unit lengths of 11 types are one to five residues shorter than those of the seven known corresponding LRR classes. The repeating units of six types are 19 residues long and are thus the shortest among all LRRs. In addition, two of the LRR types are unique and have not been observed in bacteria, archaea or eukaryotes. Conserved strongly hydrophobic residues such as Leu, Val or Ile in the consensus sequences are replaced by Cys with high frequency. Phylogenetic analysis indicated that horizontal gene transfer of some viral LRR genes had occurred between the virus and its host. We suggest that the shortening might contribute to the survival strategy of viruses. The present findings provide a new perspective on the origin and evolution of LRRs.

Journal Article


PDSM, a Motif for Phosphorylation-Dependent SUMO Modification

by Fujimoto, Mitsuaki , Hietakangas, Ville , Anckar, Julius in Amino Acid Motifs - genetics , Antibodies , Biochemistry

2006

SUMO (small ubiquitin-like modifier) modification regulates many cellular processes, including transcription. Although sumoylation often occurs on specific lysines within the consensus tetrapeptide ΨKxE, other modifications, such as phosphorylation, may regulate the sumoylation of a substrate. We have discovered PDSM (phosphorylation-dependent sumoylation motif), composed of a SUMO consensus site and an adjacent proline-directed phosphorylation site (ΨKxExxSP). The highly conserved motif regulates phosphorylation-dependent sumoylation of multiple substrates, such as heat-shock factors (HSFs), GATA-1, and myocyte enhancer factor 2. In fact, the majority of the PDSM-containing proteins are transcriptional regulators. Within the HSF family, PDSM is conserved between two functionally distinct members, HSF1 and HSF4b, whose transactivation capacities are repressed through the phosphorylation-dependent sumoylation. As the first recurrent sumoylation determinant beyond the consensus tetrapeptide, the PDSM provides a valuable tool in predicting new SUMO substrates.
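Because the abstract presents the PDSM as a tool for predicting new SUMO substrates, a simple pattern scan shows how the two motifs relate. Ψ conventionally denotes a large hydrophobic residue; the specific set used here (V, I, L, M, F) is an assumption, as are the names `SUMO_SITE`, `PDSM`, and `find_motifs` and the synthetic test peptides, which are not real protein sequences.

```python
import re

# Assumed residue set for Ψ (large hydrophobic); the abstract does not define it.
PSI = "[VILMF]"

# Lookaheads are used so that overlapping candidate sites are all reported.
SUMO_SITE = re.compile(rf"(?=({PSI}K.E))")   # consensus tetrapeptide ΨKxE
PDSM = re.compile(rf"(?=({PSI}K.E..SP))")    # ΨKxExxSP


def find_motifs(pattern, protein):
    """Return (0-based start index, matched motif) for every candidate site."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(protein)]


# Synthetic peptides: the second lacks the serine of the SP site.
with_pdsm = "AAVKEEPPSPAA"
sumo_only = "AAVKEEPPAPAA"

print(find_motifs(PDSM, with_pdsm))
print(find_motifs(SUMO_SITE, sumo_only))
print(find_motifs(PDSM, sumo_only))
```

Every PDSM match necessarily contains a ΨKxE match at the same position, so the PDSM scan selects a subset of candidate SUMO sites that also carry the adjacent proline-directed phosphorylation site.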

Journal Article

