Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
2,579
result(s) for
"consensus sequence"
Sort by:
Consensus sequence design as a general strategy to create hyperstable, biologically active proteins
by
Sternke, Matt
,
Tripp, Katherine W.
,
Barrick, Doug
in
Biological activity
,
Biological evolution
,
Biological Sciences
2019
Consensus sequence design offers a promising strategy for designing proteins of high stability while retaining biological activity since it draws upon an evolutionary history in which residues important for both stability and function are likely to be conserved. Although there have been several reports of successful consensus design of individual targets, it is unclear from these anecdotal studies how often this approach succeeds and how often it fails. Here, we attempt to assess generality by designing consensus sequences for a set of six protein families with a range of chain lengths, structures, and activities. We characterize the resulting consensus proteins for stability, structure, and biological activities in an unbiased way. We find that all six consensus proteins adopt cooperatively folded structures in solution. Strikingly, four of six of these consensus proteins show increased thermodynamic stability over naturally occurring homologs. Each consensus protein tested for function maintained at least partial biological activity. Although peptide binding affinity by a consensus-designed SH3 is rather low, K
m values for consensus enzymes are similar to values from extant homologs. Although consensus enzymes are slower than extant homologs at low temperature, they are faster than some thermophilic enzymes at high temperature. An analysis of sequence properties shows consensus proteins to be enriched in charged residues, and rarified in uncharged polar residues. Sequence differences between consensus and extant homologs are predominantly located at weakly conserved surface residues, highlighting the importance of these residues in the success of the consensus strategy.
Journal Article
Detection of Low-Frequency Mutations and Identification of Heat-Induced Artifactual Mutations Using Duplex Sequencing
2019
We present a genome-wide comparative and comprehensive analysis of three different sequencing methods (conventional next generation sequencing (NGS), tag-based single strand sequencing (e.g., SSCS), and Duplex Sequencing for investigating mitochondrial mutations in human breast epithelial cells. Duplex Sequencing produces a single strand consensus sequence (SSCS) and a duplex consensus sequence (DCS) analysis, respectively. Our study validates that although high-frequency mutations are detectable by all the three sequencing methods with the similar accuracy and reproducibility, rare (low-frequency) mutations are not accurately detectable by NGS and SSCS. Even with conservative bioinformatical modification to overcome the high error rate of NGS, the NGS frequency of rare mutations is 7.0 × 10−4. The frequency is reduced to 1.3 × 10−4 with SSCS and is further reduced to 1.0 × 10−5 using DCS. Rare mutation context spectra obtained from NGS significantly vary across independent experiments, and it is not possible to identify a dominant mutation context. In contrast, rare mutation context spectra are consistently similar in all independent DCS experiments. We have systematically identified heat-induced artifactual variants and corrected the artifacts using Duplex Sequencing. Specific sequence contexts were analyzed to examine the effects of neighboring bases on the accumulation of heat-induced artifactual variants. All of these artifacts are stochastically occurring rare mutations. C > A/G > T, a signature of oxidative damage, is the most increased (170-fold) heat-induced artifactual mutation type. Our results strongly support the claim that Duplex Sequencing accurately detects low-frequency mutations and identifies and corrects artifactual mutations introduced by heating during DNA preparation.
Journal Article
Genome-Wide Classification and Evolutionary Analysis of the bHLH Family of Transcription Factors in Arabidopsis, Poplar, Rice, Moss, and Algae
by
Roig-Villanova, Irma
,
Carretero-Paulet, Lorenzo
,
Robertson, David L.
in
Amino Acid Sequence
,
Amino acids
,
Animals
2010
Basic helix-loop-helix proteins (bHLHs) are found throughout the three eukaryotic kingdoms and constitute one of the largest families of transcription factors. A growing number of bHLH proteins have been functionally characterized in plants. However, some of these have not been previously classified. We present here an updated and comprehensive classification of the bHLHs encoded by the whole sequenced genomes of Arabidopsis (Arabidopsis thaliana), Populus trichocarpa, Oryza sativa, Physcomitrella patens, and five algae species. We define a plant bHLH consensus motif, which allowed the identification of novel highly diverged atypical bHLHs. Using yeast two-hybrid assays, we confirm that (1) a highly diverged bHLH has retained protein interaction activity and (2) the two most conserved positions in the consensus play an essential role in dimerization. Phylogenetic analysis permitted classification of the 638 bHLH genes identified into 32 subfamilies. Evolutionary and functional relationships within subfamilies are supported by intron patterns, predicted DNA-binding motifs, and the architecture of conserved protein motifs. Our analyses reveal the origin and evolutionary diversification of plant bHLHs through differential expansions, domain shuffling, and extensive sequence divergence. At the functional level, this would translate into different subfamilies evolving specific DNA-binding and protein interaction activities as well as differential transcriptional regulatory roles. Our results suggest a role for bHLH proteins in generating plant phenotypic diversity and provide a solid framework for further investigations into the role carried out in the transcriptional regulation of key growth and developmental processes.
Journal Article
Widespread transcription at neuronal activity-regulated enhancers
2010
We used genome-wide sequencing methods to study stimulus-dependent enhancer function in mouse cortical neurons. We identified ∼12,000 neuronal activity-regulated enhancers that are bound by the general transcriptional co-activator CBP in an activity-dependent manner. A function of CBP at enhancers may be to recruit RNA polymerase II (RNAPII), as we also observed activity-regulated RNAPII binding to thousands of enhancers. Notably, RNAPII at enhancers transcribes bi-directionally a novel class of enhancer RNAs (eRNAs) within enhancer domains defined by the presence of histone H3 monomethylated at lysine 4. The level of eRNA expression at neuronal enhancers positively correlates with the level of messenger RNA synthesis at nearby genes, suggesting that eRNA synthesis occurs specifically at enhancers that are actively engaged in promoting mRNA synthesis. These findings reveal that a widespread mechanism of enhancer activation involves RNAPII binding and eRNA synthesis.
Activity-regulated enhancers
Regulatory proteins bind non-coding DNA either close to a gene's mRNA transcription start site at a promoter, or further away on the genome at an enhancer. Enhancers act by helping to recruit the RNA polymerase to the promoter. Now a genome-wide sequencing study of more than 10,000 enhancers that respond to electrical activity in neurons shows that the regulatory process also brings the polymerase to the enhancers themselves, where it transcribes non-coding RNAs. This 'enhancer RNA' (eRNA) synthesis occurs only at enhancers actively engaged in promoting mRNA synthesis from a promoter. The results suggest that at least in the brain, enhancers play a more active 'promoter-like' role in regulating gene expression than previously appreciated.
Regulatory proteins bind non-coding DNA either at promoters (near to a gene's transcription start site) or at enhancers (far away). Binding at enhancers helps to bring the transcription enzyme RNA polymerase to promoters. Here, studies of some 12,000 enhancers that respond to electrical activity in neurons show that binding to enhancers also brings the polymerase to the enhancers themselves, where it transcribes a novel class of non-coding RNAs. Enhancers may thus be more similar to promoters than hitherto appreciated.
Journal Article
Definition of the bacterial N-glycosylation site consensus sequence
by
Hug, Isabelle
,
Kelly, John F
,
Numao, Shin
in
Amino Acid Sequence - genetics
,
Amino Acid Substitution - genetics
,
Amino Acids - chemistry
2006
The
Campylobacter jejuni pgl
locus encodes an
N
‐linked protein glycosylation machinery that can be functionally transferred into
Escherichia coli
. In this system, we analyzed the elements in the
C. jejuni N
‐glycoprotein AcrA required for accepting an
N
‐glycan. We found that the eukaryotic primary consensus sequence for
N
‐glycosylation is
N
terminally extended to D/E‐Y‐N‐X‐S/T (Y, X≠P) for recognition by the bacterial oligosaccharyltransferase (OST) PglB. However, not all consensus sequences were
N
‐glycosylated when they were either artificially introduced or when they were present in non‐
C. jejuni
proteins. We were able to produce recombinant glycoproteins with engineered
N
‐glycosylation sites and confirmed the requirement for a negatively charged side chain at position −2 in
C. jejuni N
‐glycoproteins.
N
‐glycosylation of AcrA by the eukaryotic OST in
Saccharomyces cerevisiae
occurred independent of the acidic residue at the −2 position. Thus, bacterial
N
‐glycosylation site selection is more specific than the eukaryotic equivalent with respect to the polypeptide acceptor sequence.
Journal Article
Small design from big alignment: engineering proteins with multiple sequence alignment as the starting point
2020
Multiple sequence alignment (MSA) is a fundamental way to gain information that cannot be obtained from the analysis of any individual sequence included in the alignment. It provides ways to investigate the relationship between sequence and function from a perspective of evolution. Thus, the MSA of proteins can be employed as a reference for protein engineering. In this paper, we reviewed the recent advances to highlight how protein engineering was benefited from the MSA of proteins. These methods include (1) engineering the thermostability or solubility of proteins by making it closer to the consensus sequence of the alignment through introducing site mutations; (2) structure-based engineering proteins with comparative modeling; (3) creating paleoenzymes featured with high thermostability and promiscuity by constructing the ancestral sequences derived from multiple sequence alignment; and (4) incorporating site-mutations targeting the evolutionarily coupled sites identified from multiple sequence alignment.
Journal Article
The Velvet Family of Fungal Regulators Contains a DNA-Binding Domain Structurally Similar to NF-κB
by
Bayram, Özgür
,
Ficner, Ralf
,
Park, Hee-Soo
in
Aspergillus nidulans - genetics
,
Aspergillus nidulans - physiology
,
Biology
2013
Morphological development of fungi and their combined production of secondary metabolites are both acting in defence and protection. These processes are mainly coordinated by velvet regulators, which contain a yet functionally and structurally uncharacterized velvet domain. Here we demonstrate that the velvet domain of VosA is a novel DNA-binding motif that specifically recognizes an 11-nucleotide consensus sequence consisting of two motifs in the promoters of key developmental regulatory genes. The crystal structure analysis of the VosA velvet domain revealed an unforeseen structural similarity with the Rel homology domain (RHD) of the mammalian transcription factor NF-κB. Based on this structural similarity several conserved amino acid residues present in all velvet domains have been identified and shown to be essential for the DNA binding ability of VosA. The velvet domain is also involved in dimer formation as seen in the solved crystal structures of the VosA homodimer and the VosA-VelB heterodimer. These findings suggest that defence mechanisms of both fungi and animals might be governed by structurally related DNA-binding transcription factors.
Journal Article
Chimeric HP-PRRSV2 containing an ORF2-6 consensus sequence induces antibodies with broadly neutralizing activity and confers cross protection against virulent NADC30-like isolate
2021
Due to the substantial genetic diversity of porcine reproductive and respiratory syndrome virus (PRRSV), commercial PRRS vaccines fail to provide sufficient cross protection. Previous studies have confirmed the existence of PRRSV broadly neutralizing antibodies (bnAbs). However, bnAbs are rarely induced by either natural infection or vaccination. In this study, we designed and synthesized a consensus sequence of PRRSV2 ORF2-6 genes (ORF2-6-CON) encoding all envelope proteins based on 30 representative Chinese PRRSV isolates. The ORF2-6-CON sequence shared > 90% nucleotide identities to all four lineages of PRRSV2 isolates in China. A chimeric virus (rJS-ORF2-6-CON) containing the ORF2-6-CON was generated using the avirulent HP-PRRSV2 JSTZ1712-12 infectious clone as a backbone. The rJS-ORF2-6-CON has similar replication efficiency as the backbone virus in vitro. Furthermore, pig inoculation and challenge studies showed that rJS-ORF2-6-CON is not pathogenic to piglets and confers better cross protection against the virulent NADC30-like isolate than a commercial HP-PRRS modified live virus (MLV) vaccine. Noticeably, the rJS-ORF2-6-CON strain could induce bnAbs while the MLV strain only induced homologous nAbs. In addition, the lineages of VDJ repertoires potentially associated with distinct nAbs were also characterized. Overall, our results demonstrate that rJS-ORF2-6-CON is a promising candidate for the development of a PRRS genetic engineered vaccine conferring cross protection.
Journal Article
Shrinking of repeating unit length in leucine-rich repeats from double-stranded DNA viruses
by
Kretsinger, Robert H
,
Matsushima Norio
,
Miyashita Hiroki
in
Deoxyribonucleic acid
,
DNA viruses
,
Gene transfer
2021
Leucine-rich repeats (LRRs) are present in over 563,000 proteins from viruses to eukaryotes. LRRs repeat in tandem and have been classified into fifteen classes in which the repeat unit lengths range from 20 to 29 residues. Most LRR proteins are involved in protein-protein or ligand interactions. The amount of genome sequence data from viruses is increasing rapidly, and although viral LRR proteins have been identified, a comprehensive sequence analysis has not yet been done, and their structures, functions, and evolution are still unknown. In the present study, we characterized viral LRRs by sequence analysis and identified over 600 LRR proteins from 89 virus species. Most of these proteins were from double-stranded DNA (dsDNA) viruses, including nucleocytoplasmic large dsDNA viruses (NCLDVs). We found that the repeating unit lengths of 11 types are one to five residues shorter than those of the seven known corresponding LRR classes. The repeating units of six types are 19 residues long and are thus the shortest among all LRRs. In addition, two of the LRR types are unique and have not been observed in bacteria, archae or eukaryotes. Conserved strongly hydrophobic residues such as Leu, Val or Ile in the consensus sequences are replaced by Cys with high frequency. Phylogenetic analysis indicated that horizontal gene transfer of some viral LRR genes had occurred between the virus and its host. We suggest that the shortening might contribute to the survival strategy of viruses. The present findings provide a new perspective on the origin and evolution of LRRs.
Journal Article
PDSM, a Motif for Phosphorylation-Dependent SUMO Modification
by
Fujimoto, Mitsuaki
,
Hietakangas, Ville
,
Anckar, Julius
in
Amino Acid Motifs - genetics
,
Antibodies
,
Biochemistry
2006
SUMO (small ubiquitin-like modifier) modification regulates many cellular processes, including transcription. Although sumoylation often occurs on specific lysines within the consensus tetrapeptide ΨKxE, other modifications, such as phosphorylation, may regulate the sumoylation of a substrate. We have discovered PDSM (phosphorylation-dependent sumoylation motif), composed of a SUMO consensus site and an adjacent proline-directed phosphorylation site (ΨKxExxSP). The highly conserved motif regulates phosphorylation-dependent sumoylation of multiple substrates, such as heat-shock factors (HSFs), GATA-1, and myocyte enhancer factor 2. In fact, the majority of the PDSM-containing proteins are transcriptional regulators. Within the HSF family, PDSM is conserved between two functionally distinct members, HSF1 and HSF4b, whose transactivation capacities are repressed through the phosphorylation-dependent sumoylation. As the first recurrent sumoylation determinant beyond the consensus tetrapeptide, the PDSM provides a valuable tool in predicting new SUMO substrates.
Journal Article