Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
LanguageLanguage
-
SubjectSubject
-
Item TypeItem Type
-
DisciplineDiscipline
-
YearFrom:-To:
-
More FiltersMore FiltersIs Peer Reviewed
Done
Filters
Reset
124
result(s) for
"Neumann, Pavel"
Sort by:
Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2
2020
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at
https://repeatexplorer-elixir.cerit-sc.cz
.
RepeatExplorer is a software tool for repeat identification and quantification using unassembled sequencing reads. The authors describe four pipelines implemented on the Galaxy platform, highlighting different applications.
Journal Article
Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data
by
Novák, Petr
,
Macas, Jiří
,
Neumann, Pavel
in
Algorithms
,
Bioinformatics
,
Biomedical and Life Sciences
2010
Background
The investigation of plant genome structure and evolution requires comprehensive characterization of repetitive sequences that make up the majority of higher plant nuclear DNA. Since genome-wide characterization of repetitive elements is complicated by their high abundance and diversity, novel approaches based on massively-parallel sequencing are being adapted to facilitate the analysis. It has recently been demonstrated that the low-pass genome sequencing provided by a single 454 sequencing reaction is sufficient to capture information about all major repeat families, thus providing the opportunity for efficient repeat investigation in a wide range of species. However, the development of appropriate data mining tools is required in order to fully utilize this sequencing data for repeat characterization.
Results
We adapted a graph-based approach for similarity-based partitioning of whole genome 454 sequence reads in order to build clusters made of the reads derived from individual repeat families. The information about cluster sizes was utilized for assessing the proportion and composition of repeats in the genomes of two model species,
Pisum sativum
and
Glycine max
, differing in genome size and 454 sequencing coverage. Moreover, statistical analysis and visual inspection of the topology of the cluster graphs using a newly developed program tool,
SeqGrapheR
, were shown to be helpful in distinguishing basic types of repeats and investigating sequence variability within repeat families.
Conclusions
Repetitive regions of plant genomes can be efficiently characterized by the presented graph-based analysis and the graph representation of repeats can be further used to assess the variability and evolutionary divergence of repeat families, discover and characterize novel elements, and aid in subsequent assembly of their consensus sequences.
Journal Article
Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification
by
Neumann, Pavel
,
Novák, Petr
,
Hoštáková, Nina
in
Analysis
,
Animal Genetics and Genomics
,
Anopheles
2019
Background
Plant LTR-retrotransposons are classified into two superfamilies, Ty1/copia and Ty3/gypsy. They are further divided into an enormous number of families which are, due to the high diversity of their nucleotide sequences, usually specific to a single or a group of closely related species. Previous attempts to group these families into broader categories reflecting their phylogenetic relationships were limited either to analyzing a narrow range of plant species or to analyzing a small numbers of elements. Furthermore, there is no reference database that allows for similarity based classification of LTR-retrotransposons.
Results
We have assembled a database of retrotransposon encoded polyprotein domains sequences extracted from 5410 Ty1/copia elements and 8453 Ty3/gypsy elements sampled from 80 species representing major groups of green plants (Viridiplantae). Phylogenetic analysis of the three most conserved polyprotein domains (RT, RH and INT) led to dividing Ty1/copia and Ty3/gypsy retrotransposons into 16 and 14 lineages respectively. We also characterized various features of LTR-retrotransposon sequences including additional polyprotein domains, extra open reading frames and primer binding sites, and found that the occurrence and/or type of these features correlates with phylogenies inferred from the three protein domains.
Conclusions
We have established an improved classification system applicable to LTR-retrotransposons from a wide range of plant species. This system reflects phylogenetic relationships as well as distinct sequence and structural features of the elements. A comprehensive database of retrotransposon protein domains (REXdb) that reflects this classification provides a reference for efficient and unified annotation of LTR-retrotransposons in plant genomes. Access to REXdb related tools is implemented in the RepeatExplorer web server (
https://repeatexplorer-elixir.cerit-sc.cz/
) or using a standalone version of REXdb that can be downloaded seaparately from RepeatExplorer web page (
http://repeatexplorer.org/
).
Journal Article
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae
2015
The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Journal Article
Holocentromeres can consist of merely a few megabase-sized satellite arrays
2023
The centromere is the chromosome region where microtubules attach during cell division. In contrast to monocentric chromosomes with one centromere, holocentric species usually distribute hundreds of centromere units along the entire chromatid. We assembled the chromosome-scale reference genome and analyzed the holocentromere and (epi)genome organization of the lilioid
Chionographis japonica
. Remarkably, each of its holocentric chromatids consists of only 7 to 11 evenly spaced megabase-sized centromere-specific histone H3-positive units. These units contain satellite arrays of 23 and 28 bp-long monomers capable of forming palindromic structures. Like monocentric species,
C. japonica
forms clustered centromeres in chromocenters at interphase. In addition, the large-scale eu- and heterochromatin arrangement differs between
C. japonica
and other known holocentric species. Finally, using polymer simulations, we model the formation of prometaphase line-like holocentromeres from interphase centromere clusters. Our findings broaden the knowledge about centromere diversity, showing that holocentricity is not restricted to species with numerous and small centromere units.
The centromere is required for the segregation of chromosomes. Here, the authors report the organization and dynamic of holocentric chromosomes, each containing 7 to 11 evenly spaced megabase-sized centromere-specific histone H3-positive units.
Journal Article
Repeat-based holocentromeres of the woodrush Luzula sylvatica reveal insights into the evolutionary transition to holocentricity
2024
In most studied eukaryotes, chromosomes are monocentric, with centromere activity confined to a single region. However, the rush family (Juncaceae) includes species with both monocentric (
Juncus
) and holocentric (
Luzula
) chromosomes, where centromere activity is distributed along the entire chromosome length. Here, we combine chromosome-scale genome assembly, epigenetic analysis, immuno-FISH and super-resolution microscopy to study the transition to holocentricity in
Luzula sylvatica
. We report repeat-based holocentromeres with an irregular distribution of features along the chromosomes.
Luzula sylvatica
holocentromeres are predominantly associated with two satellite DNA repeats (
Lusy1
and
Lusy2
), while CENH3 also binds satellite-free gene-poor regions. Comparative repeat analysis suggests that
Lusy1
plays a crucial role in centromere function across most
Luzula
species. Furthermore, synteny analysis between
L. sylvatica
(
n
= 6) and
Juncus effusus
(
n
= 21) suggests that holocentric chromosomes in
Luzula
could have arisen from chromosome fusions of ancestral monocentric chromosomes, accompanied by the expansion of CENH3-associated satellite repeats.
Unlike most eukaryotes,
Luzula sylvatica
assembles holocentromeres along the entire chromosome length. Here, the authors combine genome assembly, epigenetic and cytogenetic analyses to reveal holocentricity involves the fusion of small monocentric chromosomes and the expansion of CENH3-interacting satellite DNA repeats.
Journal Article
Repeatless and Repeat-Based Centromeres in Potato: Implications for Centromere Evolution
by
Wu, Yufeng
,
Buell, C. Robin
,
Torres, Giovana A.
in
Base Sequence
,
Centromere - genetics
,
Centromeres
2012
Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single-or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains.
Journal Article
Impact of parasitic lifestyle and different types of centromere organization on chromosome and genome evolution in the plant genus Cuscuta
2021
• The parasitic genus Cuscuta (Convolvulaceae) is exceptional among plants with respect to centromere organization, including both monocentric and holocentric chromosomes, and substantial variation in genome size and chromosome number. We investigated 12 species representing the diversity of the genus in a phylogenetic context to reveal the molecular and evolutionary processes leading to diversification of their genomes.
• We measured genome sizes and investigated karyotypes and centromere organization using molecular cytogenetic techniques. We also performed low-pass whole genome sequencing and comparative analysis of repetitive DNA composition.
• A remarkable 102-fold variation in genome sizes (342–34 734 Mbp/1C) was detected for monocentric Cuscuta species, while genomes of holocentric species were of moderate sizes (533–1545 Mbp/1C). The genome size variation was primarily driven by the differential accumulation of LTR-retrotransposons and satellite DNA. The transition to holocentric chromosomes in the subgenus Cuscuta was associated with loss of histone H2A phosphorylation and elimination of centromeric retrotransposons. In addition, basic chromosome number of holocentric species (x = 7) was smaller than in monocentrics (x = 15 or 16).
• We demonstrated that the transition to holocentricity in Cuscuta was accompanied by significant changes in epigenetic marks, chromosome number and the repetitive DNA sequence composition.
Journal Article
Assembly of the 81.6 Mb centromere of pea chromosome 6 elucidates the structure and evolution of metapolycentric chromosomes
by
Ávila Robledillo, Laura
,
Biology Centre of the Czech Academy of Sciences (BIOLOGY CENTRE CAS) ; Czech Academy of Sciences [Prague] (CAS)
,
Neumann, Pavel
in
Analysis
,
Biology and Life Sciences
,
Cell cycle
2023
Centromeres in the legume genera Pisum and Lathyrus exhibit unique morphological characteristics, including extended primary constrictions and multiple separate domains of centromeric chromatin. These so-called metapolycentromeres resemble an intermediate form between monocentric and holocentric types, and therefore provide a great opportunity for studying the transitions between different types of centromere organizations. However, because of the exceedingly large and highly repetitive nature of metapolycentromeres, highly contiguous assemblies needed for these studies are lacking. Here, we report on the assembly and analysis of a 177.6 Mb region of pea (Pisum sativum) chromosome 6, including the 81.6 Mb centromere region (CEN6) and adjacent chromosome arms. Genes, DNA methylation profiles, and most of the repeats were uniformly distributed within the centromere, and their densities in CEN6 and chromosome arms were similar. The exception was an accumulation of satellite DNA in CEN6, where it formed multiple arrays up to 2 Mb in length. Centromeric chromatin, characterized by the presence of the CENH3 protein, was predominantly associated with arrays of three different satellite repeats; however, five other satellites present in CEN6 lacked CENH3. The presence of CENH3 chromatin was found to determine the spatial distribution of the respective satellites during the cell cycle. Finally, oligo-FISH painting experiments, performed using probes specifically designed to label the genomic regions corresponding to CEN6 in Pisum, Lathyrus, and Vicia species, revealed that metapolycentromeres evolved via the expansion of centromeric chromatin into neighboring chromosomal regions and the accumulation of novel satellite repeats. However, in some of these species, centromere evolution also involved chromosomal translocations and centromere repositioning.
Journal Article
Satellite DNA in Vicia faba is characterized by remarkable diversity in its sequence composition, association with centromeres, and replication timing
2018
Satellite DNA, a class of repetitive sequences forming long arrays of tandemly repeated units, represents substantial portions of many plant genomes yet remains poorly characterized due to various methodological obstacles. Here we show that the genome of the field bean (
Vicia faba
, 2n = 12), a long-established model for cytogenetic studies in plants, contains a diverse set of satellite repeats, most of which remained concealed until their present investigation. Using next-generation sequencing combined with novel bioinformatics tools, we reconstructed consensus sequences of 23 novel satellite repeats representing 0.008–2.700% of the genome and mapped their distribution on chromosomes. We found that in addition to typical satellites with monomers hundreds of nucleotides long,
V
.
faba
contains a large number of satellite repeats with unusually long monomers (687–2033 bp), which are predominantly localized in pericentromeric regions. Using chromatin immunoprecipitation with CenH3 antibody, we revealed an extraordinary diversity of centromeric satellites, consisting of seven repeats with chromosome-specific distribution. We also found that in spite of their different nucleotide sequences, all centromeric repeats are replicated during mid-S phase, while most other satellites are replicated in the first part of late S phase, followed by a single family of FokI repeats representing the latest replicating chromatin.
Journal Article