Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
117
result(s) for
"Coin, Lachlan J. M."
Sort by:
MultiPhen: Joint Model of Multiple Phenotypes Can Increase Discovery in GWAS
by
Calboli, Federico C. F.
,
Elliott, Paul
,
Hoggart, Clive J.
in
Analysis
,
Autoimmune diseases
,
Biology
2012
The genome-wide association study (GWAS) approach has discovered hundreds of genetic variants associated with diseases and quantitative traits. However, despite clinical overlap and statistical correlation between many phenotypes, GWAS are generally performed one-phenotype-at-a-time. Here we compare the performance of modelling multiple phenotypes jointly with that of the standard univariate approach. We introduce a new method and software, MultiPhen, that models multiple phenotypes simultaneously in a fast and interpretable way. By performing ordinal regression, MultiPhen tests the linear combination of phenotypes most associated with the genotypes at each SNP, and thus potentially captures effects hidden to single phenotype GWAS. We demonstrate via simulation that this approach provides a dramatic increase in power in many scenarios. There is a boost in power for variants that affect multiple phenotypes and for those that affect only one phenotype. While other multivariate methods have similar power gains, we describe several benefits of MultiPhen over these. In particular, we demonstrate that other multivariate methods that assume the genotypes are normally distributed, such as canonical correlation analysis (CCA) and MANOVA, can have highly inflated type-1 error rates when testing case-control or non-normal continuous phenotypes, while MultiPhen produces no such inflation. To test the performance of MultiPhen on real data we applied it to lipid traits in the Northern Finland Birth Cohort 1966 (NFBC1966). In these data MultiPhen discovers 21% more independent SNPs with known associations than the standard univariate GWAS approach, while applying MultiPhen in addition to the standard approach provides 37% increased discovery. The most associated linear combinations of the lipids estimated by MultiPhen at the leading SNPs accurately reflect the Friedewald Formula, suggesting that MultiPhen could be used to refine the definition of existing phenotypes or uncover novel heritable phenotypes.
Journal Article
Whole genome deep sequencing analysis of cell-free DNA in samples with low tumour content
2022
Background
Circulating cell-free DNA (cfDNA) in the plasma of cancer patients contains cell-free tumour DNA (ctDNA) derived from tumour cells and it has been widely recognized as a non-invasive source of tumour DNA for diagnosis and prognosis of cancer. Molecular profiling of ctDNA is often performed using targeted sequencing or low-coverage whole genome sequencing (WGS) to identify tumour specific somatic mutations or somatic copy number aberrations (sCNAs). However, these approaches cannot efficiently detect all tumour-derived genomic changes in ctDNA.
Methods
We performed WGS analysis of cfDNA from 4 breast cancer patients and 2 patients with benign tumours. We sequenced matched germline DNA for all 6 patients and tumour samples from the breast cancer patients. All samples were sequenced on Illumina HiSeqXTen sequencing platform and achieved approximately 30x, 60x and 100x coverage on germline, tumour and plasma DNA samples, respectively.
Results
The mutational burden of the plasma samples (1.44 somatic mutations/Mb of genome) was higher than the matched tumour samples. However, 90% of high confidence somatic cfDNA variants were not detected in matched tumour samples and were found to comprise two background plasma mutational signatures. In contrast, cfDNA from the di-nucleosome fraction (300 bp–350 bp) had much higher proportion (30%) of variants shared with tumour. Despite high coverage sequencing we were unable to detect sCNAs in plasma samples.
Conclusions
Deep sequencing analysis of plasma samples revealed higher fraction of unique somatic mutations in plasma samples, which were not detected in matched tumour samples. Sequencing of di-nucleosome bound cfDNA fragments may increase recovery of tumour mutations from plasma.
Journal Article
Utilizing Nanopore direct RNA sequencing of blood from patients with sepsis for discovery of co- and post-transcriptional disease biomarkers
by
He, Jingni
,
Trevor, Sharon L.
,
Schlapbach, Luregn J.
in
Bacterial infections
,
Bias
,
Biological markers
2025
Background
RNA sequencing of whole blood has been increasingly employed to find transcriptomic signatures of disease states. These studies traditionally utilize short-read sequencing of cDNA, missing important aspects of RNA expression such as differential isoform abundance and poly(A) tail length variation.
Methods
We used Oxford Nanopore Technologies sequencing to sequence native mRNA extracted from whole blood from 12 patients with definite bacterial and viral sepsis and compared with results from matching Illumina short-read cDNA sequencing data. Additionally, we explored poly(A) tail length variation, novel transcript identification, and differential transcript usage.
Results
The correlation of gene count data between Illumina cDNA- and Nanopore RNA-sequencing strongly depended on the choice of analysis pipeline;
NanoCount
for Nanopore and
Kallisto
for Illumina data yielded the highest mean Pearson’s correlation of 0.927 at the gene level and 0.736 at the transcript isoform level. We identified 2 genes with differential polyadenylation, 9 genes with differential expression and 4 genes with differential transcript usage between bacterial and viral infection. Gene ontology gene set enrichment analysis of poly(A) tail length revealed enrichment of long tails in mRNA of genes involved in signaling and short tails in oxidoreductase molecular functions. Additionally, we detected 240 non-artifactual novel transcript isoforms.
Conclusions
Nanopore RNA- and Illumina cDNA-gene counts are strongly correlated, indicating that both platforms are suitable for discovery and validation of gene count biomarkers. Nanopore direct RNA-seq provides additional advantages by uncovering additional post- and co-transcriptional biomarkers, such as poly(A) tail length variation and transcript isoform usage.
Journal Article
Real-time resolution of short-read assembly graph using ONT long reads
by
Coin, Lachlan J. M.
,
Cao, Minh Duc
,
Nguyen, Son Hoang
in
Algorithms
,
Assembly
,
Biology and Life Sciences
2021
A streaming assembly pipeline utilising real-time Oxford Nanopore Technology (ONT) sequencing data is important for saving sequencing resources and reducing time-to-result. A previous approach implemented in npScarf provided an efficient streaming algorithm for hybrid assembly but was relatively prone to mis-assemblies compared to other graph-based methods. Here we present npGraph , a streaming hybrid assembly tool using the assembly graph instead of the separated pre-assembly contigs. It is able to produce more complete genome assembly by resolving the path finding problem on the assembly graph using long reads as the traversing guide. Application to synthetic and real data from bacterial isolate genomes show improved accuracy while still maintaining a low computational cost. npGraph also provides a graphical user interface (GUI) which provides a real-time visualisation of the progress of assembly. The tool and source code is available at https://github.com/hsnguyen/assembly .
Journal Article
Genetic Determinants of Height Growth Assessed Longitudinally from Infancy to Adulthood in the Northern Finland Birth Cohort 1966
2009
Recent genome-wide association (GWA) studies have identified dozens of common variants associated with adult height. However, it is unknown how these variants influence height growth during childhood. We derived peak height velocity in infancy (PHV1) and puberty (PHV2) and timing of pubertal height growth spurt from parametric growth curves fitted to longitudinal height growth data to test their association with known height variants. The study consisted of N = 3,538 singletons from the prospective Northern Finland Birth Cohort 1966 with genotype data and frequent height measurements (on average 20 measurements per person) from 0-20 years. Twenty-six of the 48 variants tested associated with adult height (p<0.05, adjusted for sex and principal components) in this sample, all in the same direction as in previous GWA scans. Seven SNPs in or near the genes HHIP, DLEU7, UQCC, SF3B4/SV2A, LCORL, and HIST1H1D associated with PHV1 and five SNPs in or near SOCS2, SF3B4/SV2A, C17orf67, CABLES1, and DOT1L with PHV2 (p<0.05). We formally tested variants for interaction with age (infancy versus puberty) and found biologically meaningful evidence for an age-dependent effect for the SNP in SOCS2 (p = 0.0030) and for the SNP in HHIP (p = 0.045). We did not have similar prior evidence for the association between height variants and timing of pubertal height growth spurt as we had for PHVs, and none of the associations were statistically significant after correction for multiple testing. The fact that in this sample, less than half of the variants associated with adult height had a measurable effect on PHV1 or PHV2 is likely to reflect limited power to detect these associations in this dataset. Our study is the first genetic association analysis on longitudinal height growth in a prospective cohort from birth to adulthood and gives grounding for future research on the genetic regulation of human height during different periods of growth.
Journal Article
Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly
2011
Identification of genomic structural variation from short-read sequencing data is typically accomplished by mapping reads to a reference genome. Li
et al
. show that
de novo
assembly of the reads followed by whole-genome alignment to the reference is a more comprehensive method that can also resolve complex rearrangements.
Here we use whole-genome
de novo
assembly of second-generation sequencing reads to map structural variation (SV) in an Asian genome and an African genome. Our approach identifies small- and intermediate-size homozygous variants (1–50 kb) including insertions, deletions, inversions and their precise breakpoints, and in contrast to other methods, can resolve complex rearrangements. In total, we identified 277,243 SVs ranging in length from 1–23 kb. Validation using computational and experimental methods suggests that we achieve overall <6% false-positive rate and <10% false-negative rate in genomic regions that can be assembled, which outperforms other methods. Analysis of the SVs in the genomes of 106 individuals sequenced as part of the 1000 Genomes Project suggests that SVs account for a greater fraction of the diversity between individuals than do single-nucleotide polymorphisms (SNPs). These findings demonstrate that whole-genome
de novo
assembly is a feasible approach to deriving more comprehensive maps of genetic variation.
Journal Article
Detection of Streptococcus pyogenes M1UK in Australia and characterization of the mutation driving enhanced expression of superantigen SpeA
2023
A new variant of
Streptococcus pyogenes
serotype M1 (designated ‘M1
UK
’) has been reported in the United Kingdom, linked with seasonal scarlet fever surges, marked increase in invasive infections, and exhibiting enhanced expression of the superantigen SpeA. The progenitor
S. pyogenes
‘M1
global
’ and M1
UK
clones can be differentiated by 27 SNPs and 4 indels, yet the mechanism for
speA
upregulation is unknown. Here we investigate the previously unappreciated expansion of M1
UK
in Australia, now isolated from the majority of serious infections caused by serotype M1
S. pyogenes
. M1
UK
sub-lineages circulating in Australia also contain a novel toxin repertoire associated with epidemic scarlet fever causing
S. pyogenes
in Asia. A single SNP in the 5’ transcriptional leader sequence of the transfer-messenger RNA gene
ssrA
drives enhanced SpeA superantigen expression as a result of
ssrA
terminator read-through in the M1
UK
lineage. This represents a previously unappreciated mechanism of toxin expression and urges enhanced international surveillance.
A variant of group A
Streptococcus
serotype M1 (UK) has been increasingly reported and can be differentiated from the global variant by its overexpression of the superantigen SpeA. Here, Davies et al probe the mechanism behind enhanced SpeA expression and demonstrate that a SNP in the 5’ leader sequence of
ssrA
is responsible for this virulence phenotype.
Journal Article
Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement
2018
Sweetpotato [
Ipomoea batatas
(L.) Lam.] is a globally important staple food crop, especially for sub-Saharan Africa. Agronomic improvement of sweetpotato has lagged behind other major food crops due to a lack of genomic and genetic resources and inherent challenges in breeding a heterozygous, clonally propagated polyploid. Here, we report the genome sequences of its two diploid relatives,
I. trifida
and
I. triloba
, and show that these high-quality genome assemblies are robust references for hexaploid sweetpotato. Comparative and phylogenetic analyses reveal insights into the ancient whole-genome triplication history of
Ipomoea
and evolutionary relationships within the Batatas complex. Using resequencing data from 16 genotypes widely used in African breeding programs, genes and alleles associated with carotenoid biosynthesis in storage roots are identified, which may enable efficient breeding of varieties with high provitamin A content. These resources will facilitate genome-enabled breeding in this important food security crop.
Sweetpotato is an important food security crop providing rich source of macro- and micronutrients including carbohydrates and vitamins. Here, the authors assemble of the two diploid relatives of cultivated sweetpotato and identify genes and alleles associated with carotenoid biosynthesis from breeding lines.
Journal Article
Rapid diagnosis of Capnocytophaga canimorsus septic shock in an immunocompetent individual using real-time Nanopore sequencing: a case report
by
Appleton, Sally
,
Graham, Rikki
,
Hajkowicz, Krispin
in
Alcoholism
,
Bacterial and fungal diseases
,
Bites (Injuries)
2019
Background
Rapid diagnosis and appropriate treatment is imperative in bacterial sepsis due increasing risk of mortality with every hour without appropriate antibiotic therapy. Atypical infections with fastidious organisms may take more than 4 days to diagnose leading to calls for improved methods for rapidly diagnosing sepsis.
Capnocytophaga canimorsus
is a slow-growing, fastidious gram-negative bacillus which is a common commensal within the mouths of dogs, but rarely cause infections in humans.
C. canimorsus
sepsis risk factors include immunosuppression, alcoholism and elderly age. Here we report on the application of emerging nanopore sequencing methods to rapidly diagnose an atypical case of
C. canimorsus
septic shock.
Case presentation
A 62 year-old female patient was admitted to an intensive care unit with septic shock and multi-organ failure six days after a reported dog bite. Blood cultures were unable to detect a pathogen after 3 days despite observed intracellular bacilli on blood smears. Real-time nanopore sequencing was subsequently employed on whole blood to detect
Capnocytophaga canimorsus
in 19 h. The patient was not immunocompromised and did not have any other known risk factors. Whole-genome sequencing of clinical sample and of the offending dog’s oral swabs showed near-identical
C. canimorsus
genomes. The patient responded to antibiotic treatment and was discharged from hospital 31 days after admission.
Conclusions
Use of real-time nanopore sequencing reduced the time-to-diagnosis of
Capnocytophaga canimorsus
in this case from 6.25 days to 19 h.
Capnocytophaga canimorsus
should be considered in cases of suspected sepsis involving cat or dog contact, irrespective of the patient’s known risk factors.
Journal Article
Scaffolding and completing genome assemblies in real-time with nanopore sequencing
by
Coin, Lachlan J. M.
,
Ganesamoorthy, Devika
,
Cooper, Matthew A.
in
631/114/2785/2302
,
631/208/212/2302
,
Algorithms
2017
Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing. Here we present npScarf, which can scaffold and complete short read assemblies while the long read sequencing run is in progress. It reports assembly metrics in real-time so the sequencing run can be terminated once an assembly of sufficient quality is obtained. In assembling four bacterial and one eukaryotic genomes, we show that npScarf can construct more complete and accurate assemblies while requiring less sequencing data and computational resources than existing methods. Our approach offers a time- and resource-effective strategy for completing short read assemblies.
Assembling genomes using currently available computational methods can be time consuming. Here, Coin and colleagues describe a bioinformatics tool named npScarf that can scaffold and complete an existing short read assembly in real-time using nanopore sequencing.
Journal Article