Catalogue Search | MBRL
19 result(s) for "Fadlelmola, Faisal"
Ten simple rules for organizing a webinar series
2019
Biosciences eastern and central Africa (BecA-ILRI Hub), International Livestock Research Institute, Nairobi, Kenya Affiliation: South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville 7535, Cape Town, South Africa ORCID logo http://orcid.org/0000-0002-8282-1325 Oussema Souiai Affiliations Laboratory of BioInformatics Biomathematics and bioStatistics, Institut Pasteur de SalTunis, Tunis, Tunisia, Institut supérieur des technologies médicales, Université Tunis al Manar, Tunis, Tunisia Nicola Mulder Affiliation: Computational Biology Division, Department of Integrative Biomedical Sciences, University of Cape Town, Cape Town, South Africa ORCID logo http://orcid.org/0000-0003-4905-0941 H3ABioNet Research working group as members of the H3Africa Consortium ¶Membership list of the H3ABioNet Research working group can be found in the Acknowledgments section. [...]choosing webinar themes requires mapping of the target audience needs and interests [7]. [...]the earlier a regular webinar date and time is settled upon, the earlier it will make its way into attendees’ calendars thereby enabling them to avoid subsequent scheduling conflicts.
The webinar coordination team assists with all the planning and logistics for hosting a webinar (Rule 1); Choosing webinar themes requires mapping of the target audience needs and interests (Rule 2); Drafting a webinar planning checklist through regular planning meetings as well as post-webinar meetings (Rule 3); Decentralized webinar organization of tasks and resources through accessible shared space (Rule 4); Planning early and settling on the provisional dates and times of the webinar events along with their themes (Rule 5); Choosing and settling on a convenient and user-friendly webinar platform (Rule 6); Approaching and confirming potential speakers (Rule 7); Obtaining the webinar title, abstract and presenter’s biography for creating the webinar announcement through emails and social media channels (Rule 8); Allocating time for the platform orientation (Rule 9); and Keeping close track of webinar metrics for regular assessment and evaluation (Rule 10).
Journal Article
Managing genomic variant calling workflows with Swift/T
2019
Bioinformatics research is frequently performed using complex workflows with multiple steps, fans, merges, and conditionals. This complexity makes management of the workflow difficult on a computer cluster, especially when running in parallel on large batches of data: hundreds or thousands of samples at a time. Scientific workflow management systems could help with that. Many are now being proposed, but is there yet the "best" workflow management system for bioinformatics? Such a system would need to satisfy numerous, sometimes conflicting requirements: from ease of use, to seamless deployment at peta- and exa-scale, and portability to the cloud. We evaluated Swift/T as a candidate for such a role by implementing a primary genomic variant calling workflow in the Swift/T language, focusing on workflow management, performance and scalability issues that arise from production-grade big data genomic analyses. In the process we introduced novel features into the language, which are now part of its open repository. Additionally, we formalized a set of design criteria for quality, robust, maintainable workflows that must function at-scale in a production setting, such as a large genomic sequencing facility or a major hospital system. The use of Swift/T conveys two key advantages. (1) It operates transparently in multiple cluster scheduling environments (PBS Torque, SLURM, Cray aprun environment, etc.), thus a single workflow is trivially portable across numerous clusters. (2) The leaf functions of Swift/T permit developers to easily swap executables in and out of the workflow, which makes it easy to maintain and to request resources optimal for each stage of the pipeline. While Swift/T's data-level parallelism eliminates the need to code parallel analysis of multiple samples, it does make debugging more difficult, as is common for implicitly parallel code.
Nonetheless, the language gives users a powerful and portable way to scale up analyses in many computing architectures. The code for our implementation of a variant calling workflow using Swift/T can be found on GitHub at https://github.com/ncsa/Swift-T-Variant-Calling, with full documentation provided at http://swift-t-variant-calling.readthedocs.io/en/latest/.
Journal Article
Design considerations for workflow management systems use in production genomics research and the clinic
by Heldenbrand, Jacob R.; Burra, Prakruthi; Allen, Joshua M.
in 631/114; 631/114/1314; 631/114/1751
2021
The changing landscape of genomics research and clinical practice has created a need for computational pipelines capable of efficiently orchestrating complex analysis stages while handling large volumes of data across heterogeneous computational environments. Workflow Management Systems (WfMSs) are the software components employed to fill this gap. This work provides an approach and systematic evaluation of key features of popular bioinformatics WfMSs in use today: Nextflow, CWL, and WDL and some of their executors, along with Swift/T, a workflow manager commonly used in high-scale physics applications. We employed two use cases: a variant-calling genomic pipeline and a scalability-testing framework, where both were run locally, on an HPC cluster, and in the cloud. This allowed for evaluation of those four WfMSs in terms of language expressiveness, modularity, scalability, robustness, reproducibility, interoperability, ease of development, along with adoption and usage in research labs and healthcare settings. This article is trying to answer: which WfMS should be chosen for a given bioinformatics application regardless of analysis type? The choice of a given WfMS is a function of both its intrinsic language and engine features. Within bioinformatics, where analysts are a mix of dry and wet lab scientists, the choice is also governed by collaborations and adoption within large consortia and technical support provided by the WfMS team/community. As the community and its needs continue to evolve along with computational infrastructure, WfMSs will also evolve, especially those with permissive licenses that allow commercial use. In much the same way as the dataflow paradigm and containerization are now well understood to be very useful in bioinformatics applications, we will continue to see innovations of tools and utilities for other purposes, like big data technologies, interoperability, and provenance.
Journal Article
Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics
2018
Background
The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African countries. H3ABioNet is part of the Human Heredity and Health in Africa program (H3Africa), an African-led research consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous African computing environments. Processing and analysis of genomic data is an example of a big data application requiring complex interdependent data analysis workflows. Such bioinformatics workflows take the primary and secondary input data through several computationally-intensive processing steps using different software packages, where some of the outputs form inputs for other steps. Implementing scalable, reproducible, portable and easy-to-use workflows is particularly challenging.
Results
H3ABioNet has built four workflows to support (1) the calling of variants from high-throughput sequencing data; (2) the analysis of microbial populations from 16S rDNA sequence data; (3) genotyping and genome-wide association studies; and (4) single nucleotide polymorphism imputation. A week-long hackathon was organized in August 2016 with participants from six African bioinformatics groups, and US and European collaborators. Two of the workflows are built using the Common Workflow Language framework (CWL) and two using Nextflow. All the workflows are containerized for improved portability and reproducibility using Docker, and are publicly available for use by members of the H3Africa consortium and the international research community.
Conclusion
The H3ABioNet workflows have been implemented in view of offering ease of use for the end user and high levels of reproducibility and portability, all while following modern state of the art bioinformatics data processing protocols. The H3ABioNet workflows will service the H3Africa consortium projects and are currently in use. All four workflows are also publicly available for research scientists worldwide to use and adapt for their respective needs. The H3ABioNet workflows will help develop bioinformatics capacity and assist genomics research within Africa and serve to increase the scientific output of H3Africa and its Pan-African Bioinformatics Network.
Journal Article
The Extent and Impact of Variation in ADME Genes in Sub-Saharan African Populations
2021
Introduction: Investigating variation in genes involved in the absorption, distribution, metabolism, and excretion (ADME) of drugs is key to characterizing pharmacogenomic (PGx) relationships. ADME gene variation is relatively well characterized in European and Asian populations, but African populations are under-studied, which has implications for drug safety and effective use in Africa. Results: We identified significant ADME gene variation in African populations using data from 458 high-coverage whole genome sequences, 412 of which are novel, and from previously available African sequences from the 1000 Genomes Project. ADME variation was not uniform across African populations, particularly within high impact coding variation. Copy number variation was detected in 116 ADME genes, with equal ratios of duplications/deletions. We identified 930 potential high impact coding variants, of which most are discrete to a single African population cluster. Large frequency differences (i.e., >10%) were seen in common high impact variants between clusters. Several novel variants are predicted to have a significant impact on protein structure, but additional functional work is needed to confirm the outcome of these for PGx use. Most variants of known clinical outcome are rare in Africa compared to European populations, potentially reflecting a clinical PGx research bias to European populations. Discussion: The genetic diversity of ADME genes across sub-Saharan African populations is large. The Southern African population cluster is most distinct from that of far West Africa. PGx strategies based on European variants will be of limited use in African populations. Although established variants are important, PGx must take into account the full range of African variation.
This work urges further characterization of variants in African populations, including in vitro and in silico studies, and consideration of the unique African ADME landscape when developing precision medicine guidelines and tools for African populations.
Journal Article
Developing Clinical Phenotype Data Collection Standards for Research in Africa
2023
Modern biomedical research is characterised by its high-throughput and interdisciplinary nature. Multiproject and consortium-based collaborations requiring meaningful analysis of multiple heterogeneous phenotypic datasets have become the norm; however, such analysis remains a challenge in many regions across the world. An increasing number of data harmonisation efforts are being undertaken by multistudy collaborations through either prospective standardised phenotype data collection or retrospective phenotype harmonisation. In this regard, the Phenotype Harmonisation Working Group (PHWG) of the Human Heredity and Health in Africa (H3Africa) consortium aimed to facilitate phenotype standardisation by promoting the use of existing data collection standards (hosted by PhenX), adapting existing data collection standards for appropriate use in low- and middle-income regions such as Africa, and developing novel data collection standards where relevant gaps were identified. Ultimately, the PHWG produced 11 data collection kits, consisting of 82 protocols, 38 of which were existing protocols, 17 were adapted, and 27 were novel protocols. The data collection kits will facilitate phenotype standardisation and harmonisation not only in Africa but also across the larger research community. In addition, the PHWG aims to feed back adapted and novel protocols to existing reference platforms such as PhenX.
Journal Article
Constitutively overexpressed 21 kDa protein in Hodgkin lymphoma and aggressive non-Hodgkin lymphomas identified as cytochrome B5b (CYB5B)
by Zhou, Minglong; Banerjee, Diponkar; Fadlelmola, Faisal M
in Amino Acid Sequence; Antibodies, Neoplasm - metabolism; B-Lymphocytes - metabolism
2010
Background
We have previously reported a novel constitutively overexpressed 21 kDa protein in Hodgkin Lymphoma (HL) and aggressive Non-Hodgkin Lymphomas (NHL). The objective of the current study was to 1) identify this protein using two independent methods, 2) study the expression of the protein and its encoding mRNA in reactive lymph nodes, normal lymphocytes and CD34+ bone marrow precursor cells, 3) analyse patterns of expression of the protein in tissue microarrays assembled from a large number of diagnostic clinical biopsies from patients with HL, and 4) determine the copy number variation and mutation status of the encoding gene in HL cell lines.
Results
Peptide sequencing by LC-MS/MS and protein identification by protein array screening identified a single protein, CYB5B. No mutations were detected in the CYB5B gene in HL cell lines. Quantitative PCR showed CYB5B gene expression was increased in HL and NHL cell lines. Array CGH using a submegabase resolution tiling array revealed gains in the CYB5B locus in HL cell lines KMH2 and L428. Membrane expression was seen in Reed-Sternberg cells in clinical biopsies from patients with HL but not in reactive lymph nodes. Bone marrow CD34+ precursor cells were CYB5B negative on the cell surface. RT-PCR assays of RNA extracted from T and B cell enriched fractions obtained from normal peripheral blood mononuclear cells, reactive lymph nodes, tonsils and normal bone marrow samples showed no evidence of increased mRNA levels of CYB5B in comparison to housekeeping gene GAPDH.
Conclusions
The 21 kDa protein overexpressed in HL and aggressive NHL is identical to CYB5B. CYB5B gene expression is increased in a subset of HL and NHL cell lines tested. This is associated with CYB5B gene amplification in HL cell lines KMH2 and L428. CYB5B may be a potential target for antibody-based therapy of HL and aggressive NHL as although cytoplasmic expression is present in reactive lymphocytes, it is not expressed on the cell surface of non-neoplastic lymphocytes or bone marrow precursor cells.
Journal Article
African Genomic Medicine Portal: A Web Portal for Biomedical Applications
2022
Genomics data are currently being produced at unprecedented rates, resulting in increased knowledge discovery and submission to public data repositories. Despite these advances, genomic information on African-ancestry populations remains significantly low compared with European- and Asian-ancestry populations. This information is typically segmented across several different biomedical data repositories, which often lack sufficient fine-grained structure and annotation to account for the diversity of African populations, leading to many challenges related to the retrieval, representation and findability of such information. To overcome these challenges, we developed the African Genomic Medicine Portal (AGMP), a database that contains metadata on genomic medicine studies conducted on African-ancestry populations. The metadata is curated from two public databases related to genomic medicine, PharmGKB and DisGeNET. The metadata retrieved from these source databases were limited to genomic variants that were associated with disease aetiology or treatment in the context of African-ancestry populations. Over 2000 variants relevant to populations of African ancestry were retrieved. Subsequently, domain experts curated and annotated additional information associated with the studies that reported the variants, including geographical origin, ethnolinguistic group, level of association significance and other relevant study information, such as study design and sample size, where available. The AGMP functions as a dedicated resource through which to access African-specific information on genomics as applied to health research, through querying variants, genes, diseases and drugs. The portal and its corresponding technical documentation, implementation code and content are publicly available.
Journal Article