Catalogue Search | MBRL

IUCNN – Deep learning approaches to approximate species' extinction risk

by Zizka, Alexander , Silvestro, Daniele , Andermann, Tobias in Artificial neural networks , assessments , automated assessment

2022

Aim The Red List (RL) from the International Union for the Conservation of Nature is the most comprehensive global quantification of extinction risk, and widely used in applied conservation as well as in biogeographic and ecological research. Yet, due to the time‐consuming assessment process, the RL is biased taxonomically and geographically, which limits its application on large scales, in particular for underdocumented areas such as the tropics, or understudied taxa, such as most plants and invertebrates. Here, we present IUCNN, an R‐package implementing deep learning models to predict species RL status from publicly available geographic occurrence records (and other data if available). Innovation We implement a user‐friendly workflow to train and validate neural network models, and use them to predict species RL status. IUCNN contains specific functions for extinction risk prediction in the RL framework, including a regression‐based approach to account for the ordinal nature of RL categories, a Bayesian approach for improved uncertainty quantification and a convolutional neural network to predict species RL status based on their raw geographic occurrences. Most analyses run with few lines of code, not requiring users to have prior experience with neural network models. We demonstrate the use of IUCNN on an empirical dataset of ~14,000 orchid species, for which IUCNN models can predict extinction risk within minutes, while outperforming comparable methods based on species occurrence information. Main conclusions IUCNN harnesses innovative methodology to estimate the RL status of large numbers of species. By providing estimates of the number and identity of threatened species in custom geographic or taxonomic datasets, IUCNN enables large‐scale automated assessments of the extinction risk of species so far not well represented on the official RL.

Journal Article

Share this book

Add to My Shelf

Rthoptera: Standardised insect bioacoustics in R

by Pijanowski, Bryan , Buzzetti, Filippo M. , Rivas, Francisco in acoustics , automated measurements , Bioacoustics

2025

Almost 100 years after the dawn of insect bioacoustics, freely available and robust analysis tools and protocols are still needed. We introduce Rthoptera, an open‐source R package designed for robust analysis of insect acoustic signals. It delivers accurate temporal and spectral measurements, along with intuitive visualisations, through a streamlined workflow. Interactive applications ensure accessibility for researchers across all levels of technical expertise. Four new acoustic metrics for discrete units are introduced: Spectral Excursion, Pattern Complexity Index, Temporal Excursions and Dynamic Excursion. A Broadband Activity Index with potential use in pest monitoring is also showcased. Accompanied by an appropriate recording protocol, our workflow can become part of a standard analysis protocol for insect bioacoustics. Resumen Casi 100 años después del inicio de la bioacústica de insectos, todavía se necesitan herramientas y protocolos de análisis robustos y de libre acceso. Presentamos Rthoptera, un paquete de R de código abierto diseñado para el análisis robusto de señales acústicas de insectos. Ofrece mediciones temporales y espectrales precisas, junto con visualizaciones intuitivas, mediante un flujo de trabajo optimizado. Las aplicaciones interactivas garantizan su accesibilidad para investigadores de todos los niveles de experiencia técnica. Se introducen cuatro nuevas métricas acústicas para unidades discretas: Excursión Espectral, Índice de Complejidad de Patrón, Excursión Temporal y Excursión Dinámica. También se presenta un Índice de Actividad de Banda Ancha con potencial uso en monitoreo de plagas. Acompañado de un protocolo de grabación adecuado, nuestro flujo de trabajo puede convertirse en parte de un protocolo estándar de análisis para bioacústica de insectos.

Journal Article

Share this book

Add to My Shelf

MoBPS - Modular Breeding Program Simulator

by Schlather, Martin , Pook, Torsten , Simianer, Henner in Animal sciences , Breeding of animals , Cell division

2020

The R-package MoBPS provides a computationally efficient and flexible framework to simulate complex breeding programs and compare their economic and genetic impact. Simulations are performed on the base of individuals. MoBPS utilizes a highly efficient implementation with bit-wise data storage and matrix multiplications from the associated R-package miraculix allowing to handle large scale populations. Individual haplotypes are not stored but instead automatically derived based on points of recombination and mutations. The modular structure of MoBPS allows to combine rather coarse simulations, as needed to generate founder populations, with a very detailed modeling of todays’ complex breeding programs, making use of all available biotechnologies. MoBPS provides pre-implemented functions for common breeding practices such as optimum genetic contributions and single-step GBLUP but also allows the user to replace certain steps with personalized and/or self-written solutions.

Journal Article

Share this book

Add to My Shelf

Filtering heart rates using data densities: The boxfilter R package

by Vetter, Sebastian G. , Arnold, Walter , Bieber, Claudia in Animals , Blood pressure , Body temperature

2024

Over the past decades, there has been a growing interest in long‐term heart rate records, especially from free‐living animals. Largely, this increase is because most of the metabolic activity of tissues is based on oxygen delivery by the heart. Therefore, heart rate has served as a proxy for energy expenditure in animals. However, heart rates or other physiological variables recorded in humans and animals using loggers often contain noise. False measurements are sometimes eliminated by hand or by filters that reject variables based on the shape or frequency of the signal. Occasionally, outliers are rejected because they occur a long distance from genuine data. We introduce an R package, boxfilter, which enables users to eliminate noise based on counting the number of close neighbours inside a gliding window. Depending on the cut‐off value chosen, a focal point with a low proportion of neighbours will be rejected as noise. All three parameters, namely window width and height, as well as the cut‐off value, can be computed automatically. The choice of the clip‐off value beyond which data points are discarded is crucial. The package boxfilter cannot, of course, solve problems caused by completely erroneous measurements. Like the human eye, this filter prefers points that are part of a pattern, such as a dense band, and rejects isolated values. The boxfilter may also be applied to other measures than heart rate that do not change instantaneously, such as body temperature, blood pressure or sleep parameters.

Journal Article

Share this book

Add to My Shelf

Analysing and mapping species range dynamics using occupancy models

by Lahoz-Monfort, José J. , Guillera-Arroita, Gurutzeta , Kéry, Marc in Animal and plant ecology , Animal, plant and microbial ecology , Annual variations

2013

Aim: Our aims are: (1) to highlight the power of dynamic occupancy models for analysing species range dynamics while accounting for imperfect detection; (2) to emphasize the flexibility to model effects of environmental covariates in the dynamics parameters (extinction and colonization probability); and (3) to illustrate the development of predictive maps of range dynamics by projecting estimated probabilities of occupancy, local extinction and colonization. Location: Switzerland. Methods: We used data from the Swiss breeding bird survey to model the Swiss range dynamics of the European crossbill (Loxia curvirostra) from 2000 to 2007. Within-season replicate surveys at each 1 km 2 sample unit allowed us to fit dynamic occupancy models that account for imperfect detection, and thus estimate the following processes underlying the observed range dynamics: local extinction, colonization and detection. For comparison, we also fitted a model variant where detection was assumed to be perfect. Results: All model parameters were affected by elevation, forest cover and elevation-by-forest cover interactions and exhibited substantial annual variation. Detection probability varied seasonally and among years, highlighting the need for its estimation. Projecting parameter estimates in environmental or geographical space is a powerful means of understanding what the model is telling about covariate relationships. Geographical maps were substantially different between the model where detection was estimated and that where it was not, emphasizing the importance of accounting for imperfect detection in studies of range dynamics, even for high-quality data. Main conclusions: The study of species range dynamics is among the most exciting avenues for species distribution modelling. Dynamic occupancy models offer a robust framework for doing so, by accounting for imperfect detection and directly modelling the effects of covariates on the parameters that govern distributional change. Mapping parameter estimates modelled by spatially indexed covariates is an under-used way to gain insights into dynamic species distributions.

Journal Article

Share this book

Add to My Shelf

hydrographr: An R package for scalable hydrographic data processing

by Domisch, Sami , Torres‐Cambas, Yusdiel , Grossart, Hans‐Peter in Aquatic ecosystems , Basins , Biodiversity

2023

Freshwater ecosystems are considered biodiversity hotspots, but assessing the spatial distribution of species remains challenging. One major obstacle lies in the complex geospatial processing of large amounts of data, such as stream network, sub‐catchment and basin data, that are necessary for addressing the longitudinal connectivity among water bodies. Workflows thus need to be scalable, especially when working across large spatial extents and at high spatial resolution. This in turn requires advanced command‐line GIS skills and programming language integration, which often poses a challenge for freshwater researchers. To address this challenge, we developed the package hydrographr that provides scalable hydrographic data processing in R. The package contains functions for downloading data of the high‐resolution Hydrography90m dataset, processing, reading and extracting information, as well as assessing network distances and connectivity. While the functions are, by default, tailored toward the Hydrography90m data, they can also be generalised toward other data and purposes, such as efficient cropping and merging of raster and vector data, point‐raster extraction, raster reclassification and data aggregation. The package depends on the open‐source software GDAL/OGR, GRASS‐GIS and the AWK programming language in the Linux environment, allowing a seamless language integration. Since the data is processed outside R, hydrographr allows creating scalable geo‐processing workflows. We illustrate the hydrographr functions using two workflows that focus on (i) a freshwater species distribution modelling approach, and (ii) assessing stream connectivity given the fragmentation by dams. We also provide a detailed guide for the initial installation of the required software. Windows users need to first enable the Windows Subsystem for Linux (WSL) feature, and can then follow the same software installation as Linux users. hydrographr is maintained on GitHub at https://github.com/glowabio/hydrographr. hydrographr provides a set of key functions for processing freshwater geospatial data. We expect that the package will support the freshwater‐related research communities given the easy‐to‐use wrapper functions that allow capitalizing on powerful open‐source command‐line software, which may otherwise require a steep learning curve. Users can thus perform large‐scale freshwater‐specific longitudinal connectivity and network analyses across large geographic extents while staying within the R environment.

Journal Article

Share this book

Add to My Shelf

Extent, impact, and mitigation of batch effects in tumor biomarker studies using tissue microarrays

by Mucci, Lorelei A , Penney, Kathryn L , Stopsack, Konrad H in Automation , batch effect , batchtma

2021

Tissue microarrays (TMAs) have been used in thousands of cancer biomarker studies. To what extent batch effects, measurement error in biomarker levels between slides, affects TMA-based studies has not been assessed systematically. We evaluated 20 protein biomarkers on 14 TMAs with prospectively collected tumor tissue from 1448 primary prostate cancers. In half of the biomarkers, more than 10% of biomarker variance was attributable to between-TMA differences (range, 1–48%). We implemented different methods to mitigate batch effects (R package batchtma ), tested in plasmode simulation. Biomarker levels were more similar between mitigation approaches compared to uncorrected values. For some biomarkers, associations with clinical features changed substantially after addressing batch effects. Batch effects and resulting bias are not an error of an individual study but an inherent feature of TMA-based protein biomarker studies. They always need to be considered during study design and addressed analytically in studies using more than one TMA. To understand cancer, researchers need to know which molecules tumor cells use. These so-called ‘biomarkers’ tag cancer cells as being different from healthy cells, and can be used to predict how aggressive a tumor may be, or how well it might respond to treatment. A popular technique for assessing biomarkers across multiple tumors is to use tissue microarrays. This involves taking samples from different tumors and embedding them in a block of wax, which is then cut into micro-thin slices and stained with reagents that can detect specific biomarkers, such as proteins. Each block contains hundreds of samples, which all experience the same conditions. So, any patterns detected in the staining are likely to represent real variations in the biomarkers present. Many cancer studies, however, often compare samples from multiple tissue microarrays, which may increase the risk of technical artifacts: for example, staining may look stronger in one batch of tissue samples than another, even though the amount of biomarker present in these different arrays is roughly the same. These ‘batch effects’ could potentially bias the results of the experiment and lead to the identification of misleading patterns. To evaluate how batch effects impact tissue microarray studies, Stopsack et al. examined 14 wax blocks which contained tumor samples from 1,448 men with prostate cancer. This revealed that for some biomarkers, but not others, there were noticeable differences between tissue microarrays that were clearly the result of batch effects. Stopsack et al. then tested six different ways of fixing these discrepancies using statistical methods. All six approaches were successful, even if the arrays included tumors with different characteristics, such as tumors that had been diagnosed more or less recently. This work highlights the importance of considering batch effects when using tissue microarrays to study cancer. Stopsack et al. have used their statistical approaches to develop freely available software which can reduce the biases that sometimes arise from these technical artifacts. This could help researchers avoid misleading patterns in their data and make it easier to detect real variations in the biomarkers present between tumor samples.

Journal Article

Share this book

Add to My Shelf

Cognitively Diagnostic Analysis Using the G-DINA Model in R

by Ma, Wenchao , Robitzsch, Alexander , Sorrel, Miguel A in Parameter estimation , Subject specialists

2021

Cognitive diagnosis models (CDMs) have increasingly been applied in education and other fields. This article provides an overview of a widely used CDM, namely, the G-DINA model, and demonstrates a hands-on example of using multiple R packages for a series of CDM analyses. This overview involves a step-by-step illustration and explanation of performing Q-matrix evaluation, CDM calibration, model fit evaluation, item diagnosticity investigation, classification reliability examination, and the result presentation and visualization. Some limitations of conducting CDM analysis in R are also discussed.

Journal Article

Share this book

Add to My Shelf

Interactive medical and safety monitoring in clinical trials with clinDataReview: a validated and open-source reporting tool

by Meyvisch, Paul , De Roeck, Arne , Pasetto, Michela in Biotechnology industry , clinDataReview R package , clinical data

2024

Continuous medical and safety monitoring of subject data during a clinical trial is a critical part of evaluating the safety of trial participants and as such is governed by protocol procedures and regulatory guidelines to meet the trial's intended objectives. We present an open-source validated graphical tool (clinDataReview R package) which provides access to the trial data with drill-down to individual patient profiles. The tool incorporates functionalities that facilitate detection of error and data inconsistencies requiring follow-up. It supports regular medical monitoring and oversight as well as safety monitoring committees with interactive tables and listings alongside graphical visualizations of the primary safety data in reports. An implementation example is given where the tool is used to deliver validated outputs following FDA/EMA guidelines. As such, this tool enables a more efficient, interactive, and reproducible review of safety data collected during an ongoing clinical trial.

Journal Article

Share this book

Add to My Shelf

CircSeqAlignTk: An R package for end-to-end analysis of RNA-seq data for circular genomes version 2; peer review: 1 approved, 2 approved with reservations

by Sun, Jianqiang , Fu, Xi , Cao, Wei in alignment , circular genome sequence , Circular RNA

2022

RNA sequencing (RNA-seq) technology has become one of the standard tools for studying biological mechanisms at the transcriptome level. Advances in RNA-seq technology have led to the development of numerous publicly available tools for RNA-seq data analysis. Most of these tools target linear genome sequences despite the necessity of studying organisms with circular genome sequences. For example, studying the infection mechanisms of viroids which comprise 246-401 nucleotides circular RNAs and target plants may prevent tremendous economic and agricultural damage. Unfortunately, using the available tools to construct workflows for the analysis of circular genome sequences is difficult, especially for non-bioinformaticians. To overcome this limitation, we present CircSeqAlignTk, an easy-to-use and richly documented R package. CircSeqAlignTk offers both command line and graphical user interfaces for end-to-end RNA-seq data analysis, spanning alignment to the visualisation of circular genome sequences, via a series of functions. Moreover, it includes a feature to generate synthetic sequencing data that mirrors real RNA-seq data from biological experiments. CircSeqAlignTk not only provides an easy-to-use analysis interface for novice users but also allows developers to evaluate the performance of alignment tools and new workflows.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter