Catalogue Search | MBRL

Two-View Mammogram Synthesis from Single-View Data Using Generative Adversarial Networks

by Asumi Yamazaki , Takayuki Ishida in Biology (General) , Breast cancer , Chemistry

2022

While two-view mammography taking both mediolateral-oblique (MLO) and cranio-caudual (CC) views is the current standard method of examination in breast cancer screening, single-view mammography is still being performed in some countries on women of specific ages. The rate of cancer detection is lower with single-view mammography than for two-view mammography, due to the lack of available image information. The goal of this work is to improve single-view mammography’s ability to detect breast cancer by providing two-view mammograms from single projections. The synthesis of novel-view images from single-view data has recently been achieved using generative adversarial networks (GANs). Here, we apply complete representation GAN (CR-GAN), a novel-view image synthesis model, aiming to produce CC-view mammograms from MLO views. Additionally, we incorporate two adaptations—the progressive growing (PG) technique and feature matching loss—into CR-GAN. Our results show that use of the PG technique reduces the training time, while the synthesized image quality is improved when using feature matching loss, compared with the method using only CR-GAN. Using the proposed method with the two adaptations, CC views similar to real views are successfully synthesized for some cases, but not all cases; in particular, image synthesis is rarely successful when calcifications are present. Even though the image resolution and quality are still far from clinically acceptable levels, our findings establish a foundation for further improvements in clinical applications. As the first report applying novel-view synthesis in medical imaging, this work contributes by offering a methodology for two-view mammogram synthesis.

Journal Article

Share this book

Add to My Shelf

Cyclic 2.5D Perceptual Loss for Cross‐Modal 3D Medical Image Synthesis: T1w MRI to Tau PET

by Jang, Ikbeom , Moon, Junho , Kim, Symac in 3D image translation , Aged , Alzheimer Disease - diagnostic imaging

2026

Positron emission tomography (PET) provides an in vivo molecular marker for various diseases, including Alzheimer's disease and related dementias (ADRD). PET has become increasingly integrated into diagnostic decision‐making, disease staging, and clinical trial enrichment. However, its widespread use remains constrained by high costs, government regulations, and the invasiveness of radiotracer injection. Modern diagnostic frameworks emphasize the importance of multimodal biomarker assessment, such as the “amyloid/tau/neurodegeneration” (A/T/N) framework for Alzheimer's disease; however, they are constrained by these barriers. Medical image synthesis or translation offers a potential solution by enabling the reconstruction of unavailable modalities. The clinical utility of PET depends on accurately capturing regional uptake patterns rather than exact voxel‐wise intensities, motivating the use of perceptual loss functions to assess higher‐level semantic features in generative models. While 2D, 3D, and 2.5D perceptual losses are utilized in 3D synthesis, each encounters challenges, including limited volumetric context, the scarcity of pretrained 3D models, and difficulty balancing optimization across anatomical planes. In this work, we address cross‐modal synthesis of tau PET from structural magnetic resonance imaging (MRI), generating 3D pseudo‐[18F]flortaucipir standardized uptake value ratio (SUVR) maps from 3D T1‐weighted MR images. We propose a cyclic 2.5D perceptual loss that cyclically optimizes the axial, coronal, and sagittal planes over training phases, thereby enhancing volumetric consistency. Furthermore, we standardize PET SUVRs by scanner manufacturer, reducing inter‐manufacturer variability and better preserving high‐uptake regions. We evaluate the proposed approach on cohorts spanning the ADRD spectrum using data from the Alzheimer's Disease Neuroimaging Initiative and the Standardized Centralized Alzheimer's Disease and Related Dementias Neuroimaging cohort. Our approach is broadly applicable across various generative frameworks and achieves high quantitative and qualitative performance on diverse architectures, including U‐Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix. Notably, it achieves better agreement between synthesized SUVRs and measured PET scans in key brain regions relevant to Alzheimer‐type tau pathology. The code is publicly available at https://github.com/labhai/Cyclic‐2.5D‐Perceptual‐Loss.

Journal Article

Share this book

Add to My Shelf

Text-Guided Customizable Image Synthesis and Manipulation

by Zhiqiang Zhang , Jinjia Zhou , Wei Weng in artificially controllable image synthesis , artificially controllable image synthesis; image manipulation; generative adversarial networks , Authenticity

2022

Due to the high flexibility and conformity to people’s usage habits, text description has been widely used in image synthesis research recently and has achieved many encouraging results. However, the text can only determine the basic content of the generated image and cannot determine the specific shape of the synthesized object, which leads to poor practicability. More importantly, the current text-to-image synthesis research cannot use new text descriptions to further modify the synthesis result. To solve these problems, this paper proposes a text-guided customizable image synthesis and manipulation method. The proposed method synthesizes the corresponding image based on the text and contour information at first. It then modifies the synthesized content based on the new text to obtain a satisfactory result. The text and contour information in the proposed method determine the specific content and object shape of the desired composite image, respectively. Aside from that, the input text, contour, and subsequent new text for content modification can be manually input, which significantly improves the artificial controllability in the image synthesis process, making the entire method superior to other methods in flexibility and practicability. Experimental results on the Caltech-UCSD Birds-200-2011 (CUB) and Microsoft Common Objects in Context (MS COCO) datasets demonstrate our proposed method’s feasibility and versatility.

Journal Article

Share this book

Add to My Shelf

Proposal to Improve the Image Quality of Short-Acquisition Time-Dedicated Breast Positron Emission Tomography Using the Pix2pix Generative Adversarial Network

by Mori, Mio , Imokawa, Tomoki , Tateishi, Ukihide in artificial intelligence , Breast cancer , Breasts

2022

This study aimed to evaluate the ability of the pix2pix generative adversarial network (GAN) to improve the image quality of low-count dedicated breast positron emission tomography (dbPET). Pairs of full- and low-count dbPET images were collected from 49 breasts. An image synthesis model was constructed using pix2pix GAN for each acquisition time with training (3776 pairs from 16 breasts) and validation data (1652 pairs from 7 breasts). Test data included dbPET images synthesized by our model from 26 breasts with short acquisition times. Two breast radiologists visually compared the overall image quality of the original and synthesized images derived from the short-acquisition time data (scores of 1–5). Further quantitative evaluation was performed using a peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). In the visual evaluation, both readers revealed an average score of >3 for all images. The quantitative evaluation revealed significantly higher SSIM (p < 0.01) and PSNR (p < 0.01) for 26 s synthetic images and higher PSNR for 52 s images (p < 0.01) than for the original images. Our model improved the quality of low-count time dbPET synthetic images, with a more significant effect on images with lower counts.

Journal Article

Share this book

Add to My Shelf

SynthStrip: skull-stripping for any brain image

by Fischl, Bruce , Hoffmann, Malte , Mora, Jocelyn S. in Accuracy , Adult , Algorithms

2022

•SynthStrip is a universal, learning-based skull-stripping utility.•Cutting-edge performance across imaging modality, resolution, and subject population.•Use of synthetic training data alleviates the dependency on acquisition specifics.•Released both as an open-source standalone tool and as a FreeSurfer command line utility.•Publicly available collection of evaluation images and ground-truth brain masks. The removal of non-brain signal from magnetic resonance imaging (MRI) data, known as skull-stripping, is an integral component of many neuroimage analysis streams. Despite their abundance, popular classical skull-stripping methods are usually tailored to images with specific acquisition properties, namely near-isotropic resolution and T1-weighted (T1w) MRI contrast, which are prevalent in research settings. As a result, existing tools tend to adapt poorly to other image types, such as stacks of thick slices acquired with fast spin-echo (FSE) MRI that are common in the clinic. While learning-based approaches for brain extraction have gained traction in recent years, these methods face a similar burden, as they are only effective for image types seen during the training procedure. To achieve robust skull-stripping across a landscape of imaging protocols, we introduce SynthStrip, a rapid, learning-based brain-extraction tool. By leveraging anatomical segmentations to generate an entirely synthetic training dataset with anatomies, intensity distributions, and artifacts that far exceed the realistic range of medical images, SynthStrip learns to successfully generalize to a variety of real acquired brain images, removing the need for training data with target contrasts. We demonstrate the efficacy of SynthStrip for a diverse set of image acquisitions and resolutions across subject populations, ranging from newborn to adult. We show substantial improvements in accuracy over popular skull-stripping baselines – all with a single trained model. Our method and labeled evaluation data are available at https://w3id.org/synthstrip.

Journal Article

Share this book

Add to My Shelf

OASIS: Only Adversarial Supervision for Semantic Image Synthesis

by Schönfeld, Edgar , Zhang, Dan , Gall, Juergen in Datasets , Discriminators , Generative adversarial networks

2022

Despite their recent successes, generative adversarial networks (GANs) for semantic image synthesis still suffer from poor image quality when trained with only adversarial supervision. Previously, additionally employing the VGG-based perceptual loss has helped to overcome this issue, significantly improving the synthesis quality, but at the same time limited the progress of GAN models for semantic image synthesis. In this work, we propose a novel, simplified GAN model, which needs only adversarial supervision to achieve high quality results. We re-design the discriminator as a semantic segmentation network, directly using the given semantic label maps as the ground truth for training. By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity and with a better alignment to their input label maps, making the use of the perceptual loss superfluous. Furthermore, we enable high-quality multi-modal image synthesis through global and local sampling of a 3D noise tensor injected into the generator, which allows complete or partial image editing. We show that images synthesized by our model are more diverse and follow the color and texture distributions of real images more closely. We achieve a strong improvement in image synthesis quality over prior state-of-the-art models across the commonly used ADE20K, Cityscapes, and COCO-Stuff datasets using only adversarial supervision. In addition, we investigate semantic image synthesis under severe class imbalance and sparse annotations, which are common aspects in practical applications but were overlooked in prior works. To this end, we evaluate our model on LVIS, a dataset originally introduced for long-tailed object recognition. We thereby demonstrate high performance of our model in the sparse and unbalanced data regimes, achieved by means of the proposed 3D noise and the ability of our discriminator to balance class contributions directly in the loss function. Our code and pretrained models are available at https://github.com/boschresearch/OASIS.

Journal Article

Share this book

Add to My Shelf

Unsupervised MR harmonization by learning disentangled representations using information bottleneck theory

by Mowry, Ellen M. , Resnick, Susan M. , Dewey, Blake E. in Algorithms , Anatomy , Disentangle

2021

•Unsupervised MR harmonization without traveling subjects.•Unified latent space for MR contrast synthesis.•A novel framework for disentangling contrast and anatomy in MR images.•Downstream segmentation consistency shows significant improvements after harmonization. In magnetic resonance (MR) imaging, a lack of standardization in acquisition often causes pulse sequence-based contrast variations in MR images from site to site, which impedes consistent measurements in automatic analyses. In this paper, we propose an unsupervised MR image harmonization approach, CALAMITI (Contrast Anatomy Learning and Analysis for MR Intensity Translation and Integration), which aims to alleviate contrast variations in multi-site MR imaging. Designed using information bottleneck theory, CALAMITI learns a globally disentangled latent space containing both anatomical and contrast information, which permits harmonization. In contrast to supervised harmonization methods, our approach does not need a sample population to be imaged across sites. Unlike traditional unsupervised harmonization approaches which often suffer from geometry shifts, CALAMITI better preserves anatomy by design. The proposed method is also able to adapt to a new testing site with a straightforward fine-tuning process. Experiments on MR images acquired from ten sites show that CALAMITI achieves superior performance compared with other harmonization approaches.

Journal Article

Share this book

Add to My Shelf

Image synthesis: a review of methods, datasets, evaluation metrics, and future outlook

in Artificial intelligence , Classification , Community research

2023

Image synthesis is a process of converting the input text, sketch, or other sources, i.e., another image or mask, into an image. It is an important problem in the computer vision field, where it has attracted the research community to attempt to solve this challenge at a high level to generate photorealistic images. Different techniques and strategies have been employed to achieve this purpose. Thus, the aim of this paper is to provide a comprehensive review of various image synthesis models covering several aspects. First, the image synthesis concept is introduced. We then review different image synthesis methods divided into three categories: image generation from text, sketch, and other inputs, respectively. Each sub-category is introduced under the proper category based upon the general framework to provide a broad vision of all existing image synthesis methods. Next, brief details of the benchmarked datasets used in image synthesis are discussed along with specifying the image synthesis models that leverage them. Regarding the evaluation, we summarize the metrics used to evaluate the image synthesis models. Moreover, a detailed analysis based on the evaluation metrics of the results of the introduced image synthesis is provided. Finally, we discuss some existing challenges and suggest possible future research directions.

Journal Article

Share this book

Add to My Shelf

Uncertainty modelling in deep learning for safer neuroimage enhancement: Demonstration in diffusion MRI

by Kaden, Enrico , Grussu, Francesco , Worrall, Daniel E. in Anisotropy , Bayesian analysis , Brain - diagnostic imaging

2021

•Proposes methods for modelling different types of uncertainty that arise in deep learning (DL) applications for image enhancement problems.•Demonstrates in dMRI super-resolution tasks that modelling uncertainty enhances the safety of DL-based enhancement system by bringing two categories of practical benefits:(1) “performance improvement”: e.g., the generalisation to out-of-distribution data, robustness to noise and outliers (Section 4.3)(2) “reliability assessment of prediction”: e.g., certification of performance based on uncertainty-thresholding (Section 4.4.1); detection of unfamiliar structures and understanding the sources of uncertainty (Section 4.4.2).•Provide a comprehensive set of experiments in a diverse set of datasets, which vary in demographics, scanner types, acquisition protocols or pathology.•The methods are in theory applicable to many other imaging modalities and data enhancement applications.•Codes will be available on Github. Deep learning (DL) has shown great potential in medical image enhancement problems, such as super-resolution or image synthesis. However, to date, most existing approaches are based on deterministic models, neglecting the presence of different sources of uncertainty in such problems. Here we introduce methods to characterise different components of uncertainty, and demonstrate the ideas using diffusion MRI super-resolution. Specifically, we propose to account for intrinsic uncertainty through a heteroscedastic noise model and for parameter uncertainty through approximate Bayesian inference, and integrate the two to quantify predictive uncertainty over the output image. Moreover, we introduce a method to propagate the predictive uncertainty on a multi-channelled image to derived scalar parameters, and separately quantify the effects of intrinsic and parameter uncertainty therein. The methods are evaluated for super-resolution of two different signal representations of diffusion MR images—Diffusion Tensor images and Mean Apparent Propagator MRI—and their derived quantities such as mean diffusivity and fractional anisotropy, on multiple datasets of both healthy and pathological human brains. Results highlight three key potential benefits of modelling uncertainty for improving the safety of DL-based image enhancement systems. Firstly, modelling uncertainty improves the predictive performance even when test data departs from training data (“out-of-distribution” datasets). Secondly, the predictive uncertainty highly correlates with reconstruction errors, and is therefore capable of detecting predictive “failures”. Results on both healthy subjects and patients with brain glioma or multiple sclerosis demonstrate that such an uncertainty measure enables subject-specific and voxel-wise risk assessment of the super-resolved images that can be accounted for in subsequent analysis. Thirdly, we show that the method for decomposing predictive uncertainty into its independent sources provides high-level “explanations” for the model performance by separately quantifying how much uncertainty arises from the inherent difficulty of the task or the limited training examples. The introduced concepts of uncertainty modelling extend naturally to many other imaging modalities and data enhancement applications.

Journal Article

Share this book

Add to My Shelf

Similarity and quality metrics for MR image-to-image translation

by Baltruschat, Ivo M. , Truong, Tuan , Dohmen, Melanie in 639/705/117 , 692/700/1421/1628 , Algorithms

2025

Image-to-image translation can create large impact in medical imaging, as images can be synthetically transformed to other modalities, sequence types, higher resolutions or lower noise levels. To ensure patient safety, these methods should be validated by human readers, which requires a considerable amount of time and costs. Quantitative metrics can effectively complement such studies and provide reproducible and objective assessment of synthetic images. If a reference is available, the similarity of MR images is frequently evaluated by SSIM and PSNR metrics, even though these metrics are not or too sensitive regarding specific distortions. When reference images to compare with are not available, non-reference quality metrics can reliably detect specific distortions, such as blurriness. To provide an overview on distortion sensitivity, we quantitatively analyze 11 similarity (reference) and 12 quality (non-reference) metrics for assessing synthetic images. We additionally include a metric on a downstream segmentation task. We investigate the sensitivity regarding 11 kinds of distortions and typical MR artifacts, and analyze the influence of different normalization methods on each metric and distortion. Finally, we derive recommendations for effective usage of the analyzed similarity and quality metrics for evaluation of image-to-image translation models.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter