Asset Details
Generating Multimodal Images with GAN: Integrating Text, Image, and Style (Paper)
by Qi, Zhen; Xiang, Ao; Tan, Chaoyi; Shih, Kowei; Zhang, Wenqing; Li, Xinshi
in Computer vision / Generative adversarial networks / Image processing / Image quality
2025
Overview
Multimodal image generation has become a research hotspot in computer vision, particularly the task of integrating text, image, and style. In this study, we propose a multimodal image generation method based on Generative Adversarial Networks (GANs) that effectively combines text descriptions, reference images, and style information to generate images satisfying multimodal requirements. The method comprises a text encoder, an image feature extractor, and a style integration module, ensuring that the generated images maintain high quality in both visual content and style consistency. We also introduce multiple loss functions, namely adversarial loss, text-image consistency loss, and style matching loss, to optimize the generation process. Experimental results show that our method produces images with high clarity and consistency across multiple public datasets, with significant performance improvements over existing methods. This study offers new insights into multimodal image generation and shows broad potential for application.
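The abstract names three loss terms combined during training: an adversarial loss, a text-image consistency loss, and a style matching loss. Below is a minimal NumPy sketch of how such a composite generator objective might be assembled. The specific distance functions (cosine distance for consistency, Gram-matrix MSE for style) and the weights `lam_txt` and `lam_sty` are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

def adversarial_loss(d_fake):
    # Non-saturating generator loss: -log D(G(z)), averaged over the batch.
    return float(-np.log(np.clip(d_fake, 1e-8, 1.0)).mean())

def consistency_loss(text_emb, img_emb):
    # Cosine distance between the text embedding and the
    # embedding of the generated image (assumed loss choice).
    t = text_emb / np.linalg.norm(text_emb)
    i = img_emb / np.linalg.norm(img_emb)
    return 1.0 - float(t @ i)

def style_loss(gram_gen, gram_ref):
    # Mean squared difference of Gram matrices, a common proxy
    # for style statistics (assumed loss choice).
    return float(((gram_gen - gram_ref) ** 2).mean())

def total_generator_loss(d_fake, text_emb, img_emb, gram_gen, gram_ref,
                         lam_txt=1.0, lam_sty=10.0):
    # Weighted sum of the three terms; lam_txt and lam_sty are
    # hypothetical trade-off weights.
    return (adversarial_loss(d_fake)
            + lam_txt * consistency_loss(text_emb, img_emb)
            + lam_sty * style_loss(gram_gen, gram_ref))
```

When the discriminator is fully fooled, the text and image embeddings align, and the style statistics match, all three terms vanish and the total loss is zero.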
Publisher
Cornell University Library, arXiv.org