Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Instant3D: Instant Text-to-3D Generation
by
Yan, Shuicheng
, Li, Ming
, Keppo, Jussi
, Lin, Min
, Xu, Xiangyu
, Zhou, Pan
, Liu, Jia-Wei
in
Adaptive algorithms
/ Computer vision
/ Convergence
/ Efficiency
/ Qualitative research
/ Quantitative analysis
/ Training
2024
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Instant3D: Instant Text-to-3D Generation
by
Yan, Shuicheng
, Li, Ming
, Keppo, Jussi
, Lin, Min
, Xu, Xiangyu
, Zhou, Pan
, Liu, Jia-Wei
in
Adaptive algorithms
/ Computer vision
/ Convergence
/ Efficiency
/ Qualitative research
/ Quantitative analysis
/ Training
2024
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Journal Article
Instant3D: Instant Text-to-3D Generation
2024
Request Book From Autostore
and Choose the Collection Method
Overview
Text-to-3D generation has attracted much attention from the computer vision community. Existing methods mainly optimize a neural field from scratch for each text prompt, relying on heavy and repetitive training cost which impedes their practical deployment. In this paper, we propose a novel framework for fast text-to-3D generation, dubbed Instant3D. Once trained, Instant3D is able to create a 3D object for an unseen text prompt in less than one second with a single run of a feedforward network. We achieve this remarkable speed by devising a new network that directly constructs a 3D triplane from a text prompt. The core innovation of our Instant3D lies in our exploration of strategies to effectively inject text conditions into the network. In particular, we propose to combine three key mechanisms: cross-attention, style injection, and token-to-plane transformation, which collectively ensure precise alignment of the output with the input text. Furthermore, we propose a simple yet effective activation function, the scaled-sigmoid, to replace the original sigmoid function, which speeds up the training convergence by more than ten times. Finally, to address the Janus (multi-head) problem in 3D generation, we propose an adaptive Perp-Neg algorithm that can dynamically adjust its concept negation scales according to the severity of the Janus problem during training, effectively reducing the multi-head effect. Extensive experiments on a wide variety of benchmark datasets demonstrate that the proposed algorithm performs favorably against the state-of-the-art methods both qualitatively and quantitatively, while achieving significantly better efficiency. The code, data, and models are available at https://ming1993li.github.io/Instant3DProj/.
Publisher
Springer Nature B.V
Subject
This website uses cookies to ensure you get the best experience on our website.