Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion
by Pal, Christopher; Zhi Hao Luo; Gosselin, Anthony; Jolicoeur-Martineau, Alexia; Ge Ya Luo
in Accuracy / Boxes / Conditioning / Controllability / Frames (data processing) / Object motion
2024
Paper
Overview
Controllable video generation has attracted significant attention, largely due to advances in video diffusion models. In domains such as autonomous driving, it is essential to predict object motion with high accuracy. This paper tackles the crucial challenge of exerting precise control over object motion for realistic video synthesis. To accomplish this, we 1) control object movements using bounding boxes and extend this control to renderings of 2D or 3D boxes in pixel space, 2) employ a distinct, specialized model to forecast the trajectories of object bounding boxes based on their previous and, if desired, future positions, and 3) adapt and enhance a separate video diffusion network to create video content based on these high-quality trajectory forecasts. Our method, Ctrl-V, leverages modified and fine-tuned Stable Video Diffusion (SVD) models for both trajectory and video generation. Extensive experiments conducted on the KITTI, Virtual-KITTI 2, BDD100k, and nuScenes datasets validate the effectiveness of our approach in producing realistic and controllable video generation.
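To make the idea of bounding boxes rendered in pixel space concrete, the following is a minimal illustrative sketch and not the paper's implementation. It assumes a single 2D box given for the first and last frame, and it uses plain linear interpolation as a naive stand-in for the learned trajectory model that the overview describes. The names `interpolate_boxes` and `render_box_frame` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels, inclusive.
Box = Tuple[int, int, int, int]


def interpolate_boxes(first: Box, last: Box, num_frames: int) -> List[Box]:
    """Linearly interpolate one box from `first` to `last`.

    Assumption: this is only a naive placeholder for a learned
    bounding-box trajectory predictor.
    """
    if num_frames < 2:
        raise ValueError("num_frames must be at least 2")
    boxes = []
    for t in range(num_frames):
        alpha = t / (num_frames - 1)  # 0.0 at the first frame, 1.0 at the last
        boxes.append(tuple(round(a + alpha * (b - a)) for a, b in zip(first, last)))
    return boxes


def render_box_frame(box: Box, height: int, width: int) -> List[List[int]]:
    """Rasterize the outline of one box into a binary height x width frame."""
    frame = [[0] * width for _ in range(height)]
    x0, y0, x1, y1 = box
    # Clip coordinates to the frame; edges that fall outside the frame
    # are drawn on the frame border.
    x0, x1 = max(0, x0), min(width - 1, x1)
    y0, y1 = max(0, y0), min(height - 1, y1)
    if x0 > x1 or y0 > y1:
        return frame  # box lies entirely outside the frame
    for x in range(x0, x1 + 1):  # top and bottom edges
        frame[y0][x] = 1
        frame[y1][x] = 1
    for y in range(y0, y1 + 1):  # left and right edges
        frame[y][x0] = 1
        frame[y][x1] = 1
    return frame


if __name__ == "__main__":
    trajectory = interpolate_boxes((0, 0, 2, 2), (4, 4, 6, 6), num_frames=3)
    for box in trajectory:
        frame = render_box_frame(box, height=8, width=8)
        print(box)
        print("\n".join("".join("#" if v else "." for v in row) for row in frame))
```

A sequence of such box frames is one plausible form of pixel-space conditioning signal; how Ctrl-V actually encodes and consumes the rendered boxes is described in the paper itself.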
Publisher
Cornell University Library, arXiv.org