Archtree: on-the-fly tree-structured exploration for latency-aware pruning of deep neural networks
by Bailly, Kevin; Reboul, Rémi Ouazan; Dapogny, Arnaud; Yvinec, Edouard
in Artificial neural networks / Budgets / Channels / Computer vision / Hardware / Model accuracy / Network latency / Neural networks / Pruning
2023
Paper
Overview
Deep neural networks (DNNs) have become ubiquitous in addressing a number of problems, particularly in computer vision. However, DNN inference is computationally intensive, which can be prohibitive e.g. when considering edge devices. To solve this problem, a popular solution is DNN pruning, and more so structured pruning, where coherent computational blocks (e.g. channels for convolutional networks) are removed: as an exhaustive search of the space of pruned sub-models is intractable in practice, channels are typically removed iteratively based on an importance estimation heuristic. Recently, promising latency-aware pruning methods were proposed, where channels are removed until the network reaches a target budget of wall-clock latency pre-emptively estimated on specific hardware. In this paper, we present Archtree, a novel method for latency-driven structured pruning of DNNs. Archtree explores multiple candidate pruned sub-models in parallel in a tree-like fashion, allowing for a better exploration of the search space. Furthermore, it involves on-the-fly latency estimation on the target hardware, accounting for closer latencies as compared to the specified budget. Empirical results on several DNN architectures and target hardware show that Archtree better preserves the original model accuracy while better fitting the latency budget as compared to existing state-of-the-art methods.
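The overview describes exploring several pruned candidates in parallel, tree-fashion, and checking latency as the search proceeds. The toy sketch below illustrates that general idea only; it is not the authors' algorithm. It assumes a beam-style search over per-layer channel counts, and every name in it (`tree_prune`, `toy_latency`, `beam_width`, the importance scores) is hypothetical. In particular, `toy_latency` is a synthetic stand-in for a real measurement on the target hardware.

```python
from typing import Callable, Dict, List, Tuple

Widths = Tuple[int, ...]  # number of channels kept in each layer


def toy_latency(widths: Widths) -> float:
    """Synthetic stand-in for an on-device timing: cost grows with adjacent widths."""
    return float(sum(a * b for a, b in zip(widths, widths[1:])))


def tree_prune(
    importance: List[List[float]],
    budget: float,
    measure_latency: Callable[[Widths], float] = toy_latency,
    beam_width: int = 3,
) -> Tuple[Widths, float]:
    """Return (widths, importance lost) for the first candidate meeting the budget."""
    # Rank each layer's channels so the least important one is removed first.
    ranked = [sorted(layer) for layer in importance]
    start: Widths = tuple(len(layer) for layer in ranked)
    frontier = [(0.0, start)]  # entries are (importance lost so far, widths)

    while True:
        # Measure every live candidate, least damaged first ("on the fly").
        for lost, widths in sorted(frontier):
            if measure_latency(widths) <= budget:
                return widths, lost

        # Branch: each candidate spawns one child per layer it can still shrink.
        children: Dict[Widths, float] = {}
        for lost, widths in frontier:
            for i, w in enumerate(widths):
                if w <= 1:
                    continue  # keep at least one channel per layer
                already_removed = len(ranked[i]) - w
                child = widths[:i] + (w - 1,) + widths[i + 1:]
                cost = lost + ranked[i][already_removed]
                if child not in children or cost < children[child]:
                    children[child] = cost

        if not children:
            raise ValueError("latency budget cannot be reached")

        # Keep only the best few children so the tree stays tractable.
        frontier = sorted((c, w) for w, c in children.items())[:beam_width]
```

For example, `tree_prune([[0.9, 0.1, 0.5], [0.2, 0.8, 0.3, 0.7]], budget=6.0)` starts from widths `(3, 4)` and stops at `(2, 3)`, the least damaged candidate whose toy latency fits the budget. Setting `beam_width=1` reduces the search to a greedy, single-path pruning loop, which is the kind of iterative baseline the overview contrasts against.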
Publisher
Cornell University Library, arXiv.org