Blockchain

AMD Enhances AI Algorithm Efficiency with Innovative Depth Pruning Method

June 8, 2024

AMD, a leading semiconductor supplier, has made significant strides in optimizing hardware efficiency for artificial intelligence (AI) algorithms. According to AMD.com, the company’s latest research paper titled ‘A Unified Progressive Depth Pruner for CNN and Vision Transformer‘ has been accepted at the prestigious AAAI 2024 conference. This paper introduces a novel depth pruning method designed to enhance performance across various AI models.

Motivation for Model Optimization

Deep neural networks (DNNs) have become integral to various industrial applications, necessitating continuous model optimization. Techniques such as model pruning, quantization, and efficient model design are crucial in this context. Traditional channel-wise pruning methods face challenges with depth-wise convolutional layers due to sparse computation and fewer parameters. These methods also often struggle with high parallel computing demands, leading to suboptimal hardware utilization.

To address these issues, AMD’s research team proposed DepthShrinker and Layer-Folding techniques to optimize MobileNetV2 by reducing model depth through reparameterization. Despite their promise, these methods have limitations, such as potential accuracy loss and constraints with certain normalization layers like LayerNorm, making them unsuitable for vision transformer models.

Innovative Depth Pruning Approach

AMD’s new depth pruning method introduces a progressive training strategy and a novel block pruning technique that can optimize both CNN and vision transformer models. This approach ensures high utilization of baseline model weights, resulting in higher accuracy. Moreover, the method can handle existing normalization layers, including LayerNorm, enabling effective pruning of vision transformer models.

The AMD depth pruning strategy converts complex and slow blocks into simpler, faster blocks through block merging. This involves replacing activation layers with identity layers and LayerNorm layers with BatchNorm layers, facilitating reparameterization. The reparameterization technique then merges BatchNorm layers, adjacent convolutional or fully connected layers, and skip connections.

Key Technologies

The depth pruning process involves four main steps: Supernet training, Subnet searching, Subnet training, and Subnet merging. Initially, a Supernet is constructed based on the baseline model, incorporating block modifications. After Supernet training, an optimal subnet is identified using a search algorithm. The progressive training strategy is then applied to optimize the subnet with minimal accuracy loss. Finally, the subnet is merged into a shallower model using the reparameterization technique.

Benefits and Performance

AMD’s depth pruning method offers several key contributions:

A unified and efficient depth pruning method for CNN and vision transformer models.
A progressive training strategy for subnet optimization coupled with a novel block pruning strategy using reparameterization.
Comprehensive experiments demonstrating superior pruning performance across various AI models.

Experimental results show that AMD’s method achieves up to 1.26X speedup on the AMD Instinct™ MI100 GPU accelerator, with only a 1.9% top-1 accuracy drop. The approach has been tested on multiple models, including ResNet34, MobileNetV2, ConvNeXtV1, and DeiT-Tiny, showcasing its versatility and effectiveness.

In conclusion, AMD’s unified depth pruning method represents a significant advancement in optimizing AI model performance. Its applicability to both CNN and vision transformer models highlights its potential impact on future AI developments. AMD plans to explore further applications of this method on more transformer models and tasks.

Image source: Shutterstock

. . .

AMD Enhances AI Algorithm Efficiency with Innovative Depth Pruning Method

Motivation for Model Optimization

Innovative Depth Pruning Approach

Key Technologies

Benefits and Performance

Tags

LEAVE A REPLY Cancel reply

MOST POPULAR

Why Shiba Inu Whales are continuously buying more tokens?

Vitalik wants to burn the staked Ethereum of sanction complying validators

XRP up 83% in a week to hit $1 as Vantard’s...

Internet Computer (ICP) Bleeds, What Are The Chances For Reversal?

HOT NEWS

US Senate Suggests Blockchain Can Enhance National Security

UK Parliament Introduces Bill to Recognize Bitcoin and Crypto as Personal...

Bybit gets “in-principle” approval to begin operations in Kazakhstan

AI and on-chain chat for smart web3 interaction

EDITOR PICKS

Dogecoin Whales Go Ham As They Buy 560M DOGE In One...

Stablecoins Quietly Balloon by $14B in January — Who’s Leading the...

BONK Early Investor Who Also Predicted Shiba Inu Has Just Purchased...

POPULAR POSTS

The Best Cloud Mining Site for Passive Income in 2023

Kadena vs. Solana: Ultimate Comparison

How To Stake Polygon (MATIC) Using Ledger and MetaMask

POPULAR CATEGORY

EOS Network Unveils Spring 1.0 and Savanna: A Leap in Blockchain...

New Cryptocurrency Releases, Listings, & Presales Today – NodeStation AI, Obi...