Accelerating Edge AI Devices: Optimization Practices for Arm and Google AI Edge

AIMPACT News, May 15 (UTC+8): Arm’s second-generation Scalable Matrix Extension (SME2) has been integrated with the Google AI Edge software stack, turning CPUs into powerful matrix computation accelerators for high-performance on-device generative AI. This article uses Stability AI’s “stable-audio-open-small” model as an example to illustrate the automated “conversion, optimization, deployment” hardware acceleration workflow built with LiteRT, XNNPACK, and KleidiAI. On Arm-based mobile devices and laptops, the solution more than doubled audio generation speed and cut memory usage to one quarter, while maintaining high audio quality. The integration offers a practical path to running complex AI models efficiently on resource-constrained edge devices. (Source: AiHot)
