Hugging Face CEO Clem Delangue announces the official launch of Kernels on the Hub, optimizing GPU operators to accelerate inference and training by 1.7 to 2.5 times, and simplifying the installation process. Kernels Hub will move compilation to the cloud, supporting multiple hardware types and various operator versions. Currently, there are 61 pre-compiled operators compatible with Hugging Face's inference framework.

MeNews

2026-04-15 05:21:18

Abstract generation in progress

ME News Report, April 15 (UTC+8), according to 1M AI News monitoring, Hugging Face CEO Clem Delangue announced that Kernels is officially launched on the Hub. GPU operators are low-level optimized codes that push graphics cards to their limits, capable of accelerating inference and training by 1.7 to 2.5 times, but installation has always been a nightmare: taking FlashAttention as the most common example, local compilation requires about 96GB of memory and several hours; even slight mismatches in PyTorch or CUDA versions cause errors, causing most developers to get stuck at this step. Kernels Hub moves compilation to the cloud. Hugging Face pre-compiles operators across various graphics cards and system environments; developers write a single line of code, and the Hub automatically matches the hardware environment, downloading pre-compiled files that are ready to use within seconds. Multiple different versions of operators can be loaded in the same process, compatible with torch.compile. Kernels was tested and launched in June last year, and this month was upgraded to a first-level repository on the Hub, alongside Models, Datasets, and Spaces. Currently, there are 61 pre-compiled operators covering common scenarios such as attention mechanisms, normalization, mixture-of-experts routing, and quantization, supporting four hardware acceleration platforms: NVIDIA CUDA, AMD ROCm, Apple Metal, and Intel XPU. It has been integrated into Hugging Face’s inference framework TGI and Transformers library. (Source: BlockBeats)

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Repost
Share

Comment

Add a comment

No comments

Trending Topics
View More
#
GatePreIPOsLaunchesWithSpaceX
111.09K Popularity
#
GateMarchTransparencyReport
35.99K Popularity
#
GoldmanSachsFilesBitcoinIncomeETF
770.96K Popularity
#
USBlocksStraitofHormuz
743.9K Popularity
#
WCTCTradingChallengeShare8MUSDT
500.97K Popularity

Sitemap

Hugging Face officially launches Kernels, GPU operators ready to use with just one line of code like a model

Trending Topics

GatePreIPOsLaunchesWithSpaceX

GateMarchTransparencyReport

GoldmanSachsFilesBitcoinIncomeETF

USBlocksStraitofHormuz

WCTCTradingChallengeShare8MUSDT

Pin