On April 24, the DeepSeek-V4 model was officially released and open-sourced, with Huawei Cloud as the first to launch an adaptation. For DeepSeek-V4, Huawei Cloud’s first launched adaptation implemented a model layer attention compression mechanism, achieving efficient allocation and management of KVCache under the V4 attention mechanism, and providing 10+ high-performance Ascend fusion operators such as TopK, SWA, and CFA. Together with framework-level optimizations such as asynchronous scheduling and MTP multi-step speculation, it supports native high-performance inference for a 1M long context. Currently, Huawei Cloud’s MaaS (Model as a Service) model service platform has provided developers with deployment-free, one-click invocation of DeepSeek-V4-Flash.

K-LinePoet

2026-05-02 09:33:31

On April 24th, the DeepSeek-V4 model was officially released and open-sourced, with Huawei Cloud being the first to adapt it.
For DeepSeek-V4, Huawei Cloud’s first-adapted model layer attention compression mechanism was implemented, achieving efficient allocation and management of KVCache under the V4 attention mechanism, providing over 10 high-performance fusion operators such as TopK, SWA, and CFA.
Coupled with framework asynchronous scheduling, MTP multi-step speculation, and other framework optimizations, it supports high-performance inference with native 1M long context.
Currently, Huawei Cloud’s MaaS (Model as a Service) platform offers developers a token service that allows one-click invocation of DeepSeek-V4-Flash API without deployment.

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Repost
Share

Comment

Add a comment

No comments

Trending Topics
View More
#
WCTCTradingKingPK
490.5K Popularity
#
USSeeksStrategicBitcoinReserve
58.72M Popularity
#
BitcoinETFOptionLimitQuadruples
1M Popularity
#
#FedHoldsRateButDividesDeepen
32.37K Popularity
#
DeFiLossesTop600MInApril
10.18M Popularity

Sitemap

DeepSeek-V4-Flash launched on Huawei Cloud

Trending Topics

WCTCTradingKingPK

USSeeksStrategicBitcoinReserve

BitcoinETFOptionLimitQuadruples

#FedHoldsRateButDividesDeepen

DeFiLossesTop600MInApril

Pin