Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Futures Kickoff
Get prepared for your futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to experience risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Launchpad
Be early to the next big token project
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
DeepSeek announces new architecture mHC, revolutionizing training stability with an innovative mapping method
On January 1st, DeepSeek released their latest technical paper, proposing an innovative approach to training large-scale language models. The paper centers around a new architecture called “Manifold Constrained Hyperconnectivity (mHC),” which leverages the fundamental mathematical concept of mappings. In the industry, this technology is attracting attention as a potential new direction for model development.
Challenges and Innovative Solutions in Hyperconnected Network Technology
Traditional hyperconnected network (HC) technology, while highly flexible, has faced serious issues during training. Specifically, the violation of the identity mapping property has led to two major problems: training instability and scalability limitations. These issues have posed significant barriers to developing large-scale models.
The mHC architecture introduced by DeepSeek offers an innovative solution to these challenges. The research team successfully restored the lost identity mapping property by mapping the residual connection space of HC onto a specific manifold. This creative mapping technique is said to significantly improve the fundamental stability of the model.
Technical Innovation through Manifold Mapping and Scalability Enhancement
The key feature of the mHC architecture is its ability to deliver excellent performance while maintaining efficiency through precise infrastructure optimization. Unlike conventional simple residual connection approaches, this architecture utilizes the properties of complex manifolds for mapping processes, enabling more sophisticated training procedures.
This technological innovation is expected to dramatically improve training stability and significantly enhance model scalability. According to PA News, DeepSeek’s research team anticipates that this mHC architecture will serve as a practical and effective extension tool in the development of large-scale models.
New Understanding of Topology Architecture Design and Future Outlook
This paper was co-authored by researchers Zhenda Xie, Yixuan Wei, and Huanqi Cao, with Wenfeng Liang, the founder of DeepSeek, also participating as an author. The research team states that the development of this mHC architecture has deepened their understanding of topology architecture design.
This approach, incorporating complex mapping processes and manifold concepts, suggests a promising direction for the evolution of foundational models. Industry experts are paying close attention to this technology’s potential role in next-generation AI model development, and future applications are highly anticipated.