Muon's confidence is well calibrated during training, but it becomes overconfident on new samples.

CoinWorld News reports that the Muon optimizer is well calibrated during training but overconfident on new samples. The paper "Too Sharp, Too Sure: When Calibration Follows Curvature" finds that models trained with Muon assess their confidence accurately on the training set, but on the test set their stated confidence exceeds their actual accuracy. In experiments on CIFAR-10 image classification, Muon's test expected calibration error (ECE) is 0.065, versus 0.061 for AdamW, 0.081 for SGD, and 0.020 for SAM; Muon's training ECE is nearly zero, indicating an especially large gap between training and test calibration. The proposed Calmo method reduces Muon's test ECE to 0.019, but it has not yet been validated on large language models. The DeepSeek V4 technical report notes that some modules still use AdamW, a reminder that Muon's behavior under generalization warrants continued monitoring.
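The ECE figures quoted above are presumably the standard binned expected calibration error: samples are grouped by predicted confidence, and the metric is the sample-weighted average gap between each bin's mean confidence and its actual accuracy. The summary does not give the paper's evaluation protocol, so the sketch below is a generic NumPy implementation; the bin count and the equal-width binning scheme are assumptions.

```python
import numpy as np

def expected_calibration_error(confidences, predictions, labels, n_bins=15):
    """Binned ECE: weighted average |accuracy - confidence| across bins.

    confidences: (n,) top-class probabilities in [0, 1]
    predictions: (n,) predicted class indices
    labels:      (n,) true class indices
    """
    bin_edges = np.linspace(0.0, 1.0, n_bins + 1)
    n = len(labels)
    ece = 0.0
    for lo, hi in zip(bin_edges[:-1], bin_edges[1:]):
        # Samples whose top-class confidence falls in this bin.
        mask = (confidences > lo) & (confidences <= hi)
        if not mask.any():
            continue
        acc = (predictions[mask] == labels[mask]).mean()
        conf = confidences[mask].mean()
        ece += (mask.sum() / n) * abs(acc - conf)
    return ece

# Example: probs is an (n, num_classes) array of softmax outputs on the test set.
# ece = expected_calibration_error(probs.max(axis=1), probs.argmax(axis=1), labels)
```

Under this definition, a near-zero ECE (as the summary attributes to Muon on the training set) means the model's confidence tracks its accuracy closely, while a test ECE of 0.065 means confidence overshoots accuracy by about 6.5 points on average.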
