Research on the Mechanism of Disconnection Between Agent Cognition and Action in Tool Use

AIMPACT, May 17 (UTC+8): This interpretability paper studies tool-use agents. By probing hidden states, it finds that models often recognize internally when a tool should be called yet fail to make the call, with mismatch rates of 26% to 54%. The problem lies entirely in the transition from cognition to action, not in cognition itself. The internal detection direction can be decoded, but final-token mechanisms in later layers rotate the signal until it is almost orthogonal to the direction of the generated action. The research aims to predict how effective interventions will be, and it points out that common attributions such as insufficient prompting or training may overlook the geometric structure of later layers. This offers a plausible explanation for the performance ceiling seen in A/B tests of tool-use prompts. (Source: AiHot)
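To make the two quantities in this summary concrete, here is a minimal Python sketch, not the paper's code. It assumes the mismatch rate is the share of cases where the model internally recognizes the need for a tool but does not call it (the summary does not give the exact definition), and it uses made-up toy vectors to show what "almost orthogonal" means in terms of cosine similarity. The names `mismatch_rate`, `cosine`, `detection_dir`, and `action_dir` are hypothetical.

```python
import math

def mismatch_rate(recognized, called):
    """Share of cases where a tool need was recognized but no call was made.

    Assumed definition for illustration; the paper's metric may differ.
    """
    outcomes = [c for r, c in zip(recognized, called) if r]
    if not outcomes:
        return 0.0
    return sum(1 for c in outcomes if not c) / len(outcomes)

def cosine(u, v):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Toy data (invented): four recognized cases, two of which led to a call.
recognized = [True, True, True, True, False]
called = [True, False, True, False, False]
rate = mismatch_rate(recognized, called)

# Toy directions (invented): a cosine near 0 means the directions are
# almost orthogonal, so the decoded signal barely aligns with the action.
detection_dir = [1.0, 0.0, 0.0]
action_dir = [0.05, 1.0, 0.0]
similarity = cosine(detection_dir, action_dir)
```

In this toy setup a probe can still read the detection signal perfectly, while its projection onto the action direction is close to zero, which is the kind of gap between cognition and action the summary describes.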