연구2026년 3월 9일

Reasoning Models Struggle to Control their Chains of Thought

arXiv:2603.05706v1 Announce Type: new Abstract: Chainofthought CoT monitoring is a promising tool for detecting misbehaviors and understanding the motivations of modern reasoning models. However, if models can control what they verbalize in their CoT, it could undermine CoT monitorability.

이 콘텐츠는 ArXiv AI 원본 기사의 요약입니다. 전문은 원본 사이트에서 확인해주세요.

원문 기사 보기 →