OpenAI20:07PolicyOfficial Blog
OpenAI Releases CoT Controllability Eval Suite
Simplifies safe AI ops by monitoring internal reasoning easily
Key Points
- 1CoT-Control: Open-source eval >13k tasks
- 2Controllability 0.1-15.4%, monitoring viable
- 3Better in larger models, worse in long reasoning
OpenAI launched CoT-Control eval suite across 13+ benchmarks testing 13 models' ability to obscure reasoning chains. Larger models show better controllability, but long reasoning reduces it, making CoT monitoring reliable for safety. Businesses can trust it for agent oversight.