Complete entire workflows with one request using GPT-5.6 Start natural voice conversations anytime with GPT-Live Easier model choice for smarter everyday work Long tasks can move from draft to presentation more easily Track the latest safety rules for bigger models Translate naturally during calls, meetings, and travel Easily automate multi-step daily tasks at lower cost Make Claude easier to deploy through AWS Claude Fable 5 is usable again after the pause Keep research tools and analysis in one place Delegate more everyday coding work to Claude Measure how well AI agents handle ambiguous biology research judgments Claude Sonnet 5 is built for heavier coding and work tasks HP partnership makes enterprise rollout easier Tag Claude in Slack to delegate tasks with your whole team Hand Slack tasks to Claude more easily Confidential AI gets stronger for sensitive workloads Gemini API key management is moving to safer auth keys Claude expands more easily into Korean businesses and research Anthropic expands Claude adoption and research in Korea Complete entire workflows with one request using GPT-5.6 Start natural voice conversations anytime with GPT-Live Easier model choice for smarter everyday work Long tasks can move from draft to presentation more easily Track the latest safety rules for bigger models Translate naturally during calls, meetings, and travel Easily automate multi-step daily tasks at lower cost Make Claude easier to deploy through AWS Claude Fable 5 is usable again after the pause Keep research tools and analysis in one place Delegate more everyday coding work to Claude Measure how well AI agents handle ambiguous biology research judgments Claude Sonnet 5 is built for heavier coding and work tasks HP partnership makes enterprise rollout easier Tag Claude in Slack to delegate tasks with your whole team Hand Slack tasks to Claude more easily Confidential AI gets stronger for sensitive workloads Gemini API key management is moving to safer auth keys Claude expands more easily into Korean businesses and research Anthropic expands Claude adoption and research in Korea

Official sources only. Rumors, leaks, and get-rich schemes are excluded.

← Back to top

AI BriefingAnthropicPress Releases19:39

AI summarized from verified sources

Anthropic Builds Auto Alignment Researchers, 97% Gap Closure

AI automates safety research, slashing human effort dramatically.

SOURCE CHECK

2 sources

VERIFIED

Sources

Primary / anthropic.com

Official Blog

Supporting / x.com

Official Blog

Key Points

197% gap recovery in supervision
24x faster than humans
3Generalizes to coding/math
4Highlights reward hacking risks

Anthropic developed Automated Alignment Researchers using Claude Opus 4.6, closing 97% of weak-to-strong supervision gap vs humans' 23%. Nine parallel AARs accelerated experiments. Methods generalized to coding/math tasks, boosting alignment research efficiency.

What changed

Anthropic developed Automated Alignment Researchers using Claude Opus 4.6, closing 97% of weak-to-strong supervision gap vs humans' 23%. Nine parallel AARs accelerated experiments. Methods generalized to coding/math tasks, boosting alignment research efficiency.

Why it matters

AI automates safety research, slashing human effort dramatically.

What to watch

AI automates safety research, slashing human effort dramatically. Key checks: 97% gap recovery in supervision / 4x faster than humans / Generalizes to coding/math.