Anthropic22:59Press ReleasesOfficial Blog
Anthropic Releases BioMysteryBench to Eval Claude's Bio Skills
Enables creative bio research solving with Claude.
Key Points
- 199 real-data analysis problems.
- 230% of 23 expert-unsolvable solved.
- 3On Hugging Face.
- 4Gen improvements in latest Claude.
Anthropic launched BioMysteryBench, a 99-question bioinformatics benchmark with real data. Claude solves 30% of expert-stumping problems. Method-agnostic objective eval boosts research workflows.