OpenAIFeature UpdatesOfficial Blog
OpenAI Improves Instruction Hierarchy for Safety
Safer agent deployments.
Key Points
- 1Prioritizes trusted instructions
- 2Resists prompt injection
- 3For frontier LLMs
IH-Challenge trains models to prioritize trusted instructions, boosting safety and steerability.