OpenAI03:21PolicyOfficial Blog
OpenAI Fixes Excessive Goblin Mentions in GPT Models
Eliminates unintended quirks for reliable business outputs.
Key Points
- 1Nerdy rewards boosted goblin mentions by 175%
- 2Filtered irrelevant creature data from training
- 3Added suppression prompt in GPT-5.5 Codex
- 4Developing auditing tools for behaviors
OpenAI traced excessive goblin/gremlin mentions in GPT-5.1+ to 'Nerdy' personality rewards. They fixed it by filtering data and removing signals, preventing recurrence. Developers can use suppression prompts in Codex. Improves model reliability for production.