AI summarized from verified sources
OpenAI Fixes Excessive Goblin Mentions in GPT Models
Eliminates unintended quirks for reliable business outputs.
SOURCE CHECK
3 sources
Sources
Key Points
- 1Nerdy rewards boosted goblin mentions by 175%
- 2Filtered irrelevant creature data from training
- 3Added suppression prompt in GPT-5.5 Codex
- 4Developing auditing tools for behaviors
OpenAI traced excessive goblin/gremlin mentions in GPT-5.1+ to 'Nerdy' personality rewards. They fixed it by filtering data and removing signals, preventing recurrence. Developers can use suppression prompts in Codex. Improves model reliability for production.
Key point
OpenAI traced excessive goblin/gremlin mentions in GPT-5.1+ to 'Nerdy' personality rewards. They fixed it by filtering data and removing signals, preventing recurrence. Developers can use suppression prompts in Codex. Improves model reliability for production.
Impact
Eliminates unintended quirks for reliable business outputs. Key checks: Nerdy rewards boosted goblin mentions by 175% / Filtered irrelevant creature data from training / Added suppression prompt in GPT-5.5 Codex.