Notebooks Now in Gemini AppClaude Managed Agents Now in Public BetaCodex Retires Older Models on ChatGPT Login Apr 14Anthropic Launches Project Glasswing with Mythos Preview for Vuln DetectionAnthropic Secures Multi-GW TPU Deal with Google, BroadcomOpenAI Launches Safety Fellowship ProgramOpenAI Proposes Industrial Policy for Superintelligence EraCodex Biz/Ent Shift to Token-Based BillingComplete Guide to Building Skills for Claude ReleasedGoogle Cloud AI Agents Guide Explains Key DifferencesCodex Adds Vercel Plugin for Easy DeploymentClaude Subs No Longer Cover Third-Party ToolsClaude Subs Exclude 3rd-Party Tools, Go Pay-Per-UseAnthropic Releases Model Diffing for AI Behavior ComparisonNew Flex and Priority Tiers in Gemini APIGemma 4 Open Models ReleasedLyria 3 Prompting Tips in Gemini AppMicrosoft 365 Connectors Now on All Claude PlansGoogle Releases Gemma 4: Top Open Lightweight AI ModelClaude Computer Use expands to Windows (from macOS)Codex Pay-as-You-Go in ChatGPT Business, No Fixed Seat FeesGemini API Adds Flex & Priority TiersGoogle Vids Gains Veo 3.1 Video & Lyria Music Gen, Free TierEmotion vectors discovered in Claude causally affecting behaviorChatGPT Now on Apple CarPlay (iOS 26.4+)Google Releases Gemma 4 Open ModelsGoogle DeepMind Releases Gemma 4 Model Family (31B etc.)Linear Plugin Added to Codex AppSimplified Data Grounding in Gemini EnterpriseAnthropic Signs AI Safety MOU with AustraliaAnthropic Launches Claude Code Agentic Coding SystemClaude Code Leak Reveals Mythos ModelAnthropic Flattens High-Volume Pricing for Claude Opus/Sonnet APIPerplexity Scales Voice Search to Millions with Realtime APIOpenAI Codex Security Preview Remains FreeGKE Agent Sandbox Enables Sub-Second Agent StartsCodex Use Cases Gallery LaunchedGemini Personal Intelligence Free for US UsersCodex Plugins Enable Seamless Tool IntegrationDeepMind Releases AI Manipulation Measurement ToolkitOpenAI Reveals Model Spec Design and Evolution ProcessConversational Analytics Launches in BigQueryOpenAI Releases Teen Safety Prompts for gpt-oss-safeguardClaude Long-term Users Achieve 10% Higher Success Rate StudyMulti-Agent Harness Boosts Frontend DesignAnthropic Launches Science Blog for AI Science WorkflowsClaude Accelerates Grad-Level Physics CalcTrace Analytics Adds SQL Queries & ChartsCost-Effective LLM Serving with Ollama on GKE GPU SharingGoogle Adds MCP Servers for DBs, Boosting AI Agent IntegrationMigrate Apps to Cloud with Gemini CLI & MCP GuideDesign Delightful Frontends with GPT-5.4Parameter Golf Challenge for Minimal PromptsAnthropic Publishes 81k AI User Interviews (34 chars)Gemini 3 Preview: Thinking Levels & Image Gen Boost (42 chars)DeepMind Launches Cognitive Framework for AGI ProgressOpenAI Launches GPT-5.4 Mini & Nano Fast Small ModelsCloud Run Supports NVIDIA RTX PRO 6000 GPUs (38 chars)Sora API Adds Reusable Chars, 20s Videos, 1080pAnthropic Launches Institute for AI Societal ImpactsResponses API adds shell and container environmentChatGPT adds interactive math & science visualsOpenAI shares instruction-priority training approachOpenAI to acquire Promptfoo for eval securityGemini 3.1 Flash-Lite Preview: Fast, Low-Cost ModelUpdated GPT-5.4 Prompt Guide with Agent PatternsAnthropic Reveals Claude's Eval Awareness CaseAnthropic's Claude Opus 4.6 Finds 22 Firefox VulnerabilitiesAnthropic updates vulnerability disclosure policyAnthropic details a Claude exploit case studyGoogle ships TF 2.21, production LiteRT stackOpenAI Releases CoT Controllability Eval SuiteOpenAI Rolls Out GPT-5.4 Thinking & Pro in ChatGPTChatGPT for Excel beta and finance integrationsGPT-5.4 Thinking System Card details safety approachAnthropic proposes “observed exposure” labor metricGPT-5.4 pro arrives as Responses-only modelOpenAI ships a new GPT-5.4 API snapshotAnthropic explains current Department of War designationOpenAI boosts education support to close AI gapsOpenAI releases GPT‑5.4 for pro workflowsOpenAI releases new tools to measure learning outcomesCodex app adds Windows support rolloutGoogle ships February Gemini Drop updatesOpenAI publishes GPT‑5.3 Instant System CardGoogle adds Gemini in-app tasks in March Pixel DropClaude Skills Repo Open-SourcedOpenAI containers switch to session billing on 3/31OpenAI-DoW Safety Agreement for Classified AIAnthropic Challenges DoW Supply Risk LabelOpenAI-Amazon Partnership Launches Stateful Runtime on BedrockOpenAI Updates Assistants API Migration Guide with Sunset DateGoogle expands ADK integrations for agentsGoogle ships Nano Banana 2 in MENAOpenAI to add mental-health safety features soonGoogle expands ADK integrations for agentsAnthropic's Complete Guide to Claude SkillsOpenAI adds a stateful runtime in Amazon BedrockGoogle Releases Nano Banana 2 Prompt GuideCodex and Figma Integration Enables Auto UI Design from CodeOpenAI Announces End Date for Realtime API Beta: Feb 27, 2026OpenAI and Figma Launch Direct Codex-Design CollaborationAnthropic details defense talks in CEO statementOpenAI & PNNL aim to speed up permitting draftsAI Edge Gallery adds on-device function callingGoogle unveils Nano Banana 2 for faster Gemini imagesGoogle releases Nano Banana 2 for faster imagesGoogle Enhances Gemini 2.5 Audio Model for Live Voice AgentsAnthropic Acquires Vercept_ai for Claude Computer UseOpenAI Plans Realtime API Beta Shutdown on February 27, 2026Google updates Circle to Search for multi-object queriesOpenAI Publishes Report on AI Abuse Detection and MitigationGalaxy S26 Adds On-Device Gemini AI for Call Fraud DetectionAnthropic Publishes Responsible Scaling Policy v3.0 to Boost TransparencyOpenAI Launches GPT-5.3-Codex for Advanced Coding AgentsGoogle DeepMind Launches Robotics Accelerator for European StartupsAnthropic releases major RSP rewrite (v3.0)Anthropic releases RSP 3.0 governance rewriteAnthropic Updates Responsible Scaling Policy to Version 3.0Anthropic Publishes Responsible Scaling Policy 3.0 for Frontier AI SafetyGoogle Labs Adds Agent Procedures to Opal WorkflowsAnthropic Updates Responsible Scaling Policy to 3.0OpenAI Launches Frontier Alliances to Accelerate Enterprise AIAnthropic Publishes Claude AI Fluency Index MetricAnthropic details defenses vs distillation attacksOpenAI Plans to Retire SWE-bench Verified EvaluationAnthropic Updates Frontier Safety Roadmap with Feb 2026 GoalsAnthropic launches Claude Sonnet 4.6 with 1M contextChatGPT adds Lockdown Mode and risk labelsOpenAI launches Codex-Spark for ultra-low latencyGemini 3 Deep Think Gets Major Upgrade for ScienceOpenAI starts testing ads in ChatGPT (US)OpenAI launches GPT-5.3-Codex for agentsOpenAI Rolls Out Trusted Access for CybersecurityGoogle previews Developer Knowledge API + MCPXcode 26.3 Integrates Claude Agent SDK Nativelygpt-oss-120b Open Weights ReleasedCodex Integrates Work Context for PrioritiesCodex Hits 3M Weekly Users, Resets Rate Limits5 Insights for Agentic AI in 2026Claude Mythos Preview System Card ReleasedGoogle AI Studio Production UpdateGemma 4 runs offline on phones for local AI agentsGoogle Cloud Spotlights Gemini Embedding 2Claude Subs Exclude Third-Party ToolsClaude Subs Now Meter Third-Party Harness UsageClaude Subs Shift to Pay-Go for 3rd-Party ToolsGemini Provides Private Missed Summary in Google MeetVoice Agent Debugs Slides Live with gpt-realtime-1.5Codex Hooks Enable Lifecycle Script ExecutionGemini 3.1 Flash-Lite Released, Fastest & Cheapest with Audio BoostGoogle Vids Adds Free Veo 3.1 Video Gen & Lyria MusicOpenAI Acquires TBPN to Expand Global AI Media ConversationsBuild Coding Assistant with Gemini MCP/SkillsGemini 3.1 Flash-Lite Preview: 2.5x Faster at $0.25/M InputAnthropic Tightens Claude Subscription Limits During Peak HoursGemini API Adds Prepay Billing and Project Spend CapsGemini 3.1 Flash Live Preview Enhances Real-Time ChatStreet View Insights GA: Analyze 280B ImagesBigQuery Global Queries Across RegionsOpenAI Launches Safety Bug Bounty ProgramOpenAI Foundation Announces $1B Investment PlansGemini Automates Scheduling in Google CalendarPersonalized marketing campaigns with Vertex AIClaude Code advanced patterns webinar releasedGoogle Cloud shares 5 steps to scale AI beyond pilotsGKE LLM Inference Optimization QuickstartNeo4j Extension Added to Gemini CLI for NL Cypher QueriesGKE Inference Gateway Boosts Vertex AI LatencyOpenAI Acquires Astral to Boost CodexGemini API Adds Multi-Tool Calls & Google Maps GroundingSpanner CQL APIs Now GA on Google Cloud (32 chars)Gemini API Pricing Update Adds Gemini 3 Previews (38 chars)Gemini API Spend Caps Added to Google AI StudioBigQuery Studio Assistant Supercharged by GeminiBuild Onboarding Agents with Gemini Enterprise GuideClaude Now Creates Interactive Charts and DiagramsAnthropic Opens Sydney Office, Expands in APACOpenAI shares operational tips in Codex Prompting GuideGemini Sheets Hits SOTA on SpreadsheetBenchOpenAI Improves Instruction Hierarchy for SafetyGoogle adds Gemini scheduling help in GmailSkills accelerate OSS maintenance in Agents SDKCalendar updates now email as the principalOpenAI Academy updates ChatGPT Skills guideDescript: scaling multilingual dubbing with AIOpenAI shares an AI investing research case studyGoogle shares remixable I/O 2026 AI gamesClaude Opus 4.6 sabotage risk report publishedGPT-5.4 Thinking system card publishedChatGPT “Skills” guide updated for reusable workflowsGoogle Cloud Next '26 developer guide is outOpenAI Academy shares 4 GPT templates for SMBsOpenAI shares how Axios uses ChatGPT for journalismVeo 3 expands and reaches Gemini mobileOpenAI guide: structure prompts for caching winsOpenAI docs warn on safe MCP server connectionsGoogle explains how it built I/O 2026 with GeminiDeepMind shares Project Genie prompt tipsOpenAI documents Prompt Caching to cut latency and costOpenAI Loosens o1 Model Policy for Broader Commercial UseOpenAI Prompting guide covers versioned promptsAnthropic outlines prompt engineering basicsAnthropic shows Prompt improver workflowAnthropic Releases Code Gen Prompt Patterns for Claude 4Responses API migration guide for agent appsOpenAI explains prompt-based JSON schema generationGoogle AI Plus expands to 35 regions incl. USOpenAI Expands ChatGPT Enterprise to Japanese FirmsOpenAI guide for long-horizon Codex tasksAnthropic Details Prompt Improvement Steps for ClaudeGoogle Updates Basic Prompt Design Guide for GeminiOpenAI and Microsoft reaffirm partnership termsGoogle rolls out Nano Banana 2 across MENAGemini generates 30‑second songs with Lyria 3OpenAI and Microsoft reaffirm partnership termsGoogle publishes Gemini 3.1 Flash Image model cardGoogle Nano Banana 2 boosts fast image qualityGuide to Using System Instructions in Gemini APIOpenAI Updates Guide on Optimizing Prompt CachingAnthropic Updates Transparency Info for Claude Opus 4.6OpenAI Publishes PDF Summarizing Prompt Engineering BasicsOpenAI Details How to Use Prompt Caching EffectivelyOpenAI Academy ships prompt pack for gov tasksOpenAI Releases 'ChatGPT Tasks' Prompt Collection for GovernmentOpenAI Appoints Arvind KC as Chief People OfficerOpenAI Improves gpt-realtime-1.5 Voice Accuracy in Realtime APIGemini API Adds Multimodal Function Calls with Image SupportAnthropic explains Claude’s persona selection researchAnthropic Unveils Measures Against Distillation AttacksAnthropic previews Claude Code SecurityOpenAI publishes First Proof attempts with prompt appendixGoogle Releases Workflow for Creating XR 3D Experiences with GeminiGoogle Introduces Gemini's Lyria 3 Music Generation with Arabic BetaOpenAI Grants $7.5M to UK AISI’s Alignment ProjectOpenAI Releases EVMbench to Evaluate AI Agent Vulnerability DetectionGemini app adds Lyria 3 for music creationOpenAI Publishes California Privacy Requests DataOpenAI Warns About Unauthorized OpenAI Stock TradesConductor adds automated reviews for codeGemini CLI adds extension settings for easier setupTips for long‑running agents: Skills & ShellGemini CLI Extensions Simplify Config and Improve Key ManagementData Commons MCP becomes hosted for zero-install accessOpenAI brings ChatGPT to GenAI.milClaude Opus 4.6 boosts coding and long-context skillsAnthropic states Claude will remain ad-free in chatsGuide: Using Agentic Vision in Gemini 3 FlashGoogle Vertex AI Offers Prompt Templates for Scalable TestingGoogle upgrades Gemini and Classroom for educatorsGoogle Explains Key Prompt Design StrategiesAnthropic internal prompting leaks: constraint-focused styleBuild real-time agents with Gemini 3.1 Flash LiveGoogle AI Releases Vibe Coding Explainer VideoGemini in Gmail Enhances Data Privacy ExplainedClaude's Learning Mode Enables Step-by-Step TutoringCodex Workflow Building Spaces EventCustomize Gemini CLI with HooksClaude offers 13 free AI courses and certificatesOpen-Source Recreation of Claude Code Prompt ArchitectureCLAUDE.md Template for Claude Self-Improvement LoopsGemini Enterprise Automates Meeting PrepClaude Cowork Skill for Google Ads Dashboard CreationClaude Prompts Using MIT Prof Framework for PresentationsLearn Prompt Design & Tool Chaining from Leaked Claude Code SourceGoogle Offers Veo 3.1 Lite at Lowest Price in Gemini APIAnthropic announces AMA on Claude Code + MCPVertex AI Guide for Fine-Tuning Gemini 2.5 FlashUsing Codex to Migrate API to Latest ModelsData Grounding Guide Boosts Agent AccuracyGoogle Cloud Releases 101 GenAI BlueprintsNew Gemini/Claude Dashboards in Cloud Monitoring3 New Gemini Features Coming to Google TVSingle-tenant Cloud HSM for full key controlDataform adds hierarchical folder and repo structureSpanner MCP server codelab for natural language queriesOpenAI Updates Data Policy, Clarifies Training Opt-Out (44 chars)Claude 4.6 Makes 1M Token Context Standard PricingGenAI Podcasts from Audio Architecture GuideOpenAI Shares 1-Year Responses API Review & Use CasesWayfair improves catalog with OpenAI case studyGoogle launches Wednesday Build Hour for AI agent buildingAnthropic schedules agentic AI event (Mar 17)Google shares Lyria 3 music prompt guideGemini Image prompt guide for better resultsOpenAI shares a 5-step guide to AI adoptionGoogle posts the weekly Workspace updates recapPreprint: GPT-assisted quantum gravity resultAnthropic Updates Claude Connectors for Extended CapabilitiesAnthropic shares Claude 4 prompting best practicesGoogle refreshes prompt design intro for Vertex AIAnthropic Adds Image Upload to ClaudeGoogle Updates Data Analysis Prompts for GeminiAnthropic's AI Fluency Framework GuideElevenLabs expands Google Cloud tie-up for voice AIOpenAI Academy posts ChatGPT Tasks prompt pack for govOpenAI Academy publishes ChatGPT 102 workbookOpenAI Academy Shares Teacher Training Case StudyGoogle I/O 2026 set for May 19–20; Gemini teasedGoogle showcases practical Gemini use cases in new adsAnthropic publishes a PDF of Claude’s ConstitutionGoogle releases TranslateGemma open translation modelsOpenAI Releases GPT-5.1 Prompting Guide w/ New ToolsNotebooks Now in Gemini AppClaude Managed Agents Now in Public BetaCodex Retires Older Models on ChatGPT Login Apr 14Anthropic Launches Project Glasswing with Mythos Preview for Vuln DetectionAnthropic Secures Multi-GW TPU Deal with Google, BroadcomOpenAI Launches Safety Fellowship ProgramOpenAI Proposes Industrial Policy for Superintelligence EraCodex Biz/Ent Shift to Token-Based BillingComplete Guide to Building Skills for Claude ReleasedGoogle Cloud AI Agents Guide Explains Key DifferencesCodex Adds Vercel Plugin for Easy DeploymentClaude Subs No Longer Cover Third-Party ToolsClaude Subs Exclude 3rd-Party Tools, Go Pay-Per-UseAnthropic Releases Model Diffing for AI Behavior ComparisonNew Flex and Priority Tiers in Gemini APIGemma 4 Open Models ReleasedLyria 3 Prompting Tips in Gemini AppMicrosoft 365 Connectors Now on All Claude PlansGoogle Releases Gemma 4: Top Open Lightweight AI ModelClaude Computer Use expands to Windows (from macOS)Codex Pay-as-You-Go in ChatGPT Business, No Fixed Seat FeesGemini API Adds Flex & Priority TiersGoogle Vids Gains Veo 3.1 Video & Lyria Music Gen, Free TierEmotion vectors discovered in Claude causally affecting behaviorChatGPT Now on Apple CarPlay (iOS 26.4+)Google Releases Gemma 4 Open ModelsGoogle DeepMind Releases Gemma 4 Model Family (31B etc.)Linear Plugin Added to Codex AppSimplified Data Grounding in Gemini EnterpriseAnthropic Signs AI Safety MOU with AustraliaAnthropic Launches Claude Code Agentic Coding SystemClaude Code Leak Reveals Mythos ModelAnthropic Flattens High-Volume Pricing for Claude Opus/Sonnet APIPerplexity Scales Voice Search to Millions with Realtime APIOpenAI Codex Security Preview Remains FreeGKE Agent Sandbox Enables Sub-Second Agent StartsCodex Use Cases Gallery LaunchedGemini Personal Intelligence Free for US UsersCodex Plugins Enable Seamless Tool IntegrationDeepMind Releases AI Manipulation Measurement ToolkitOpenAI Reveals Model Spec Design and Evolution ProcessConversational Analytics Launches in BigQueryOpenAI Releases Teen Safety Prompts for gpt-oss-safeguardClaude Long-term Users Achieve 10% Higher Success Rate StudyMulti-Agent Harness Boosts Frontend DesignAnthropic Launches Science Blog for AI Science WorkflowsClaude Accelerates Grad-Level Physics CalcTrace Analytics Adds SQL Queries & ChartsCost-Effective LLM Serving with Ollama on GKE GPU SharingGoogle Adds MCP Servers for DBs, Boosting AI Agent IntegrationMigrate Apps to Cloud with Gemini CLI & MCP GuideDesign Delightful Frontends with GPT-5.4Parameter Golf Challenge for Minimal PromptsAnthropic Publishes 81k AI User Interviews (34 chars)Gemini 3 Preview: Thinking Levels & Image Gen Boost (42 chars)DeepMind Launches Cognitive Framework for AGI ProgressOpenAI Launches GPT-5.4 Mini & Nano Fast Small ModelsCloud Run Supports NVIDIA RTX PRO 6000 GPUs (38 chars)Sora API Adds Reusable Chars, 20s Videos, 1080pAnthropic Launches Institute for AI Societal ImpactsResponses API adds shell and container environmentChatGPT adds interactive math & science visualsOpenAI shares instruction-priority training approachOpenAI to acquire Promptfoo for eval securityGemini 3.1 Flash-Lite Preview: Fast, Low-Cost ModelUpdated GPT-5.4 Prompt Guide with Agent PatternsAnthropic Reveals Claude's Eval Awareness CaseAnthropic's Claude Opus 4.6 Finds 22 Firefox VulnerabilitiesAnthropic updates vulnerability disclosure policyAnthropic details a Claude exploit case studyGoogle ships TF 2.21, production LiteRT stackOpenAI Releases CoT Controllability Eval SuiteOpenAI Rolls Out GPT-5.4 Thinking & Pro in ChatGPTChatGPT for Excel beta and finance integrationsGPT-5.4 Thinking System Card details safety approachAnthropic proposes “observed exposure” labor metricGPT-5.4 pro arrives as Responses-only modelOpenAI ships a new GPT-5.4 API snapshotAnthropic explains current Department of War designationOpenAI boosts education support to close AI gapsOpenAI releases GPT‑5.4 for pro workflowsOpenAI releases new tools to measure learning outcomesCodex app adds Windows support rolloutGoogle ships February Gemini Drop updatesOpenAI publishes GPT‑5.3 Instant System CardGoogle adds Gemini in-app tasks in March Pixel DropClaude Skills Repo Open-SourcedOpenAI containers switch to session billing on 3/31OpenAI-DoW Safety Agreement for Classified AIAnthropic Challenges DoW Supply Risk LabelOpenAI-Amazon Partnership Launches Stateful Runtime on BedrockOpenAI Updates Assistants API Migration Guide with Sunset DateGoogle expands ADK integrations for agentsGoogle ships Nano Banana 2 in MENAOpenAI to add mental-health safety features soonGoogle expands ADK integrations for agentsAnthropic's Complete Guide to Claude SkillsOpenAI adds a stateful runtime in Amazon BedrockGoogle Releases Nano Banana 2 Prompt GuideCodex and Figma Integration Enables Auto UI Design from CodeOpenAI Announces End Date for Realtime API Beta: Feb 27, 2026OpenAI and Figma Launch Direct Codex-Design CollaborationAnthropic details defense talks in CEO statementOpenAI & PNNL aim to speed up permitting draftsAI Edge Gallery adds on-device function callingGoogle unveils Nano Banana 2 for faster Gemini imagesGoogle releases Nano Banana 2 for faster imagesGoogle Enhances Gemini 2.5 Audio Model for Live Voice AgentsAnthropic Acquires Vercept_ai for Claude Computer UseOpenAI Plans Realtime API Beta Shutdown on February 27, 2026Google updates Circle to Search for multi-object queriesOpenAI Publishes Report on AI Abuse Detection and MitigationGalaxy S26 Adds On-Device Gemini AI for Call Fraud DetectionAnthropic Publishes Responsible Scaling Policy v3.0 to Boost TransparencyOpenAI Launches GPT-5.3-Codex for Advanced Coding AgentsGoogle DeepMind Launches Robotics Accelerator for European StartupsAnthropic releases major RSP rewrite (v3.0)Anthropic releases RSP 3.0 governance rewriteAnthropic Updates Responsible Scaling Policy to Version 3.0Anthropic Publishes Responsible Scaling Policy 3.0 for Frontier AI SafetyGoogle Labs Adds Agent Procedures to Opal WorkflowsAnthropic Updates Responsible Scaling Policy to 3.0OpenAI Launches Frontier Alliances to Accelerate Enterprise AIAnthropic Publishes Claude AI Fluency Index MetricAnthropic details defenses vs distillation attacksOpenAI Plans to Retire SWE-bench Verified EvaluationAnthropic Updates Frontier Safety Roadmap with Feb 2026 GoalsAnthropic launches Claude Sonnet 4.6 with 1M contextChatGPT adds Lockdown Mode and risk labelsOpenAI launches Codex-Spark for ultra-low latencyGemini 3 Deep Think Gets Major Upgrade for ScienceOpenAI starts testing ads in ChatGPT (US)OpenAI launches GPT-5.3-Codex for agentsOpenAI Rolls Out Trusted Access for CybersecurityGoogle previews Developer Knowledge API + MCPXcode 26.3 Integrates Claude Agent SDK Nativelygpt-oss-120b Open Weights ReleasedCodex Integrates Work Context for PrioritiesCodex Hits 3M Weekly Users, Resets Rate Limits5 Insights for Agentic AI in 2026Claude Mythos Preview System Card ReleasedGoogle AI Studio Production UpdateGemma 4 runs offline on phones for local AI agentsGoogle Cloud Spotlights Gemini Embedding 2Claude Subs Exclude Third-Party ToolsClaude Subs Now Meter Third-Party Harness UsageClaude Subs Shift to Pay-Go for 3rd-Party ToolsGemini Provides Private Missed Summary in Google MeetVoice Agent Debugs Slides Live with gpt-realtime-1.5Codex Hooks Enable Lifecycle Script ExecutionGemini 3.1 Flash-Lite Released, Fastest & Cheapest with Audio BoostGoogle Vids Adds Free Veo 3.1 Video Gen & Lyria MusicOpenAI Acquires TBPN to Expand Global AI Media ConversationsBuild Coding Assistant with Gemini MCP/SkillsGemini 3.1 Flash-Lite Preview: 2.5x Faster at $0.25/M InputAnthropic Tightens Claude Subscription Limits During Peak HoursGemini API Adds Prepay Billing and Project Spend CapsGemini 3.1 Flash Live Preview Enhances Real-Time ChatStreet View Insights GA: Analyze 280B ImagesBigQuery Global Queries Across RegionsOpenAI Launches Safety Bug Bounty ProgramOpenAI Foundation Announces $1B Investment PlansGemini Automates Scheduling in Google CalendarPersonalized marketing campaigns with Vertex AIClaude Code advanced patterns webinar releasedGoogle Cloud shares 5 steps to scale AI beyond pilotsGKE LLM Inference Optimization QuickstartNeo4j Extension Added to Gemini CLI for NL Cypher QueriesGKE Inference Gateway Boosts Vertex AI LatencyOpenAI Acquires Astral to Boost CodexGemini API Adds Multi-Tool Calls & Google Maps GroundingSpanner CQL APIs Now GA on Google Cloud (32 chars)Gemini API Pricing Update Adds Gemini 3 Previews (38 chars)Gemini API Spend Caps Added to Google AI StudioBigQuery Studio Assistant Supercharged by GeminiBuild Onboarding Agents with Gemini Enterprise GuideClaude Now Creates Interactive Charts and DiagramsAnthropic Opens Sydney Office, Expands in APACOpenAI shares operational tips in Codex Prompting GuideGemini Sheets Hits SOTA on SpreadsheetBenchOpenAI Improves Instruction Hierarchy for SafetyGoogle adds Gemini scheduling help in GmailSkills accelerate OSS maintenance in Agents SDKCalendar updates now email as the principalOpenAI Academy updates ChatGPT Skills guideDescript: scaling multilingual dubbing with AIOpenAI shares an AI investing research case studyGoogle shares remixable I/O 2026 AI gamesClaude Opus 4.6 sabotage risk report publishedGPT-5.4 Thinking system card publishedChatGPT “Skills” guide updated for reusable workflowsGoogle Cloud Next '26 developer guide is outOpenAI Academy shares 4 GPT templates for SMBsOpenAI shares how Axios uses ChatGPT for journalismVeo 3 expands and reaches Gemini mobileOpenAI guide: structure prompts for caching winsOpenAI docs warn on safe MCP server connectionsGoogle explains how it built I/O 2026 with GeminiDeepMind shares Project Genie prompt tipsOpenAI documents Prompt Caching to cut latency and costOpenAI Loosens o1 Model Policy for Broader Commercial UseOpenAI Prompting guide covers versioned promptsAnthropic outlines prompt engineering basicsAnthropic shows Prompt improver workflowAnthropic Releases Code Gen Prompt Patterns for Claude 4Responses API migration guide for agent appsOpenAI explains prompt-based JSON schema generationGoogle AI Plus expands to 35 regions incl. USOpenAI Expands ChatGPT Enterprise to Japanese FirmsOpenAI guide for long-horizon Codex tasksAnthropic Details Prompt Improvement Steps for ClaudeGoogle Updates Basic Prompt Design Guide for GeminiOpenAI and Microsoft reaffirm partnership termsGoogle rolls out Nano Banana 2 across MENAGemini generates 30‑second songs with Lyria 3OpenAI and Microsoft reaffirm partnership termsGoogle publishes Gemini 3.1 Flash Image model cardGoogle Nano Banana 2 boosts fast image qualityGuide to Using System Instructions in Gemini APIOpenAI Updates Guide on Optimizing Prompt CachingAnthropic Updates Transparency Info for Claude Opus 4.6OpenAI Publishes PDF Summarizing Prompt Engineering BasicsOpenAI Details How to Use Prompt Caching EffectivelyOpenAI Academy ships prompt pack for gov tasksOpenAI Releases 'ChatGPT Tasks' Prompt Collection for GovernmentOpenAI Appoints Arvind KC as Chief People OfficerOpenAI Improves gpt-realtime-1.5 Voice Accuracy in Realtime APIGemini API Adds Multimodal Function Calls with Image SupportAnthropic explains Claude’s persona selection researchAnthropic Unveils Measures Against Distillation AttacksAnthropic previews Claude Code SecurityOpenAI publishes First Proof attempts with prompt appendixGoogle Releases Workflow for Creating XR 3D Experiences with GeminiGoogle Introduces Gemini's Lyria 3 Music Generation with Arabic BetaOpenAI Grants $7.5M to UK AISI’s Alignment ProjectOpenAI Releases EVMbench to Evaluate AI Agent Vulnerability DetectionGemini app adds Lyria 3 for music creationOpenAI Publishes California Privacy Requests DataOpenAI Warns About Unauthorized OpenAI Stock TradesConductor adds automated reviews for codeGemini CLI adds extension settings for easier setupTips for long‑running agents: Skills & ShellGemini CLI Extensions Simplify Config and Improve Key ManagementData Commons MCP becomes hosted for zero-install accessOpenAI brings ChatGPT to GenAI.milClaude Opus 4.6 boosts coding and long-context skillsAnthropic states Claude will remain ad-free in chatsGuide: Using Agentic Vision in Gemini 3 FlashGoogle Vertex AI Offers Prompt Templates for Scalable TestingGoogle upgrades Gemini and Classroom for educatorsGoogle Explains Key Prompt Design StrategiesAnthropic internal prompting leaks: constraint-focused styleBuild real-time agents with Gemini 3.1 Flash LiveGoogle AI Releases Vibe Coding Explainer VideoGemini in Gmail Enhances Data Privacy ExplainedClaude's Learning Mode Enables Step-by-Step TutoringCodex Workflow Building Spaces EventCustomize Gemini CLI with HooksClaude offers 13 free AI courses and certificatesOpen-Source Recreation of Claude Code Prompt ArchitectureCLAUDE.md Template for Claude Self-Improvement LoopsGemini Enterprise Automates Meeting PrepClaude Cowork Skill for Google Ads Dashboard CreationClaude Prompts Using MIT Prof Framework for PresentationsLearn Prompt Design & Tool Chaining from Leaked Claude Code SourceGoogle Offers Veo 3.1 Lite at Lowest Price in Gemini APIAnthropic announces AMA on Claude Code + MCPVertex AI Guide for Fine-Tuning Gemini 2.5 FlashUsing Codex to Migrate API to Latest ModelsData Grounding Guide Boosts Agent AccuracyGoogle Cloud Releases 101 GenAI BlueprintsNew Gemini/Claude Dashboards in Cloud Monitoring3 New Gemini Features Coming to Google TVSingle-tenant Cloud HSM for full key controlDataform adds hierarchical folder and repo structureSpanner MCP server codelab for natural language queriesOpenAI Updates Data Policy, Clarifies Training Opt-Out (44 chars)Claude 4.6 Makes 1M Token Context Standard PricingGenAI Podcasts from Audio Architecture GuideOpenAI Shares 1-Year Responses API Review & Use CasesWayfair improves catalog with OpenAI case studyGoogle launches Wednesday Build Hour for AI agent buildingAnthropic schedules agentic AI event (Mar 17)Google shares Lyria 3 music prompt guideGemini Image prompt guide for better resultsOpenAI shares a 5-step guide to AI adoptionGoogle posts the weekly Workspace updates recapPreprint: GPT-assisted quantum gravity resultAnthropic Updates Claude Connectors for Extended CapabilitiesAnthropic shares Claude 4 prompting best practicesGoogle refreshes prompt design intro for Vertex AIAnthropic Adds Image Upload to ClaudeGoogle Updates Data Analysis Prompts for GeminiAnthropic's AI Fluency Framework GuideElevenLabs expands Google Cloud tie-up for voice AIOpenAI Academy posts ChatGPT Tasks prompt pack for govOpenAI Academy publishes ChatGPT 102 workbookOpenAI Academy Shares Teacher Training Case StudyGoogle I/O 2026 set for May 19–20; Gemini teasedGoogle showcases practical Gemini use cases in new adsAnthropic publishes a PDF of Claude’s ConstitutionGoogle releases TranslateGemma open translation modelsOpenAI Releases GPT-5.1 Prompting Guide w/ New Tools
🔒 公式発表のみ掲載。噂・リーク・情報商材は載せません。
← Back to glossary

RLHF (Reinforcement Learning from Human Feedback)

RLHF

ああるえるえいちえふ

Definition

RLHF is a technique that builds a reward model from human preference judgments and then uses reinforcement learning to align a model's behavior. It is widely used to improve helpfulness and safety.

LLMは膨大なテキストで学習しますが、それだけでは「人が本当に求めている回答」を返せるとは限りません。文法的に完璧でも、質問の意図を外した冗長な回答や、不適切な内容を生成してしまうことがあります。RLHF(Reinforcement Learning from Human Feedback)とは、人間が「こちらの回答のほうが良い」と判断した比較データを使い、強化学習でモデルの出力を人間の好みに沿うように最適化する手法です。

3つのステップ

RLHFのプロセスは明確に3段階に分かれます。第1段階は教師ありファインチューニング(SFT)で、人手で作成した高品質な「質問→回答」ペアでモデルを微調整し、対話の基本形を学ばせます。第2段階は報酬モデルの訓練です。同じ質問に対してモデルが生成した複数の回答を人間の評価者が比較・ランキングし、そのデータで「どの回答がどれだけ良いか」をスコア化する報酬モデルを構築します。第3段階で、この報酬モデルのスコアを報酬シグナルとして、PPO(Proximal Policy Optimization)というアルゴリズムでLLM全体を強化学習します。PPOは方策の更新幅を制限して学習を安定させる手法で、元のモデルから逸脱しすぎないよう「KLペナルティ」も加えます。

InstructGPTとChatGPTの誕生

RLHFが一躍有名になったのは、2022年にOpenAIが発表したInstructGPTの論文です。GPT-3(1750億パラメータ)にRLHFを適用したInstructGPTは、パラメータ数が100分の1の13億でも、人間の評価でGPT-3を上回るという衝撃的な結果を示しました。モデルの「賢さ」は規模だけでなく、人間のフィードバックの取り込み方で決まることが実証されたのです。ChatGPTはこの技術の発展形であり、RLHFなしには現在の対話体験は実現しませんでした。

課題と新しいアプローチ

RLHFの最大の課題は運用コストの高さです。人間による比較評価データの作成には、1件あたり数ドルのコストが必要です。報酬モデルのスコアだけを最大化する抜け穴を見つけてしまう「reward hacking(報酬ハック)」問題も厄介です。こうした課題から、DPO(Direct Preference Optimization)が2023年に提案されました。DPOは報酬モデルを介さず、人間の比較データから直接モデルを最適化するため、パイプラインが大幅に簡素化されます。さらに、AIが評価を行うRLAIF(AI Feedback)や、2値判定で学習するKTO(Kahneman-Tversky Optimization)なども登場しています。

現在の主要モデルでの採用

現在のChatGPT、Claude、Geminiといった主要LLMは、いずれもRLHFまたはその発展形を採用しています。AnthropicはConstitutional AIとRLHFを組み合わせ、MetaのLlama系列ではRLHFとDPOの両方を活用しています。人間の好みをモデルに反映させるというRLHFの基本思想は、手法が進化しても変わらないAI開発の中核的な考え方です。

h
hayami

Stay on top of OpenAI, Google & Anthropic updates. An AI digest for business professionals.

Source Policy

We use only official sources. Each article links to the original announcement so you can verify it yourself.

© 2026 hayami. All rights reserved.