🔒 Official announcements only. No rumors, leaks, or paid info products.
Google · 00:00 · Pricing & Plans · Official Blog

Flex & Priority Inference Tiers for Gemini API

Run latency-tolerant background jobs on the Flex tier for up to 50% lower cost.

Key Points

  • Flex: cost-optimized, lower priority
  • Priority: low-latency, high priority
  • For Gemini 2.5/3.1 models
  • Available now

Google added two inference tiers to the Gemini API: Flex (cost-optimized) and Priority (latency-optimized). Flex offers up to 50% savings for latency-tolerant workloads, while Priority gives traffic higher priority. Together they let developers balance cost, speed, and reliability.
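To make the trade-off concrete, here is a minimal sketch of how a team might route jobs between the two tiers and estimate the best-case Flex cost. It does not call the Gemini API. The `Job` class, the tier labels `"flex"` and `"priority"`, the job names, and the dollar figures are all hypothetical. The 50% figure is treated as an upper bound on the discount, since the announcement says "up to 50%".

```python
from dataclasses import dataclass

# Assumption: Flex costs up to 50% less than the standard rate.
# The real discount may be smaller, so this is a best-case bound.
FLEX_MAX_DISCOUNT = 0.50


@dataclass
class Job:
    name: str                  # hypothetical job label
    latency_tolerant: bool     # True for background or batch-style work
    standard_cost_usd: float   # hypothetical cost at the standard rate


def choose_tier(job: Job) -> str:
    """Route latency-tolerant jobs to Flex and everything else to Priority."""
    return "flex" if job.latency_tolerant else "priority"


def best_case_flex_cost(job: Job) -> float:
    """Lowest possible Flex cost, assuming the full 50% discount applies."""
    return job.standard_cost_usd * (1 - FLEX_MAX_DISCOUNT)


jobs = [
    Job("nightly-summaries", latency_tolerant=True, standard_cost_usd=120.0),
    Job("live-chat", latency_tolerant=False, standard_cost_usd=80.0),
]

for job in jobs:
    tier = choose_tier(job)
    if tier == "flex":
        print(f"{job.name}: {tier}, best case ${best_case_flex_cost(job):.2f}")
    else:
        # Priority pricing is not stated in the summary, so no estimate is made.
        print(f"{job.name}: {tier}")
```

The routing rule is deliberately simple: anything that can wait goes to Flex, and anything user-facing goes to Priority. Actual tier names, request parameters, and prices should be taken from the official announcement.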

hayami

Stay on top of OpenAI, Google & Anthropic updates. An AI digest for business professionals.

Source Policy

We use only official sources. Each article links to the original announcement so you can verify it yourself.

© 2026 hayami. All rights reserved.