Google · 15:51 · Pricing & Plans · Official Docs
Gemini API Flex/Priority Tiers: 50% Cost Cut for Flexible Workloads
Flex halves costs for non-urgent workloads.
Key Points
- Flex: 50% off for latency-tolerant workloads.
- Priority: automatic fallback to the standard tier.
- Switch tiers via the service_tier parameter.
- Applies to GenerateContent and Interactions.
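
To make the service_tier switch concrete, here is a minimal sketch of a request body that carries the parameter. The helper name `build_request`, the tier strings ("standard", "flex", "priority"), and the placement of `service_tier` at the top level of the body are assumptions for illustration; the source only names the parameter and the tiers, so check the official docs for the exact field location and accepted values.

```python
import json

# Assumed tier strings; the source names the Flex, Priority, and standard tiers
# but does not spell out the exact values the API accepts.
ALLOWED_TIERS = {"standard", "flex", "priority"}


def build_request(prompt: str, service_tier: str = "standard") -> dict:
    """Build an illustrative request body that carries a service_tier field."""
    if service_tier not in ALLOWED_TIERS:
        raise ValueError(f"unknown service tier: {service_tier!r}")
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        # Top-level placement of this field is an assumption.
        "service_tier": service_tier,
    }


if __name__ == "__main__":
    # A latency-tolerant job opts into Flex by changing one value.
    print(json.dumps(build_request("Summarize this log file.", "flex"), indent=2))
```

Keeping the tier as a single argument means a batch job and an interactive app can share the same request code and differ only in the value passed.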
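The Gemini API adds two service tiers: Flex, which is 50% cheaper and intended for latency-tolerant workloads, and Priority, which is more reliable and falls back to the standard tier. Changing a single parameter controls the cost/latency trade-off. The tiers suit batch jobs, evals, and production apps.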
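
As a rough illustration of what the 50% Flex discount means for a budget, the sketch below compares standard and Flex cost for the same token volume. The per-million-token price and the workload size are hypothetical placeholders, not published rates, and Priority pricing is left out because the source does not state it.

```python
FLEX_DISCOUNT = 0.50  # from the source: Flex is 50% cheaper


def estimate_cost(tokens: int, standard_price_per_million: float,
                  tier: str = "standard") -> float:
    """Estimate cost for a token volume under the standard or Flex tier."""
    base = tokens / 1_000_000 * standard_price_per_million
    if tier == "standard":
        return base
    if tier == "flex":
        return base * (1 - FLEX_DISCOUNT)
    # Priority pricing is not given in the source, so it is not modeled here.
    raise ValueError(f"no pricing rule for tier: {tier!r}")


if __name__ == "__main__":
    tokens = 200_000_000  # hypothetical nightly eval workload
    price = 1.00          # hypothetical standard price per million tokens
    print("standard:", estimate_cost(tokens, price, "standard"))
    print("flex:    ", estimate_cost(tokens, price, "flex"))
```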