Google00:00AvailabilityOfficial Blog
Gemini 3.1 Flash-Lite Now GA for Ultra-Low Latency Tasks
Handles high-volume low-latency tasks cost-effectively.
Key Points
- 1Now generally available.
- 2Ultra-low latency design.
- 3Best-in-class cost efficiency.
Google Cloud launched Gemini 3.1 Flash-Lite generally available. Optimized for ultra-low latency and high-volume tasks with top cost-efficiency. Enables real-time inference in production. Free trial in Google AI Studio.