gemini-1.5-flash-8b
Common Name: Gemini 1.5 Flash 8B
Google
-10%On SaleReleased on Aug 27, 2024 12:00 AMKnowledge Cutoff Apr 1, 2024 12:00 AMSupportedTool InvocationGoogle's smallest Gemini model optimized for speed and cost efficiency with multimodal support.
Specifications
Context
1048.6K
Maximum Output
8.2K
Inputtext, image, audio, video
Outputtext
Performance (7-day Average)
Collecting…
Collecting…
Collecting…
Pricing
Standard
128K Tier
Input/MTokens
$0.03
$0.07
Output/MTokens
$0.14
$0.27
Input Audio/MTokens
$0.03
$0.07
Availability Trend (24h)
Performance Metrics (24h)
Similar Models
$0.07/$0.27/M
ctx1.0Mmax8Kavail—tps—
InOutCap
A lightweight and fast version of Gemini 2.0 Flash optimized for cost-effective multimodal tasks with lower latency.
$0.07/$0.27/M
ctx1.0Mmax8Kavail—tps—
InOutCap
Google's most cost-efficient multimodal model with 1M token context, designed for high-volume applications requiring speed and affordability.
$0.07/$0.27/M
ctx1.0Mmax8Kavail—tps—
InOutCap
Google's fast, cost-efficient multimodal model with 1M token context for high-volume tasks.
$0.07/$0.27/M
ctx1.0Mmax8Kavail—tps—
InOutCap
Snapshot of Gemini 1.5 Flash with 1M token context for fast multimodal understanding.