Claude 3.5 Haiku

Fastest Claude model for instant responses at scale.

claude-3-5-haiku
STABLEModel DeactivatedGet StartedView uptime
200,000 context
Starting at $0.80/M input tokens
Starting at $4.00/M output tokens
Streaming
Tools
No ratings yetSign in to rate

All Providers for Claude 3.5 Haiku

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Anthropic
Context: 200k
Deprecated since Dec 19, 2025Deactivated since Feb 19, 2026
Input
$0.8
/M tokens
Cached
$0.08
/M tokens
Output
$4
/M tokens
+ $0.010 per search
Get Started
AWS Bedrock
Context: 200k
Deprecated since Oct 28, 2025Deactivated since Apr 28, 2026
Input
$0.8
/M tokens
Cached
$0.08
/M tokens
Output
$4
/M tokens
Get Started