Claude 3.5 Haiku

Fastest Claude model for instant responses at scale.

claude-3-5-haiku
STABLEModel DeactivatedGet StartedView uptime
200,000 context
Starting at $0.56/M (30% off) input tokens
Starting at $2.80/M (30% off) output tokens
Streaming
Tools

All Providers for Claude 3.5 Haiku

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Anthropic
Context: 200k
Deprecated since Dec 19, 2025Deactivated since Feb 19, 2026
Input
$0.8
/M tokens
Cached
$0.08
/M tokens
Output
$4
/M tokens
+ $0.010 per search
Get Started
AWS Bedrock
Context: 200k30% off
Deprecated since Oct 28, 2025Deactivated since Apr 28, 2026
Input
$0.8$0.56
/M tokens
Cached
$0.08$0.056
/M tokens
Output
$4$2.8
/M tokens
Get Started