Best Models for Translation
Multilingual models with strong translation quality across major and low-resource languages — compared by price and context
| Features | |||||
|---|---|---|---|---|---|
AWS Bedrock(global) | $2.00 | $10.00 | $0.20 | ||
AWS Bedrock | $2.00 | $10.00 | $0.20 | ||
Anthropic | $2.00 | $10.00 | $0.20 | ||
AWS Bedrock(us) | $2.20 | $11.00 | $0.22 | ||
Cerebras | $0.99 | $1.49 | — | ||
NovitaAI | $0.13 | $0.38 | — | ||
DeepInfra | $0.13 | $0.38 | — | ||
Together AI | $0.13 | $0.38 | — | ||
Alibaba Cloud | $0.40 | $1.60 | $0.08 | ||
Alibaba Cloud(singapore) | $0.40 | $1.60 | $0.08 | ||
Alibaba Cloud | $2.50 | $7.50 | $0.50 | ||
Granite | $2.50$1.25 -50% off | $7.50$3.75 -50% off | $0.50$0.25 -50% off | ||
Alibaba Cloud(singapore) | $2.50 | $7.50 | $0.50 | ||
Alibaba Cloud(cn-beijing) | $1.72 | $5.17 | $0.34 | ||
NovitaAI | $1.25 | $3.75 | $0.13 | ||
Alibaba Cloud(singapore) | $0.20 | $0.40 | $0.04 | ||
DeepSeek | $0.14 | $0.28 | $0.00 | ||
DeepInfra | $0.14 | $0.28 | $0.03 | ||
NovitaAI | $0.14 | $0.28 | $0.03 | ||
Alibaba Cloud(cn-beijing) | $0.14 | $0.28 | $0.03 | ||
Alibaba Cloud | $0.20 | $0.40 | $0.04 | ||
Tundra | $0.40 | $2.20 | $0.08 | ||
Together AI | $1.20 | $4.50 | $0.20 | ||
CanopyWave | $0.50 | $2.80 | $0.10 | ||
NovitaAI | $0.95 | $4.00 | $0.16 | ||
Moonshot AI | $0.95 | $4.00 | $0.16 | ||
OpenAI | $0.75 | $4.50 | $0.07 | ||
Azure | $0.75 | $4.50 | $0.07 | ||
Azure | $2.50 | $15.00 | $0.25 | ||
OpenAI | $2.50 | $15.00 | $0.25 | ||
Mistral AI | $0.50 | $1.50 | — | ||
Quartz | $2.00 | $12.00 | $0.20 | ||
Google AI Studio | $2.00 | $12.00 | $0.20 | ||
Google Vertex AI | $2.00 | $12.00 | $0.20 | ||
ByteDance | $0.25 | $2.00 | $0.05 | ||
AWS Bedrock(apac) | $1.00 | $5.00 | $0.10 | ||
AWS Bedrock(global) | $1.00 | $5.00 | $0.10 | ||
AWS Bedrock(jp) | $1.10 | $5.50 | $0.11 | ||
Vertex AI (Anthropic) | $1.00 | $5.00 | $0.10 | ||
AWS Bedrock(us) | $1.10 | $5.50 | $0.11 | ||
AWS Bedrock(au) | $1.10 | $5.50 | $0.11 | ||
AWS Bedrock(eu) | $1.10 | $5.50 | $0.11 | ||
Anthropic | $1.00 | $5.00 | $0.10 | ||
AWS Bedrock | $1.00 | $5.00 | $0.10 | ||
Google Vertex AI | $0.10 | $0.40 | $0.01 | ||
Google AI Studio | $0.10 | $0.40 | $0.01 | ||
Nebius AI | $0.20 | $0.60 | — | ||
Vertex AI (OpenAI-compatible) | $0.22 | $0.88 | — | ||
Cerebras | $0.60 | $1.20 | — | ||
NovitaAI | $0.09 | $0.58 | — |
Modern LLMs now rival dedicated translation engines for most language pairs — and beat them on context awareness, tone, terminology consistency, and formatting. The strongest multilingual models are Google's Gemini line, OpenAI's GPT-5.4, Anthropic's Claude, and Alibaba's Qwen, which is particularly strong on Chinese and other Asian languages.
Long context windows also change how translation work gets done: instead of translating strings in isolation, you can put an entire document plus a glossary into one prompt and keep terminology consistent throughout. For bulk workloads, budget models like Gemini Flash-Lite and DeepSeek V4 Flash bring the cost per translated word down to fractions of a cent.
Frequently asked questions
What is the best LLM for translation?
Gemini 3.1 Pro and GPT-5.4 deliver the most consistent quality across a broad set of language pairs. Qwen3.7 Max is a top pick for Chinese, Japanese, and Korean, and Claude Sonnet 5 excels when tone and nuance matter. For bulk work, Gemini 2.5 Flash-Lite and DeepSeek V4 Flash offer the best cost per word.
Are LLMs better than Google Translate or DeepL?
For most content, yes — LLMs follow style guides, preserve formatting and placeholders, keep terminology consistent across a document, and adapt register on request. Dedicated engines still win on raw speed and per-character price for very simple, high-volume strings.
How do I translate long documents?
Use a long-context model and send the whole document in one call: a million-token window fits roughly 750,000 words, and single-call translation keeps names and terminology consistent. If a document exceeds the window, chunk it and include a running glossary in each prompt.
Which models handle low-resource languages best?
Coverage drops for languages with little training data. Gemini Pro and GPT-5.4 generally hold up best, but always test with your actual language pair before committing volume — with one API key you can run the same text through several models in minutes and compare.