
Model Garden
Browse models across providers with detailed specs, pricing, and performance signals.
Models
aws
chat
apac
anthropic.claude-3-sonnet-20240229-v1:0 (APAC)
Input
$
3
/M tokens
Output
$
15
/M tokens
cerebras
chat
us
cerebras/qwen-3-235b-a22b-instruct-2507
Input
$
0.6
/M tokens
Output
$
1.2
/M tokens
contextualai
rerank
us
ctxl-rerank-v2-instruct-multilingual
Input
$
0
/M tokens
Output
$
0
/M tokens
contextualai
rerank
us
ctxl-rerank-v2-instruct-multilingual-mini
Input
$
0
/M tokens
Output
$
0
/M tokens
google-ai
chat
us
gemini-2.5-flash-lite-preview-09-2025
Input
$
0.1
/M tokens
Output
$
0.4
/M tokens
google-ai
realtime
us
gemini-live-2.5-flash-preview-native-audio
Input
$
0.5
/M tokens
Output
$
2
/M tokens
aws
chat
us
global.anthropic.claude-sonnet-4-5-20250929-v1:0
Input
$
3
/M tokens
Output
$
15
/M tokens
tensorix
chat
europe
meta-llama/llama-3.3-70b-instruct
Input
$
0.104
/M tokens
Output
$
0.312
/M tokens
togetherai
chat
us
meta-llama/Llama-3.3-70B-Instruct-Turbo
Input
$
0.88
/M tokens
Output
$
0.88
/M tokens
togetherai
chat
us
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Input
$
0.27
/M tokens
Output
$
0.85
/M tokens
groq
chat
us
meta-llama/llama-4-scout-17b-16e-instruct
Input
$
0.11
/M tokens
Output
$
0.34
/M tokens
togetherai
chat
us
meta-llama/Llama-4-Scout-17B-16E-Instruct
Input
$
0.18
/M tokens
Output
$
0.59
/M tokens
scaleway
chat
europe
mistral-small-3.2-24b-instruct-2506
Input
$
0.15
/M tokens
Output
$
0.35
/M tokens
google-ai
image
rest
Nano Banana 2 (gemini-3.1-flash-image-preview)
Input
$
0.5
/M tokens
Output
$
60
/M tokens
inceptron
chat
europe
nvidia/llama-3.3-70b-instruct-fp8
Input
$
0.12
/M tokens
Output
$
0.38
/M tokens
tensorix
chat
europe
nvidia/nemotron-3-super-120b-a12b
Input
$
0.04
/M tokens
Output
$
0.2
/M tokens
azure
chat
europe
orquesta-demos-OrquestaGPT35Turbo16K
Input
$
0.4
/M tokens
Output
$
1.6
/M tokens
chat
rest
qwen/qwen3-235b-a22b-instruct-2507-maas
Input
$
0.22
/M tokens
Output
$
0.88
/M tokens
tensorix
chat
europe
qwen/qwen3-coder-30b-a3b-instruct
Input
$
0.06
/M tokens
Output
$
0.25
/M tokens
