Groq
DirectFast inference on LPU hardware. Best-in-class throughput for open-weight models including Llama and Mistral.
Visit Groq →Provider Info
"API Env Var"
GROQ_API_KEY
"Rate Limits"
30 RPM, 14.4K req/day
"Available Models (7)"
Llama 3.3 70B
Llama 4 Scout
Llama 4 Maverick
Kimi K2
Mixtral 8x7B
Qwen 2.5 72B
Gemma 2 9B
Subscription Plans
Pro
$75.00/mo
600 RPM, Unlimited requests
Priority inference · Higher rate limits · Longer context
Sign up