| Llama Guard 3 8B | $0.020/M input, $0.060/M output | Safety model, 131,072 token context | Meta-llama API Pricing 2026 |
| Llama 3.2 3B Instruct | $0.020/M input, $0.020/M output | Lightweight model, 131,072 token context, 34.7 MMLU score | Meta-llama API Pricing 2026 |
| Llama 3.1 8B Instruct | $0.020/M input, $0.050/M output | 8B parameter model, 16,384 token context, 47.6 MMLU score | Meta-llama API Pricing 2026 |
| Llama 3.2 11B Vision Instruct | $0.049/M input, $0.049/M output | Multimodal model with vision capabilities, 131,072 token context | Meta-llama API Pricing 2026 |
| Llama 3.3 70B Instruct | $0.100/M input, $0.320/M output | 70B parameter model, 131,072 token context, 71.3 MMLU score | Meta-llama API Pricing 2026 |
| Llama 4 Scout | $0.080/M input, $0.300/M output | Llama 4 series model, 327,680 token context, 75.2 MMLU score | Meta-llama API Pricing 2026 |
| Llama 4 Maverick | $0.150/M input, $0.600/M output | Flagship model, 1,048,576 token context, 80.9 MMLU score | Meta-llama API Pricing 2026 |
| Llama 3.1 70B Instruct | $0.400/M input, $0.400/M output | Previous generation 70B, 131,072 token context, 67.6 MMLU score | Meta-llama API Pricing 2026 |
| Llama 3.1 405B Instruct | $3.500/M input, $3.500/M output | Largest model, 405B parameters, 10,000 token context, 73.2 MMLU score | Meta-llama API Pricing 2026 |
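
Because the prices above are quoted per million tokens, the cost of a single request is easier to see with a small calculation. The following is a minimal sketch, not an official pricing tool: the `estimate_cost` helper and the `PRICES_PER_MILLION` dictionary are hypothetical names introduced here for illustration, and the figures are copied from a few rows of the table, so they should be checked against current provider pricing before use.

```python
# Prices in USD per million tokens as (input, output), copied from the table above.
# The dictionary name and the choice of rows are illustrative assumptions.
PRICES_PER_MILLION = {
    "Llama 3.1 8B Instruct": (0.020, 0.050),
    "Llama 3.3 70B Instruct": (0.10, 0.32),
    "Llama 4 Maverick": (0.150, 0.600),
    "Llama 3.1 405B Instruct": (3.50, 3.50),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Input and output tokens are billed at separate per-million rates,
    so each count is multiplied by its own price before dividing by 1e6.
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Compare the same request (2,000 prompt tokens, 500 completion tokens)
    # across the listed models.
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, input_tokens=2_000, output_tokens=500)
        print(f"{name}: ${cost:.6f}")
```

Keeping input and output rates separate matters for models where the two differ, such as Llama 4 Maverick, since an output-heavy workload costs noticeably more than the input price alone suggests.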