Llama is the open-source language-model family from Meta AI, in 2026 the foundation of most open-source AI ecosystems (Hugging Face, Together, Groq, Cerebras). Models are downloadable for free under a license that permits commercial use up to 700M MAU.

The Llama 4 lineup (early 2026) covers Scout (fast, 17B active), Maverick (400B parameters, multimodal), and Behemoth (2T parameters in preview). All models natively support multimodal (text + image) and a 10M-token context.

Llama is used self-hosted (Ollama, vLLM) for data sovereignty, or via Groq/Together hosts for ultra-fast inference (1000+ tokens/s).

Llama

Features

Key features

Pricing

Categories

Professions

Platforms

Social Links