L

Llama

Meta AI's open-source model family: Llama 4 multimodal, free, self-hostable.

Ease of Use

Llama is the open-source language-model family from Meta AI, in 2026 the foundation of most open-source AI ecosystems (Hugging Face, Together, Groq, Cerebras). Models are downloadable for free under a license that permits commercial use up to 700M MAU.

The Llama 4 lineup (early 2026) covers Scout (fast, 17B active), Maverick (400B parameters, multimodal), and Behemoth (2T parameters in preview). All models natively support multimodal (text + image) and a 10M-token context.

Llama is used self-hosted (Ollama, vLLM) for data sovereignty, or via Groq/Together hosts for ultra-fast inference (1000+ tokens/s).

Features

Key features

  • Open weights under Meta's license: commercial use OK.
  • Multi-size: 8B (laptop), 70B (workstation), 405B (cluster).
  • Native multimodal (Llama 4): text + image.
  • 10M-token context on 4.x models.
  • Easy self-hosting via Ollama, vLLM, llama.cpp.
  • Ultra-fast hosts: Groq, Cerebras, Together.
Visit Site

Pricing

Basic Plan0/month

Platforms

apiopen-source

Social Links