Side-by-side comparison of Ling-2.6 Flash (inclusionAI · China) and Llama 4 Maverick (Meta · USA) for self-hosted deployment of the open-weight model. Ling-2.6 Flash is rated conditional; Llama 4 Maverick is conditional. They part ways on licence: Ling-2.6 Flash is "MIT", Llama 4 Maverick is "Llama community".
| Field | ||
|---|---|---|
| Summary | ||
| Verdict | Conditional Per the published model card, Ling-2.6 Flash is an MIT-licensed 104B / 7.4B-active MoE built on a hybrid Lightning-Linear + MLA attention design, positioned for agentic and tool-use workflows. Permissive weights are deployable in EU infrastructure; the headline risks for regulated buyers are vendor jurisdiction (Ant Group's inclusionAI lab, headquartered in China) and the absence of any training-data disclosure in the model card. | Conditional Llama 4 flagship: MoE with 17B active over 128 experts, natively multimodal (text + images). Same Llama community licence as the family: 700M MAU cap, acceptable-use policy, 'Built with Llama' attribution. |
| Last reviewed | 2026-05-03 | 2026-04-15 |
| Open-weight | ||
| Licence | MIT | Llama community |
| Commercial use | Unrestricted | With caps |
| Training data | Undisclosed | Undisclosed |
| Origin | China | USA |
| Performance & pricing? | ||
| Quality index | — | 18/100 |
| Speed | — | 116 tok/s |
| Blended price | — | $0.50/M |
| Context window | — | — |
| Evidence | ||
| Sources | ||
No overlapping sources between the two entries.