Meta 高效新模型 Llama 4 劍指競爭對手，2 兆參數「巨獸」即將登陸

Meta 發表新一代多模態模型 Llama 4 系列，並導入 WhatsApp、Messenger、Instagram 及網頁版的 Meta AI 助理。

已有 2 款新模型可從 llama.com 或 Hugging Face 下載，分別是 Llama 4 Scout（意即偵察兵）、Llama 4 Maverick（意即獨行俠），前者一款可在單一 NVIDIA H100 GPU 運行的小型模型，後者定位則類似於 OpenAI GPT-4o 與 Google Gemini 2.0 Flash。Meta 更表示，目前正在訓練 Llama 4 Behemoth（意即巨獸），被 Meta 執行長祖克柏（Mark Zuckerberg）稱為「全球性能最強的基礎模型」。

Llama 4 Scout 擁有 170 億有效參數和 4,000 億總參數，具有多達 1,000 萬詞元（token）上下文長度，在多項基準測試超越 Google 的 Gemma 3、Gemini 2.0 Flash-Lite 及 Mistral 3.1，並且可在單一 NVIDIA H100 GPU 運行。規模較大的 Llama 4 Maverick 性能上則媲美 GPT-4o 和 Gemini 2.0 Flash，且在程式設計與推理任務中，使用的有效參數不到一半，表現與 DeepSeek-V3 相當，可在單一 H100 DGX 主機上運行便於部署。

至於 Llama 4 Behemoth 將擁有 2,880 億有效參數，總參數達 2 兆。雖然這款模型尚未正式推出，但 Meta 表示，它在多項 STEM 基準測試中，將能超越競爭對手如 GPT-4.5、Claude 3.7 Sonnet 及 Gemini 2.0 Pro。

Meta 強調 Llama 4 採用 MoE（Mixture of Experts Models，混合專家模型）架構，在訓練和推理方面具有更高的運算效率。Meta 計劃在 4 月 29 日舉行的 LlamaCon 開發者大會，進一步探討其 AI 模型和產品的未來計畫。

Introducing our first set of Llama 4 models!

We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4… pic.twitter.com/gmXgDw09qN

— Ahmad Al-Dahle (@Ahmad_Al_Dahle) April 5, 2025

▲ Meta 副總裁暨 GenAI 負責人 Ahmad Al-Dahle 介紹 Llama 4。

值得一提的是，Meta 標榜 Llama 4 系列為「開源」模型，然而 Llama 長期以來因其授權限制存在爭議。比方說，Llama 4 授權規定顯示，每月活躍用戶超過 7 億的商業實體在使用 Llama 4 之前必須取得 Meta 授權許可。對此開放原始碼倡議組織（Open Source Initiative，OSI）在 2023 年即表示，Llama 不屬於「開源」的範疇。

隨著來自中國的 DeepSeek 在今年初向全球開源推理模型 DeepSeek-R1 震撼業界，整個 AI 產業格局發生了變化，同樣打著「開源」大旗的 Meta 備感威脅，如今終於以 Llama 4 做為回應。

（首圖來源：Meta）