Papers Explained 186: Grok

Ritvik Rastogi
3 min readAug 14, 2024

--

Grok is a 314B Mixture-of-Experts model, with 25% of the weights active on a given token, modeled after the Hitchhiker’s Guide to the Galaxy, hence designed to answer questions with a bit of wit and has a rebellious streak. It will also answer spicy questions that are rejected by most other AI systems . It has real-time knowledge of the world via the 𝕏 platform.

Grok-1 displayed strong results, surpassing all other models in its compute class, including ChatGPT-3.5.

Grok 1.5

Grok-1.5 is an advancement over grok, capable of long context understanding up to 128k tokens and advanced reasoning.

Grok-1.5 can handle longer and more complex prompts, while still maintaining its instruction-following capability, In the Needle In A Haystack (NIAH) evaluation, Grok-1.5 achieved powerful perfect retrieval results for embedded text within contexts of up to 128K tokens.

Grok 1.5 V

Grok-1.5V, is the first multimodal model in the grok series. In addition to its strong text capabilities, Grok 1.5V can process a wide variety of visual information, including documents, diagrams, charts, screenshots, and photographs.

Grok 2 and Grok 2 Mini

Grok-2 is a frontier language model with state-of-the-art capabilities in chat, coding, and reasoning on par with Claude 3.5 Sonnet and GPT-4-Turbo. Grok-2 mini is a small but capable sibling of Grok-2.

On the lmsys arena Grok-2 outperforms both Claude 3.5 Sonnet and GPT-4-Turbo.

Both Grok-2 and Grok-2 mini demonstrate significant improvements over the previous Grok-1.5 model.

  • * GPT-4-Turbo and GPT-4o scores are from the May 2024 release.
  • † Claude 3 Opus and Claude 3.5 Sonnet scores are from the June 2024 release.
  • ‡ Grok-2 MMLU, MMLU-Pro, MMMU and MathVista were evaluated using 0-shot CoT.
  • § For MATH, maj@1 results are presented.
  • ¶ For HumanEval, pass@1 benchmark scores are reported.

Hungry for more insights?

Don’t miss out on exploring other fascinating threads in this series. Simply click here and uncover the state-of-the-art research!

Do Subscribe for weekly updates!!

--

--