List: Phi Series | Curated by Ritvik Rastogi

Dec 24, 2024

5 stories

1 save

Phi Series

A 14B language model prioritizing data quality through a training process incorporating synthetic data for pretraining and midtraining, curated organic data seeds, and innovative post-training techniques like pivotal token search for DPO, resulting in strong performance on reasoning-focused benchmarks, especially in STEM, comparable to much larger models, while also addressing overfitting and data contamination concerns.

Ritvik Rastogi

Papers Explained 278: Phi-4

Phi-4 is a 14B parameter model that advances the performance of small language models by introducing innovative synthetic data generation…

Dec 24, 2024

Dec 24, 2024

A family of models consisting of three variants - MoE (16x3.8B), mini (3.8B), and vision (4.2B) - which are lightweight, multilingual, and trained on synthetic and filtered publicly available documents - with a focus on very high-quality, reasoning dense data.

Ritvik Rastogi

Papers Explained 192: Phi-3.5

Phi-3.5 is a family of lightweight, state-of-the-art open models built upon datasets used for Phi-3 — synthetic data and filtered publicly…

Aug 23, 2024

Aug 23, 2024

A series of language models trained on heavily filtered web and synthetic data set, achieving performance comparable to much larger models like Mixtral 8x7B and GPT-3.5.

Ritvik Rastogi

Papers Explained 130: Phi-3

phi-3-mini is a 3.8B language model trained on 3.3T tokens data which is a scaled-up version of the one used for phi-2, composed of heavily…

Apr 29, 2024

Apr 29, 2024

Follows the phi-1 approach, focusing this time on common sense reasoning in natural language.

Ritvik Rastogi

Papers Explained 115: Phi-1.5

Phi-1.5 follows the phi-1 approach, focusing this time on common sense reasoning in natural language, and creating a new 1.3 billion…

Mar 20, 2024

Mar 20, 2024

An LLM for code, trained using a textbook quality data from the web and synthetically generated textbooks and exercises with GPT-3.5.

Ritvik Rastogi

Papers Explained 114: Phi-1

Phi-1 is a transformer based 1.3B LLM for code, trained using a selection of “textbook quality” data from the web (6B tokens) and…

Mar 18, 2024

Mar 18, 2024